Code for ALBEF: a new vision-language pre-training method
Updated Sep 20, 2022 - Python
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
The largest multilingual image-text classification dataset. It contains fashion products.
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
Data release for the ImageInWords (IIW) paper.
A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
Write text on images with PHP
Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.
The first public Vietnamese visual linguistic foundation model(s)
Some Python scripts to load Vietnamese visual linguistic data
Caption generator using LAVIS and argostranslate
WWDC22: Enabling Live Text interactions with images in SwiftUI
11000-Image-Video-caption-data-of-human-action
Character recognition system using a CNN and Streamlit