textvqa

Here are 3 public repositories matching this topic...

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Official code for the paper "Spatially Aware Multimodal Transformers for TextVQA", published at ECCV 2020.

PyTorch DataLoader for many VQA datasets
