#

video-question-answering

Here are 43 public repositories matching this topic...

mlvlab / OVQA

Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 2023)

multi-modal visual-question-answering video-question-answering iccv2023

Updated Apr 23, 2024
Python

nicolas-dufour / video-question-answering

Given a video, we are able to automaticaly answer questions about what is happening in the video.

nlp computer-vision video-question-answering

Updated Jul 4, 2021
Jupyter Notebook

whwu95 / FreeVA

FreeVA: Offline MLLM as Training-Free Video Assistant

chatbot video-understanding zero-shot-video-captioning video-question-answering chatgpt vision-language-model llava training-free multimodal-large-language-models

Updated May 22, 2024
Python

Abdelrhman-Yasser / multimedia_question_answering

A simple attention deep learning model to answer questions about a given video with the most relevant video intervals as answers.

python nlp deep-learning tensorflow python3 video-question-answering

Updated Jul 6, 2019
Python

lyuchenyang / Efficient-VideoQA

Code for ACL SustaiNLP 2023 paper "Is a Video worth n × n Images? A Highly Efficient Approach to Transformer-based Video Question Answering"

machine-learning natural-language-processing deep-learning artificial-intelligence video-question-answering multi-modal-learning

Updated Jul 4, 2023
Python

lyuchenyang / Semantic-aware-VideoQA

Code for ACL SRW 2023 paepr "Semantic-aware Dynamic Retrospective-Prospective Reasoning for Event-level Video Question Answering"

machine-learning natural-language-processing deep-learning artificial-intelligence video-question-answering multi-modal-learning

Updated Jul 4, 2023
Python

mmazab / LifeQA

Data and PyTorch code for the LifeQA LREC 2020 paper.

nlp machine-learning natural-language-processing youtube research computer-vision deep-learning pytorch dataset videos question-answering real-life videoqa video-question-answering lrec2020 lrec lifeqa

Updated May 24, 2021
Python

jena-shreyas / Efficient-VidQA

Part of my work for my Bachelor's Thesis Project on Counterfactual Reasoning for Videos.

deep-learning multimodal-deep-learning scene-understanding video-question-answering

Updated Oct 4, 2023
Python

zchoi / PKOL

[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”

pytorch pytorch-implementation video-retrieval vision-language video-question-answering

Updated Jan 27, 2024
Python

MichiganNLP / lifeqa

LifeQA website code

nlp machine-learning natural-language-processing youtube research computer-vision deep-learning pytorch dataset videos question-answering real-life videoqa video-question-answering lrec2020 lrec lifeqa

Updated Feb 3, 2023
HTML

engindeniz / DialogSummary-VideoQA

[ICCV 2021] On the hidden treasure of dialog in video question answering

language-models video-understanding vision-language video-question-answering knowledge-base-videoqa

Updated Mar 30, 2022
Python

MichiganNLP / wildqa

WildQA website code

nlp machine-learning youtube research computer-vision deep-learning pytorch dataset videos question-answering in-the-wild coling videoqa video-question-answering natual-language-processing coling2022 wildqa

Updated May 10, 2023
HTML

declare-lab / Sealing

[NAACL 2024] Official Implementation of paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image--Text Models"

multimodality video-understanding video-question-answering visual-language-models naacl2024

Updated Apr 26, 2024
Python

doc-doc / NExT-GQA

Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)

videoqa video-grounding video-question-answering video-language-understanding trustworthy-vqa visual-evidence-grounding

Updated May 3, 2024
Python

gzcsudo / MSPAN-VideoQA

Multi-Scale Progressive Attention Network for Video Question Answering

visual-question-answering video-question-answering acl2021

Updated Jan 11, 2023
Python

doc-doc / CoVGT

Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)

videoqa video-question-answering contrastive-learning dynamic-visual-graph video-language-understanding

Updated Mar 9, 2024
Python

bcmi / Causal-VidQA

[CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The code used in our paper "From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering", CVPR2022.

commonsense-reasoning video-question-answering evidence-reason visual-understanding video-question-answering-dataset

Updated Aug 22, 2022
Python

tsujuifu / pytorch_empirical-mvm

A PyTorch implementation of EmpiricalMVM

pytorch video-captioning vision-and-language pre-training video-retrieval video-question-answering cvpr2023

Updated Dec 18, 2023
Python

bytedance / Shot2Story

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

benchmark video-summarization dataset video-captioning video-story vision-language video-question-answering video-language large-language-models video-language-pretraining video-story-generation

Updated May 24, 2024
Python

XLiu443 / Tem-adapter

[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

video-understanding video-question-answering clip-model

Updated Oct 18, 2023
Python

Improve this page

Add a description, image, and links to the video-question-answering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the video-question-answering topic, visit your repo's landing page and select "manage topics."