[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
-
Updated
May 27, 2024 - Python
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Video Foundation Models & Data for Multimodal Understanding
[ICCV 2021] A new codebase containing various methods for Group Activity Recognition. Paper title: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.
A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos (Talk Invited by DeepMind).
The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
FreeVA: Offline MLLM as Training-Free Video Assistant
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)
Official code for MiniGPT4-video
Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Code release for "Training a Large Video Model on a Single Machine in a Day"
(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentation"
[IJCNN 2024] Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
[NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking
VTC: Improving Video-Text Retrieval with User Comments
Add a description, image, and links to the video-understanding topic page so that developers can more easily learn about it.
To associate your repository with the video-understanding topic, visit your repo's landing page and select "manage topics."