🔍 Shotluck Holmes: A family of small-scale LLVMs for shot-level video understanding (Python, updated May 25, 2024)
A new multi-shot video understanding benchmark, Shot2Story, with comprehensive video summaries and detailed shot-level captions.
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
[ICCV 2023] Accurate and Fast Compressed Video Captioning
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Convert SRT-formatted subtitles to WebVTT on the fly in an HTML5/browser environment
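The core of an SRT-to-WebVTT conversion is small: WebVTT requires a `WEBVTT` header and uses `.` rather than `,` as the decimal separator in cue timestamps. The sketch below is a minimal illustration of that transformation, not the linked project's actual implementation (which runs in the browser).

```python
import re

def srt_to_vtt(srt_text: str) -> str:
    """Convert SRT subtitle text to WebVTT.

    Illustrative sketch: swaps the comma decimal separator in
    timestamp lines for a period and prepends the WEBVTT header.
    """
    # Only touch timestamps (HH:MM:SS,mmm), not commas in caption text.
    vtt_body = re.sub(r"(\d{2}:\d{2}:\d{2}),(\d{3})", r"\1.\2", srt_text)
    return "WEBVTT\n\n" + vtt_body
```

A fuller converter would also strip SRT cue numbers and handle styling tags, but the timestamp rewrite above covers the essential format difference.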
(TIP) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
Visio Text is a real-time video captioning project that uses AI to generate dynamic text captions for videos.
A PyTorch implementation of EmpiricalMVM
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
(PRCV'2022) CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter
Data collection and automatic labeling for dense video captioning models
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
An encoder-decoder deep learning model (with or without an attention mechanism) that takes an Arabic sign-language video as input and outputs its translation as text.
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model