How did you get the audio for "datasets/srcdata/msrvtt/audios"? #17

wonzin · 2024-04-14T12:42:57Z

The original msrvtt folder structure is the below.

msrvtt
├── annotation
│ ├── MSR_VTT.json
├── high-quality
│ ├── structured-symlinks
│ │ ├── jsfusion_val_caption_idx.pkl
│ │ ├── ... many other files....
├── structured-symlinks
│ ├── jsfusion_val_caption_idx.pkl
│ ├── ... many other files....
├── videos
│ ├── all
│ │ ├── video1.mp4
│ │ ├── ....
│ │ ├── video9999.mp4
│ ├── tmp
│ │ ├──MSRVTT.zip
│ ├── vids
│ │ ├──data
│ │ │ ├── MSRVTT.zip

However, there is no audios for msrvtt.

How did you get the audio?
Is there specific way to extract the audio for example, bitrate, sample rate, audio channel, type of codec.
Any kind of audio file is valid?
"datasets/src/data/msrvtt/videos" == "msrvtt/videos/all" ?

wonzin · 2024-04-15T05:38:45Z

ffmpeg video.mp4 -ac 1 -ar 16000 audio.wav
I use this options to convert into audios. but it still has other error.

04/15/2024 14:39:07 - INFO - main - data_cfg_msrvtt_cap_val_batch_size : 64
04/15/2024 14:39:07 - INFO - main - msrvtt_cap Using clip mean and std.
04/15/2024 14:39:07 - INFO - main - msrvtt_cap transforms crop_flip
04/15/2024 14:39:07 - INFO - main - Create Dataset msrvtt_cap Success
04/15/2024 14:39:07 - INFO - main - loader cap%tvas--msrvtt_cap , ratio 10170 , bs_pergpu 16, n_workers 8

not have audios video6446
not have audios video51
...

DelusionalLogic · 2024-04-27T19:54:11Z

I've used the script in https://github.com/TXH-mercury/VAST/blob/410ca47acf40d4ab098e345b76159df66bc42239/utils/offline_process_data.py to extract the audio.

As for still getting no audio errors. I think some of the videos don't contain any audio, and those errors are expected.

wonzin · 2024-06-03T07:01:57Z

Thank you you save my day :)

wonzin closed this as completed Jun 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How did you get the audio for "datasets/srcdata/msrvtt/audios"? #17

How did you get the audio for "datasets/srcdata/msrvtt/audios"? #17

wonzin commented Apr 14, 2024

wonzin commented Apr 15, 2024 •

edited

DelusionalLogic commented Apr 27, 2024

wonzin commented Jun 3, 2024

How did you get the audio for "datasets/srcdata/msrvtt/audios"? #17

How did you get the audio for "datasets/srcdata/msrvtt/audios"? #17

Comments

wonzin commented Apr 14, 2024

wonzin commented Apr 15, 2024 • edited

DelusionalLogic commented Apr 27, 2024

wonzin commented Jun 3, 2024

wonzin commented Apr 15, 2024 •

edited