Timing of subtitles is way off if I limit max_line_count and max_line_width==bad things? #807

adhocinterest · 2024-05-18T20:36:37Z

This is the command I used

whisperx ~/Downloads/Telegram\ Desktop/purpose\ of\ govt.mp3 --max_line_count 7 --max_line_width 11 --model small 
--batch_size 1 --compute_type float32

The transcription timing is way off. The first couple sections have the exact same start and end and the the next few have text in areas where there is silence or the words are not part of that section.

I can supply the audio if necessary. Is this part of the bad things can happen because of the float32 due to my hardware? Or is there a deeper issue?

The text was updated successfully, but these errors were encountered:

adhocinterest · 2024-05-19T15:15:12Z

As a side note, I can't tell for sure yet, but the JSON file seems to be accurate. I've got to put it in my editor to validate that's the case, but at first glance it definitely doesn't have the problems the set and vtt files have.

nikola1975 · 2024-05-21T08:39:17Z

I did not notice a problem in the scope you are mentioning. Have you confirmed the problem with various files?

From my experience, WhisperX subtitles are more precise than OpenAI Whisper's API implementation.

adhocinterest · 2024-05-21T20:49:26Z

I just tried it with a different file and it didn't seem repeat, but it did repeatedly on the previous file. My purpose has changed somewhat and the file format will not fit my needs any longer. I instead will be creating what I need by parsing the json into what I need. Thank you. I'll go ahead and close the issue with this comment as I don't plan on assisting in troubleshooting anymore.

adhocinterest changed the title ~~Timing of subtitles is way off if I limit max_line_count and max_line_lidth==bad things?~~ Timing of subtitles is way off if I limit max_line_count and max_line_width==bad things? May 18, 2024

adhocinterest closed this as completed May 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Timing of subtitles is way off if I limit max_line_count and max_line_width==bad things? #807

Timing of subtitles is way off if I limit max_line_count and max_line_width==bad things? #807

adhocinterest commented May 18, 2024

adhocinterest commented May 19, 2024

nikola1975 commented May 21, 2024

adhocinterest commented May 21, 2024

Timing of subtitles is way off if I limit max_line_count and max_line_width==bad things? #807

Timing of subtitles is way off if I limit max_line_count and max_line_width==bad things? #807

Comments

adhocinterest commented May 18, 2024

adhocinterest commented May 19, 2024

nikola1975 commented May 21, 2024

adhocinterest commented May 21, 2024