-
-
Notifications
You must be signed in to change notification settings - Fork 988
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Timing of subtitles is way off if I limit max_line_count and max_line_width==bad things? #807
Comments
As a side note, I can't tell for sure yet, but the JSON file seems to be accurate. I've got to put it in my editor to validate that's the case, but at first glance it definitely doesn't have the problems the set and vtt files have. |
I did not notice a problem in the scope you are mentioning. Have you confirmed the problem with various files? From my experience, WhisperX subtitles are more precise than OpenAI Whisper's API implementation. |
I just tried it with a different file and it didn't seem repeat, but it did repeatedly on the previous file. My purpose has changed somewhat and the file format will not fit my needs any longer. I instead will be creating what I need by parsing the json into what I need. Thank you. I'll go ahead and close the issue with this comment as I don't plan on assisting in troubleshooting anymore. |
This is the command I used
The transcription timing is way off. The first couple sections have the exact same start and end and the the next few have text in areas where there is silence or the words are not part of that section.
I can supply the audio if necessary. Is this part of the bad things can happen because of the float32 due to my hardware? Or is there a deeper issue?
The text was updated successfully, but these errors were encountered: