Duplicate words #811
Comments
Then you are posting in the wrong repo.
It can't cause any issue, as it's just an srt/vtt writing setting, and it has no effect in your example because the output there is json.
I'm transcribing a relatively long video and I often get a bunch of duplicated words for the same timestamp, e.g.:
I use the distil-large-v2 model with the faster-whisper standalone executable. Here are the arguments I'm passing into faster-whisper.
I saw a relevant discussion, but the fix proposed there did not resolve the issue for me: #716
I made sure I'm on the latest version as of today. I also tried playing around with the beam_size setting, but it had no effect other than slower transcription. I need the one_word setting; it might be causing the issue, but I haven't tested that yet (I might test it later). The video I'm testing with is this one: https://www.youtube.com/watch?v=q3xN1iYeTNI (downloaded with youtube-dl)
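To make the symptom easier to check, here is a minimal sketch of how the duplicates could be flagged in a word-level transcript. It assumes each word is a dict with "word", "start" and "end" keys. That layout is an assumption for illustration and is not confirmed to match the JSON the standalone executable writes. The function name find_duplicate_words is hypothetical as well.

```python
from typing import Dict, List, Tuple


def find_duplicate_words(words: List[Dict]) -> List[Tuple[int, str]]:
    """Return (index, word) pairs where a word repeats the previous
    word's text and carries identical start/end timestamps.

    The {"word", "start", "end"} layout is assumed for illustration only.
    """
    duplicates = []
    for i in range(1, len(words)):
        prev, cur = words[i - 1], words[i]
        # Compare text case-insensitively and ignore surrounding spaces.
        same_text = prev["word"].strip().lower() == cur["word"].strip().lower()
        same_time = prev["start"] == cur["start"] and prev["end"] == cur["end"]
        if same_text and same_time:
            duplicates.append((i, cur["word"].strip()))
    return duplicates


# Made-up sample data, not taken from the video above.
sample = [
    {"word": " Hello", "start": 0.0, "end": 0.4},
    {"word": " world", "start": 0.4, "end": 0.9},
    {"word": " world", "start": 0.4, "end": 0.9},
    {"word": " again", "start": 0.9, "end": 1.3},
]

print(find_duplicate_words(sample))
```

Running this over the real output with and without the one_word setting would be one way to see whether that setting changes how many duplicates appear.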