Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ASR SpeechT5 training - model predicts same output for different inputs #62

Open
L7uan opened this issue Sep 25, 2023 · 0 comments
Open

Comments

@L7uan
Copy link

L7uan commented Sep 25, 2023

Hi!
I am currently trying to train a SpeechT5forSpeechToText model for an ASR task from scratch. My traing goes quite well most of the time, however when i try to use the model for inference with model.generate(**input) the predicts the same output for different inputs... I'm using the huggingface implementation and I followed every step on how to train the model but I just cant find the error in my code, why my model predicts the same output for every input...
Might this be a general error with the SpeechT5ForSpeechToText implementation on huggingface? Or am I doing anything wrong??
Any fast help would be really appreceated!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant