Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing Feature! #58

Closed
ehabtawfikelbwab opened this issue Oct 19, 2023 · 6 comments
Closed

Missing Feature! #58

ehabtawfikelbwab opened this issue Oct 19, 2023 · 6 comments

Comments

@ehabtawfikelbwab
Copy link

Hello there,

I need to give the model a hint by couple of words to get better result .
How i can do that ?

Also i need an small or tiny Arabic model for mobile.
Can you help me please ?

Thank you!

@Macoron
Copy link
Owner

Macoron commented Oct 19, 2023

I need to give the model a hint by couple of words to get better result .
How i can do that ?

Check Initial Prompt field in WhisperManager. You can try to put words there, separated by commas. It doesn't always work, so you might want to experiment with different models weights and prompt varaints.

Also i need an small or tiny Arabic model for mobile.

tiny.bin and small.bin are multilingual. They should support Arabic. Try to force Arabic language by changing Language field in WhisperManager to "ar".

I don't know if there are any distilled networks fine-tuned for Arabic. OpenAI only provided English fine-tunes.

@ehabtawfikelbwab
Copy link
Author

What do you mean by Initial Prompt doesn't always work?
do you mean it is up to model can support that or not ?
and if there a model support Initial Prompt it will always work or what ?

also can you give me link of models please

@Macoron
Copy link
Owner

Macoron commented Oct 20, 2023

What do you mean by Initial Prompt doesn't always work?
do you mean it is up to model can support that or not ?

All models support Initial Prompt, however it's not always work as intended. For instance, if you type something like "LLAMA" in your prompt, whisper still can transcribe it as "lama".

You would need to experiment to find right prompt which works with your model.

also can you give me link of models please

Check repo readme. It should have all models there.

@ehabtawfikelbwab
Copy link
Author

All models support Initial Prompt, however it's not always work as intended. For instance, if you type something like "LLAMA" in your prompt, whisper still can transcribe it as "lama".<

This is weird, Is there any update can fix that ? because I'm using custom names like "Ehab, Ihab" / "Mahmoud, Mahmud" etc.
so the transcribe will be like a correction and that not good because I'm using this transcribe result later to check if exist or not (bool)

Check repo readme. It should have all models there.<

Thank you very much.

@Macoron
Copy link
Owner

Macoron commented Oct 20, 2023

This is weird, Is there any update can fix that ?

Your examples should work fine with a bigger networks. If you need to check names, you can try to use all names variations or do commands classification, like here #52.

I don't know any other solution except retraining network for your specific domain. You can try to ask in whisper.cpp or OpenAI repository. Maybe they have some ideas.

@ehabtawfikelbwab
Copy link
Author

Okay, Thank you!
feel free to close this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants