Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Phoneme-level pronunciation control #94

Open
danablend opened this issue Mar 14, 2024 · 2 comments
Open

Phoneme-level pronunciation control #94

danablend opened this issue Mar 14, 2024 · 2 comments
Labels
feature request New feature or request

Comments

@danablend
Copy link

Hey! I understand that the text tokens are currently encoded on the character-level and the model is trained with these tokens.

What would be the process for getting phoneme level control over the output audio to correct pronunciations for exotic words or different accents during runtime? One could maybe fine tune models for this, but getting the phoneme level control on the input side would be great.

This would be an amazing add. Would be happy to contribute.

@vatsalaggarwal
Copy link
Contributor

Where have you had these kinds of issues? Are you able to share examples?

@StephennFernandes
Copy link

@danablend

I get what you are trying to express. you should try dict TTS

It used a dictionary of pronounciation of exotic words with could of sentence as context. It's a context aware exotic word pronounciation model.

@lucapericlp lucapericlp added the feature request New feature or request label May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants