🐸 TTS roadmap #378

erogol · 2021-03-13T14:14:51Z

These are the main dev plans for 🐸 TTS.

If you want to contribute to 🐸 TTS and don't know where to start you can pick one here and start with our Contribution Guideline. We're also always here to help.

Feel free to pick one or suggest a new one.

Contributions are always welcome 💪 .

v0.1.0 Milestones

v0.2.0 Milestones

Grapheme 2 Phoneme in-house conversion. (Thx to gruut 👍 )
Implement VITS model.

v0.3.0 Milestones

Implement generic ForwardTTS API.
Implement Fast Speech model.
Implement Fast Pitch model.

v0.4.0 Milestones

Trainer API v2 - join the discussion
Multi-speaker VCTK recipes for all the TTS.tts models.

v0.5.0 Milestones

Support for multi-lingual models
YourTTS release 🚀

v0.6.0 Milestones

Add ESpeak support
New Tokenizer and Phonemizer APIs New Tokenizer API #937
New Model API Update models (Rebased) #1078
Splitting the trainer as a separate repo 👟Trainer
Update VITS model API
Gradient accumulation. Accumulate grads ( Larger batch size for low gpu memory) #560 (in 👟)

v0.7.0 Milestones

Implement Capacitron 👑 @a-froghyar 👑 @WeberJulian
Release pretrained Capacitron

v0.8.0 Milestones

Separate numpy transforms
Better data sampling for VITS
New Thorsten DE models 👑 @thorstenMueller

🏃‍♀️ Milestones along the way

🤖 New TTS models

The text was updated successfully, but these errors were encountered:

lucascassiano · 2021-03-22T22:03:43Z

great project! Excited to see this growing!

AndrewBarfield · 2021-04-17T21:12:21Z

I'm learning the code/API and performing experiments. I hope to contribute soon.

I'm also wondering if I can donate (money) to Coqui?

kdavis-coqui · 2021-04-18T08:39:37Z

I'm learning the code/API and performing experiments. I hope to contribute soon.

I'm also wondering if I can donate (money) to Coqui?

Wow! Thanks! Humbling.

We were setting up GitHub sponsors, but the tax implications were onerous.

We're currently exploring Patreon. So stay tuned!

agrinh · 2021-04-26T11:09:35Z

@erogol Thanks for sharing the plans!

Do you have any thoughts (or need help to) simplifying the dependencies a bit? I'm thinking that if TTS is used as a lib installed over pip it might be nice to remove visualisation dependencies only used in notebooks, removing test/dev dependencies and moving e.g. tensorflow into extras to reduce the footprint. Personally would love to use this as a dependency rather than maintaining my own fork.

erogol · 2021-04-26T11:19:38Z

@agrinh Why do you need to keep your own fork exactly? It'd be better to expand the conversation on gitter if you like.

agrinh · 2021-04-26T11:27:36Z

@agrinh Why do you need to keep your own fork exactly? It'd be better to expand the conversation on gitter if you like.

Wow, thanks for the super fast reply. Sure, we can move the discussion to gitter.

Sadam1195 · 2021-05-06T00:27:19Z

Please add DC-TTS to the the list of models.

DC-TTS implementation available with MIT Licence code available here
EFFICIENTLY TRAINABLE TEXT-TO-SPEECH SYSTEM BASED ON DEEP CONVOLUTIONAL NETWORKS WITH GUIDED ATTENTION paper
@erogol

will-rice · 2021-08-20T23:05:57Z

What were you thinking about the "TensorFlow run-time for training models"? Like giving the user the option of using TensorFlow or PyTorch? I wouldn't mind taking a stab at the TensorFlow part.

erogol · 2021-08-23T11:58:49Z

@will-rice the plan is to mirror what we have in torch to TF as much as possible. It'd be great if you initiate the work

lucashueda · 2021-08-30T12:44:05Z

Are you guys planning to develop some expressive TTS architectures? I'm currently studying this topic and planning to implement some of them based on Coqui, part of them just controlling latent space using GST Kwon et al 2020 or RE Sorin et al 2020, and others that actually changes the architecture by adding VAE, normalizing flows and gradient reversal

a-froghyar · 2021-08-30T12:46:08Z

@lucashueda Capacitron VAE: #510

lucashueda · 2021-08-30T13:15:01Z

@lucashueda Capacitron VAE: #510

Oh nice, hope to see Capacitron integrated soon. So maybe, in the future I'll be able to contribute with some others expressive architectures

BillyBobQuebec · 2021-09-18T21:59:35Z

@erogol Look forward to new End-to-End models being implemented, specfically Efficient-TTS! if the paper is accurate, it should blow most 2 stage configurations out of the water, considering it seems to have higher MOS than tacotron2+hifigan, while also seeming to have significantly faster speed than glowtts+fastest vocoder! I have not seen a single repo replicating the EFTS-Wav architecture described in the paper released 10 months ago, it would be amazing to see it in Coqui first!

erogol · 2021-09-18T23:24:19Z

@BillyBobQuebec I don't think I will implement these models anytime soon. But as they stand, contributions are welcome

WeberJulian · 2021-09-18T23:30:13Z

@BillyBobQuebec but you can try VITS which is close to what you're describing :)

BillyBobQuebec · 2021-09-18T23:41:10Z

@BillyBobQuebec but you can try VITS which is close to what you're describing :)

Agreed, I am currently trying VITS actually, I have some issues training with the coqui implementation unfortunately, I've posted the issue about the bug today and hope I can get it resolved.

hemath1001 · 2022-02-02T06:04:45Z

Hi there! Thanks for your great work! I'm looking forward to training YourTTS on other languages. Will training and fine-tuning code of YourTTS be published soon? I would be very grateful if you could tell me an approximate time~ Have a nice day :-D

nfaraji2002 · 2022-12-18T05:55:27Z

Hi
thanks for delightful codes!
I want to use this version of TTS on raspberry pi 4, but I think this version does not support real time processing.
Are there TF utilities provided as in Mozilla TTS to convert trained models to tf-lite?
Can the strategy of quantization work here for real-time processing?
I need some roadmaps in this regard.

Thanks
Neda

jhj0517 · 2023-01-16T10:06:12Z

Thank you for your great work for TTS.

Is there any progress on Let the user pass a custom text cleaner function. ?
If it's possible, I want to pass my own Korean cleaners.

erogol · 2023-01-16T23:12:18Z

You can currently do it by creating your own tokenizer or overloading the class.

stale · 2023-02-17T19:08:08Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

MaxIakovliev · 2023-02-20T01:40:04Z

Marvelous project.
Any ways to donate to core contributors?
I would prefer to use paypal.

stale · 2023-03-22T19:23:54Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

erogol · 2023-03-23T11:12:35Z

@MaxIakovliev you can use https://coqui.ai/ :)

erogol · 2023-03-23T11:14:23Z

This roadmap issue is quite outdated. I'll keep it open to keep the references to some of the issues and models we like to tackle but won't be updating until one day officially becomes 48 hours.

jmlcoliveira · 2023-05-10T16:42:17Z

Any update regarding SSML implementation?

erogol · 2023-05-11T08:48:55Z

We are not working on SSML currently, it is back in the list without a precise timeline.

offside609 · 2023-05-18T03:19:30Z

Please do!!

stale · 2023-06-17T08:14:12Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

violet17 · 2023-09-21T08:28:00Z

Will you support bark-small? Thanks.

csukuangfj · 2023-11-11T05:15:34Z

Any plan to a port of coqui-ai engine for android? TTS on android is very robotic (espeak, rhvoice, festival lite).

@paolo-caroni

Please take a look at
#3194

You can use sherpa-onnx to run VITS models from Coqui on Android and also embedded devices, e.g., raspberry pi.

We have pre-built Android APKs for the VITS English models from Coqui.
https://k2-fsa.github.io/sherpa/onnx/tts/apk.html

DmitryVN · 2023-11-21T20:45:28Z

Fix it plz #3039 #3282
The problem persists and because of this, normal correct use is not possible. Also at the moment it kind of breaks off the phrase at the end of each sentence and it turns out a jerky reading.

MarkChrisE2091 · 2023-12-23T12:51:30Z

Any new update?

csukuangfj · 2023-12-31T14:41:13Z

Any plan to a port of coqui-ai engine for android? TTS on android is very robotic (espeak, rhvoice, festival lite).

@paolo-caroni

We have supported it in k2-fsa/sherpa-onnx#508

The following is a YouTube video
https://www.youtube.com/watch?v=33QYuVzDORA

You can use all coqui-ai/TTS models and piper models listed in
https://github.com/k2-fsa/sherpa-onnx/releases/tag/tts-models
with k2-fsa/sherpa-onnx#508

imevro · 2024-04-29T15:19:43Z

hi guys, why?

upd: found https://twitter.com/_josh_meyer_/status/1742522906041635166

NicoleKai · 2024-04-29T15:34:26Z

Their ability to exist and be profitable was dependent on how much better their tech was compared to everyone else. It may not feel like it, but we are in the middle of an AI singularity. Coqui's business model might have stood a chance if they started with this tech 5 years earlier, but it was probably too little too late. Eleven labs is probably eating their lunch :/

erogol added the TODOs label Mar 13, 2021

erogol changed the title ~~Main Development plans for 🐸 TTS.~~ Main Development plans for 🐸 TTS. Mar 13, 2021

erogol pinned this issue Mar 13, 2021

stale bot added the wontfix This will not be worked on but feel free to help. label Jul 4, 2021

coqui-ai deleted a comment from stale bot Jul 5, 2021

stale bot removed the wontfix This will not be worked on but feel free to help. label Jul 5, 2021

erogol mentioned this issue Aug 30, 2021

[Feature request] [TTS] Support SSML in input text #752

Closed

stale bot added the wontfix This will not be worked on but feel free to help. label Oct 30, 2021

coqui-ai deleted a comment from stale bot Nov 1, 2021

stale bot removed the wontfix This will not be worked on but feel free to help. label Nov 1, 2021

stale bot added the wontfix This will not be worked on but feel free to help. label Dec 30, 2021

coqui-ai deleted a comment from stale bot Jan 1, 2022

stale bot removed the wontfix This will not be worked on but feel free to help. label Jan 1, 2022

erogol removed the wontfix This will not be worked on but feel free to help. label Dec 5, 2022

erogol reopened this Dec 5, 2022

stale bot added the wontfix This will not be worked on but feel free to help. label Feb 17, 2023

stale bot removed the wontfix This will not be worked on but feel free to help. label Feb 20, 2023

stale bot added the wontfix This will not be worked on but feel free to help. label Mar 22, 2023

stale bot removed the wontfix This will not be worked on but feel free to help. label Mar 23, 2023

stale bot added the wontfix This will not be worked on but feel free to help. label Apr 22, 2023

coqui-ai deleted a comment from stale bot Apr 23, 2023

stale bot removed the wontfix This will not be worked on but feel free to help. label Apr 23, 2023

stale bot added the wontfix This will not be worked on but feel free to help. label Jun 17, 2023

stale bot closed this as completed Jun 25, 2023

🐸 TTS roadmap #378

🐸 TTS roadmap #378

Comments

erogol commented Mar 13, 2021 • edited

v0.1.0 Milestones

v0.2.0 Milestones

v0.3.0 Milestones

v0.4.0 Milestones

v0.5.0 Milestones

v0.6.0 Milestones

v0.7.0 Milestones

v0.8.0 Milestones

🏃‍♀️ Milestones along the way

🤖 New TTS models

lucascassiano commented Mar 22, 2021

AndrewBarfield commented Apr 17, 2021

kdavis-coqui commented Apr 18, 2021

agrinh commented Apr 26, 2021

erogol commented Apr 26, 2021

agrinh commented Apr 26, 2021

Sadam1195 commented May 6, 2021

will-rice commented Aug 20, 2021

erogol commented Aug 23, 2021

lucashueda commented Aug 30, 2021

a-froghyar commented Aug 30, 2021

lucashueda commented Aug 30, 2021

BillyBobQuebec commented Sep 18, 2021 • edited

erogol commented Sep 18, 2021

WeberJulian commented Sep 18, 2021

BillyBobQuebec commented Sep 18, 2021

hemath1001 commented Feb 2, 2022

nfaraji2002 commented Dec 18, 2022

jhj0517 commented Jan 16, 2023

erogol commented Jan 16, 2023

stale bot commented Feb 17, 2023

MaxIakovliev commented Feb 20, 2023

stale bot commented Mar 22, 2023

erogol commented Mar 23, 2023

erogol commented Mar 23, 2023

jmlcoliveira commented May 10, 2023

erogol commented May 11, 2023

offside609 commented May 18, 2023

stale bot commented Jun 17, 2023

violet17 commented Sep 21, 2023

csukuangfj commented Nov 11, 2023

DmitryVN commented Nov 21, 2023 • edited

MarkChrisE2091 commented Dec 23, 2023

csukuangfj commented Dec 31, 2023

imevro commented Apr 29, 2024 • edited

NicoleKai commented Apr 29, 2024 • edited

erogol commented Mar 13, 2021 •

edited

BillyBobQuebec commented Sep 18, 2021 •

edited

DmitryVN commented Nov 21, 2023 •

edited

imevro commented Apr 29, 2024 •

edited

NicoleKai commented Apr 29, 2024 •

edited