Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]: NaturalSpeech2 training issue #115

Open
KevinLee1993 opened this issue Jan 22, 2024 · 2 comments
Open

[BUG]: NaturalSpeech2 training issue #115

KevinLee1993 opened this issue Jan 22, 2024 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@KevinLee1993
Copy link

Describe the bug

Thank you so much for sharing this wonderful project. However, I have some problem about the tts ns2 training.
./egs/tts/NaturalSpeech2/README.md suggests us to follow other Amphion TTS recipes for the data processing. But After I finish the features that need to be used in ns2 using fs2 and valle data preprocess script, I find I can not run the training script of ns2 successfully. In ./models/tts/naturalspeech2/ns2_dataset.py, some of the features seems to be obtained by refer to "phones" and "num_frames" in metadata, which is NOT included in the train.txt file.
Is there anything else I can do to run ns2 training successfully. Or should I just wait for the official update of ns2 preprocess as I have seen in other issue.
Can any of the author tell me when would the preprocess script be ready? Looking forward for your reply.

@KevinLee1993 KevinLee1993 added the bug Something isn't working label Jan 22, 2024
@eschmidbauer
Copy link

eschmidbauer commented Feb 5, 2024

Im experiencing same issue @KevinLee1993 described. Any information on a solution would be helpful.
also thanks for sharing this project!

@HeCheng0625
Copy link
Collaborator

Hi, you can use any G2P module to get the phone sequence, and "num_frames" is the number of frames of the melspec. (For example, if the hopsize is 200, the num_frame of an 1s 16KHz audio is 80)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants