[BUG]: NaturalSpeech2 training issue #115

KevinLee1993 · 2024-01-22T11:53:04Z

Describe the bug

Thank you so much for sharing this wonderful project. However, I have some problem about the tts ns2 training.
./egs/tts/NaturalSpeech2/README.md suggests us to follow other Amphion TTS recipes for the data processing. But After I finish the features that need to be used in ns2 using fs2 and valle data preprocess script, I find I can not run the training script of ns2 successfully. In ./models/tts/naturalspeech2/ns2_dataset.py, some of the features seems to be obtained by refer to "phones" and "num_frames" in metadata, which is NOT included in the train.txt file.
Is there anything else I can do to run ns2 training successfully. Or should I just wait for the official update of ns2 preprocess as I have seen in other issue.
Can any of the author tell me when would the preprocess script be ready? Looking forward for your reply.

eschmidbauer · 2024-02-05T21:46:02Z

Im experiencing same issue @KevinLee1993 described. Any information on a solution would be helpful.
also thanks for sharing this project!

HeCheng0625 · 2024-04-02T12:07:46Z

Hi, you can use any G2P module to get the phone sequence, and "num_frames" is the number of frames of the melspec. (For example, if the hopsize is 200, the num_frame of an 1s 16KHz audio is 80)

KevinLee1993 added the bug Something isn't working label Jan 22, 2024

ArkhamImp assigned lmxue and HeCheng0625 Jan 23, 2024

HarryHe11 mentioned this issue Jan 29, 2024

Librilight Preprocess Scripts Revision #125

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG]: NaturalSpeech2 training issue #115

[BUG]: NaturalSpeech2 training issue #115

KevinLee1993 commented Jan 22, 2024

eschmidbauer commented Feb 5, 2024 •

edited

HeCheng0625 commented Apr 2, 2024

[BUG]: NaturalSpeech2 training issue #115

[BUG]: NaturalSpeech2 training issue #115

Comments

KevinLee1993 commented Jan 22, 2024

Describe the bug

eschmidbauer commented Feb 5, 2024 • edited

HeCheng0625 commented Apr 2, 2024

eschmidbauer commented Feb 5, 2024 •

edited