Generation issues with inputs which contain commas, punctuation, and ellipsis #130

somewheresy · 2024-04-11T13:23:04Z

When inputs contain commas and periods, the output generations often create too much interstitial space / silence between words before and after the characters.

When ellipsis are present in the input, instead of creating more interstitial space (silence), the model tends to hallucinate.

Could this be from removal of characters in the 100K hour training set?

vatsalaggarwal · 2024-04-18T14:20:58Z

Are you able to share some of the texts?

It tends to be dependent on the speaker reference used, and it will be improved in one of the 2 upcoming releases

somewheresy · 2024-04-18T18:06:07Z

This issue has seemingly been resolved by finetuning the base model for 20 epochs on hand-crafted voice data which includes commas, multiple sentences, and ellipses (41 < len < 261, n=600, 0.2 train split, no LR decay, default training config). Still seems to have trouble having correct / accurate prosody -- even when zero-shot cloning voice is in the 'train' segmentation of the dataset. We call this effect the "William Shatner" effect... 😸

lucapericlp · 2024-05-14T21:07:55Z

Thanks for the update @somewheresy & sorry for the delay in my response - I'll go ahead & close out the issue but feel free to reopen if there's anything else to raise.

lucapericlp closed this as completed May 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generation issues with inputs which contain commas, punctuation, and ellipsis #130

Generation issues with inputs which contain commas, punctuation, and ellipsis #130

somewheresy commented Apr 11, 2024

vatsalaggarwal commented Apr 18, 2024 •

edited

somewheresy commented Apr 18, 2024

lucapericlp commented May 14, 2024

Generation issues with inputs which contain commas, punctuation, and ellipsis #130

Generation issues with inputs which contain commas, punctuation, and ellipsis #130

Comments

somewheresy commented Apr 11, 2024

vatsalaggarwal commented Apr 18, 2024 • edited

somewheresy commented Apr 18, 2024

lucapericlp commented May 14, 2024

vatsalaggarwal commented Apr 18, 2024 •

edited