
About Guided Attention loss #45

Open
ywh-my opened this issue Nov 3, 2021 · 2 comments

Comments


ywh-my commented Nov 3, 2021

Hello. I am trying to use your model for a mel-spectrogram-to-mel-spectrogram conversion task.

I tried to add a guided attention loss like this:

dal_g = guided_attention(Nt, Ns)  # this function is included in your code, but never used
dal_loss = (hp.lamda_attn_dal * dal_g * attn_probs).abs().mean()

I found that this loss does not decay, while the seq loss declines quickly, and my training does not converge.

I would appreciate your help. Thank you!
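For anyone landing here, below is a minimal sketch of a guided-attention penalty in PyTorch, following Tachibana et al. (2017), where W[t, s] = 1 - exp(-(t/Nt - s/Ns)^2 / (2 g^2)) with g around 0.2, so attention mass away from the (normalized) diagonal is penalized. The name and signature guided_attention(Nt, Ns) mirror the snippet above; the helper guided_attention_loss and its lam weight are hypothetical and may not match this repository's implementation.

import torch

def guided_attention(Nt: int, Ns: int, g: float = 0.2) -> torch.Tensor:
    # Soft diagonal penalty matrix W of shape (Nt, Ns): near 0 on the
    # normalized diagonal t/Nt == s/Ns, approaching 1 away from it.
    t = torch.arange(Nt).unsqueeze(1) / Nt  # (Nt, 1)
    s = torch.arange(Ns).unsqueeze(0) / Ns  # (1, Ns)
    return 1.0 - torch.exp(-((t - s) ** 2) / (2.0 * g ** 2))

def guided_attention_loss(attn_probs: torch.Tensor, lam: float = 1.0) -> torch.Tensor:
    # attn_probs: attention weights of shape (batch, Nt, Ns), rows summing to 1.
    Nt, Ns = attn_probs.shape[-2:]
    w = guided_attention(Nt, Ns).to(attn_probs.device)
    return lam * (attn_probs * w).mean()

Two small observations on the snippet above: since both the attention weights and W are non-negative, the .abs() is a no-op; and in batched training it is usual to mask padded positions before taking the mean, since averaging over padding dilutes the penalty.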

@vskadandale

Hi, did you eventually get this working? While training on the LJSpeech dataset, I notice that the diagonal alignment does not appear in the decoder self-attention or the encoder-decoder attention, but only in the encoder self-attention, around 160K iterations. Please let me know if you had similar issues. Many thanks!

ywh-my (Author) commented Mar 31, 2023

Sorry, I abandoned this project long ago. >_<
