Moves LayerNorm to output of the Encoder's sub-layers #47

patrickgadd · 2022-09-06T14:45:26Z

Hi there,

As explained in #46, I believe that there is a minor lack of correspondence between the implementation and what's written in the paper ("Context-Aware Learning to Rank with Self-Attention") when it comes to the Transformer architecture.

Sadly I can't say whether this in practice affects performance, as I'm attempting to utilize this work for something entirely different.
However, it looks like that with the fix, learning is a tad more stable.

At any rate, thank you once again for this work and publishing it!

PrzemekPobrotyn · 2022-09-07T11:43:43Z

please see my response in #46

Moves LayerNorm to output of the Encoder's sub-layers

e7d3614

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Moves LayerNorm to output of the Encoder's sub-layers #47

Moves LayerNorm to output of the Encoder's sub-layers #47

patrickgadd commented Sep 6, 2022

PrzemekPobrotyn commented Sep 7, 2022

Moves LayerNorm to output of the Encoder's sub-layers #47

Are you sure you want to change the base?

Moves LayerNorm to output of the Encoder's sub-layers #47

Conversation

patrickgadd commented Sep 6, 2022

PrzemekPobrotyn commented Sep 7, 2022