need to swap layer norm op for triton-based layer norm? #57

Open
ankitvgupta opened this issue Mar 22, 2024 · 2 comments · May be fixed by #58

Comments


ankitvgupta commented Mar 22, 2024

In the Flash-attention repo here, there is now a note that the fused CUDA op has been replaced with a Triton op.

In light of that, is it now reasonable to remove the suggestion to pip install the layer norm op from the dependencies section of this README?


ankitvgupta commented Mar 22, 2024

It looks like on this line you check whether the custom layer norm op is installed; if so, this param is set to true. Following the call stack, that sets this param in the Flash-Attention package. That implementation here has since moved to a Triton implementation.

However, later on, the original hyena-DNA code still calls the non-Triton function. Does that need to be swapped out?

Relevant PR: Dao-AILab/flash-attention@abbc131
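
For context, a minimal sketch (not the repo's actual code) of the kind of availability check being described; the module paths reflect what recent flash-attention releases appear to expose and are an assumption:

```python
# Hedged sketch: probe which fused layer norm implementation the installed
# flash-attn provides, so the corresponding config flag can be set accordingly.
# Module paths are assumptions based on recent flash-attention releases.
try:
    # Newer flash-attention: Triton-based layer norm
    from flash_attn.ops.triton.layer_norm import layer_norm_fn
    fused_layer_norm_available = True
except ImportError:
    try:
        # Older flash-attention: fused CUDA extension (the op the README
        # currently suggests pip installing)
        from flash_attn.ops.layer_norm import dropout_add_layer_norm
        fused_layer_norm_available = True
    except ImportError:
        fused_layer_norm_available = False
```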

ankitvgupta changed the title from "no longer need for layer_norm op" to "need to swap layer norm op for triton-based layer norm?" on Mar 22, 2024
ankitvgupta (Author) commented

In case the answer is yes, I think this should do it: #58
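
For illustration only (a sketch, not necessarily what #58 does), the call-site swap might look roughly like the following; the keyword names for the Triton function are an assumption and should be checked against the installed flash-attention version:

```python
import torch
from flash_attn.ops.triton.layer_norm import layer_norm_fn  # Triton-based replacement

hidden = 128
x = torch.randn(2, 16, hidden, device="cuda", dtype=torch.float16)
residual = torch.randn_like(x)
weight = torch.ones(hidden, device="cuda", dtype=torch.float16)
bias = torch.zeros(hidden, device="cuda", dtype=torch.float16)

# Old fused-CUDA path (flash_attn.ops.layer_norm), roughly:
#   out = dropout_add_layer_norm(x, residual, weight, bias, 0.0, 1e-5)
# Possible Triton-based equivalent; argument names are an assumption:
out = layer_norm_fn(x, weight, bias, residual=residual, eps=1e-5)
```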
