-
Notifications
You must be signed in to change notification settings - Fork 823
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QST] Epilogue Reduction #1518
Labels
Comments
This issue has been labeled |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
What is your question?
I'm looking to define a GEMM that does the following (in pseudocode):
That is, the epilogue should a) compute the column-wise
2-norm
ofD
and b) storeF
to global, no need to storeD
. (2-norm being thesqrt
of thesum of squares
alongaxis=1
).What's the most appropriate epilogue type for this pattern specific for
Ampere
?EVT
would fit this well -- are there examples for this NOT forstream-k
? Always get compilation errors when trying to instantiate an EVT device GEMM forAmpere
(see [QST] Epilogue Broadcast:Adapter
vsGemmUniversal
#1459).The text was updated successfully, but these errors were encountered: