
[Dialect] [Linalgx] Add linalgx ops: 3 vnni matmuls and multi_batch_matmul #89

Merged
19 commits merged into main on Jun 3, 2024

Conversation

@LongshengDu (Contributor) commented May 20, 2024

Added mm2d_vnni, mm4d_vnni, and batch_reduce_matmul_vnni with custom verifiers, static indexing_maps and iterator_types, using the vnni dims to derive the constant symbols for the indexing_maps.

Added multi_batch_matmul with LinalgContractionOpInterface and dynamic indexing_maps and iterator_types.

Tracking: #14

@LongshengDu LongshengDu changed the title [WIP] [Linalgx Dialect] Add linalgx ops: mmt2d_vnni, mmt4d_vnni, multi_batch_matmul [WIP] [Dialect] [Linalgx] Add linalgx ops: mmt2d_vnni, mmt4d_vnni, multi_batch_matmul May 20, 2024
@LongshengDu LongshengDu added the WIP work in progress label May 20, 2024
@LongshengDu LongshengDu changed the title [WIP] [Dialect] [Linalgx] Add linalgx ops: mmt2d_vnni, mmt4d_vnni, multi_batch_matmul [Dialect] [Linalgx] Add linalgx ops: mmt2d_vnni, mmt4d_vnni, multi_batch_matmul May 20, 2024
include/gc/Dialect/Linalgx/LinalgxStructuredOps.td
lib/gc/Dialect/Linalgx/LinalgxOps.cpp
bool matchK =
shapeA.getDimSize(1) ==
(shapeB.getDimSize(1) * shapeB.getDimSize(2) * shapeB.getDimSize(4));
bool matchVnni = (shapeB.getDimSize(4) == 1) || (shapeB.getDimSize(4) == 2) ||
                 (shapeB.getDimSize(4) == 4);


Same as above, why do we consider 1 as vnni format here?

LongshengDu (Contributor, Author):

input: MKmk/MK
weight: NKkn4k/NKkn2k/NKkn/KN
output: MNmn/MN

For NKkn format, can we treat it as NKkn1k so we can reuse this op?


Please leave a note here if you'd like to reuse it for F32 datatypes, as vnni always refers to low precision with blk_size 2 or 4.

LongshengDu (Contributor, Author):

Is there a name for NKkn format (differentiating from mmt4d's Nknk)?

Contributor:

> Is there a name for NKkn format (differentiating from mmt4d's Nknk)?

The name is NKkn. For op naming, I think it's mm4d_vnni (no transpose).

@@ -24,3 +24,84 @@ func.func @generalize_sigmoid(%arg0: tensor<4x256x64xbf16>, %arg1: tensor<4x256x
// CHECK-NEXT: linalg.yield %[[DIV]] : bf16

// -----

func.func @generalize_mmt2d_vnni(%arg0: tensor<256x64xf32>, %arg1: tensor<16x2x8x32x4xf32>,


Shall we also add a failing case for checking?

lib/gc/Dialect/Linalgx/LinalgxOps.cpp
}];
}

def Linalgx_MultiBatchMatmulOp : LinalgxStructuredBase_Op<"multi_batch_matmul",
Contributor:

Also define multi_batch_matmul_4d and multi_batch_matmul_4d_vnni?

LongshengDu (Contributor, Author):

Is this definition correct for multi_batch_matmul_4d and multi_batch_matmul_4d_vnni? Do we also want a transposed weight?

input: BMKmk
weight: BNKkn4k/BNKkn2k/BNKkn1k
output: BMNmn

Contributor:

Correct. We don't use transposed weight.

@zhczhong:

Do we need to add batch_reduce_matmul_vnni so that matmul_vnni could be lowered to a brgemm_vnni named op?

input: BMK
weight: BKN4k/BKN2k
output: MN

@LongshengDu (Contributor, Author):

> Do we need to add batch_reduce_matmul_vnni so that matmul_vnni could be lowered to a brgemm_vnni named op?
>
> input: BMK
> weight: BKN4k/BKN2k
> output: MN

Yeah, will add this in the future.

@LongshengDu LongshengDu changed the title [Dialect] [Linalgx] Add linalgx ops: mmt2d_vnni, mmt4d_vnni, multi_batch_matmul [Dialect] [Linalgx] Add linalgx ops: mm2d_vnni, mm4d_vnni, batch_reduce_matmul_vnni, multi_batch_matmul May 22, 2024
@LongshengDu LongshengDu changed the title [Dialect] [Linalgx] Add linalgx ops: mm2d_vnni, mm4d_vnni, batch_reduce_matmul_vnni, multi_batch_matmul [Dialect] [Linalgx] Add linalgx ops: 3 vnni matmuls and multi_batch_matmul May 22, 2024
bool matchK =
shapeA.getDimSize(1) ==
(shapeB.getDimSize(1) * shapeB.getDimSize(2) * shapeB.getDimSize(4));
bool matchVnni = (shapeB.getDimSize(4) == 2) || (shapeB.getDimSize(4) == 4);


Shall we also check the vnni dim value based on dtype? (e.g. restrict vnni to 2 for bf16 and 4 for u8/s8)

LongshengDu (Contributor, Author):

Added.

Base automatically changed from longsheng/add_linalgx to main May 29, 2024 04:55
@LongshengDu LongshengDu removed the WIP work in progress label Jun 3, 2024
@LongshengDu LongshengDu merged commit cecc53c into main Jun 3, 2024
4 checks passed
6 participants