[mlir][amdgpu] Add `amdgpu.swizzle_bitmode` op #135513

Hardcode84 · 2025-04-13T01:48:41Z

High-level wrapper on top of `rocdl.ds_swizzle`. Also includes some DPP op cleanup while I'm at it.

Will do the lowering in a separate PR.

llvmbot · 2025-04-13T01:49:16Z

@llvm/pr-subscribers-mlir-gpu
@llvm/pr-subscribers-mlir-amdgpu

@llvm/pr-subscribers-backend-amdgpu

Author: Ivan Butygin (Hardcode84)

Changes

High-level wrapper on top of `rocdl.ds_swizzle`. Also includes some DPP op cleanup while I'm at it.

Will do the lowering in a separate PR.

Full diff: https://github.com/llvm/llvm-project/pull/135513.diff

2 Files Affected:

(modified) mlir/include/mlir/Dialect/AMDGPU/IR/AMDGPU.td (+35-8)
(modified) mlir/test/Dialect/AMDGPU/ops.mlir (+7)

diff --git a/mlir/include/mlir/Dialect/AMDGPU/IR/AMDGPU.td b/mlir/include/mlir/Dialect/AMDGPU/IR/AMDGPU.td
index 108d7237ff703..a92ebf6d8e108 100644
--- a/mlir/include/mlir/Dialect/AMDGPU/IR/AMDGPU.td
+++ b/mlir/include/mlir/Dialect/AMDGPU/IR/AMDGPU.td
@@ -35,6 +35,11 @@ def AMDGPU_Dialect : Dialect {
   let useDefaultAttributePrinterParser = 1;
 }
 
+def AnyIntegerOrFloat : AnyTypeOf<[AnySignlessInteger, AnyFloat], "Integer or Float">;
+
+def AnyIntegerOrFloatOr1DVector :
+  AnyTypeOf<[AnyIntegerOrFloat, VectorOfRankAndType<[1], [AnyIntegerOrFloat]>]>;
+
 //===----------------------------------------------------------------------===//
 // AMDGPU general attribute definitions
 //===----------------------------------------------------------------------===//
@@ -533,14 +538,15 @@ def AMDGPU_DPPPerm : I32EnumAttr<"DPPPerm",
 def AMDGPU_DPPPermAttr : EnumAttr<AMDGPU_Dialect, AMDGPU_DPPPerm,
   "dpp_perm">;
 
-def AMDGPU_DPPOp : AMDGPU_Op<"dpp", [SameTypeOperands, AllTypesMatch<["result", "old", "src"]>]>,
+def AMDGPU_DPPOp : AMDGPU_Op<"dpp",
+    [Pure, SameTypeOperands, AllTypesMatch<["result", "old", "src"]>]>,
   Arguments<(ins AnyType:$old,
-                  AnyType:$src,
-                  AMDGPU_DPPPermAttr:$kind,
-                  OptionalAttr<AnyAttrOf<[I32Attr, ArrayAttr, UnitAttr]>>:$permArgument,
-                  DefaultValuedAttr<I32Attr, "0xf">:$row_mask,
-                  DefaultValuedAttr<I32Attr, "0xf">:$bank_mask,
-                  DefaultValuedAttr<BoolAttr, "false">:$bound_ctrl)> {
+                 AnyType:$src,
+                 AMDGPU_DPPPermAttr:$kind,
+                 OptionalAttr<AnyAttrOf<[I32Attr, ArrayAttr, UnitAttr]>>:$permArgument,
+                 DefaultValuedAttr<I32Attr, "0xf">:$row_mask,
+                 DefaultValuedAttr<I32Attr, "0xf">:$bank_mask,
+                 DefaultValuedAttr<BoolAttr, "false">:$bound_ctrl)> {
   let summary = "AMDGPU DPP operation";
   let description = [{
     This operation represents DPP functionality in a GPU program.
@@ -565,6 +571,27 @@ def AMDGPU_DPPOp : AMDGPU_Op<"dpp", [SameTypeOperands, AllTypesMatch<["result",
   let hasVerifier = 1;
 }
 
+def AMDGPU_SwizzleBitModeOp : AMDGPU_Op<"swizzle_bitmode",
+    [Pure, AllTypesMatch<["result", "src"]>]>,
+  Arguments<(ins AnyIntegerOrFloatOr1DVector:$src,
+                 I32Attr:$and_mask,
+                 I32Attr:$or_mask,
+                 I32Attr:$xor_mask
+             )> {
+  let summary = "AMDGPU ds_swizzle op, bitmode variant";
+  let description = [{
+    High-level wrapper on bitmode `rocdl.ds_swizzle` op, masks are represented
+    as separate fields so user won't need to do manual bitpacking.
+
+    Supports arbitrary int/float/vector types, which will be repacked to i32 and
+    one or more `rocdl.ds_swizzle` ops during lowering.
+  }];
+  let results = (outs AnyIntegerOrFloatOr1DVector:$result);
+  let assemblyFormat = [{
+    $src $and_mask $or_mask $xor_mask attr-dict `:` type($result)
+  }];
+}
+
 def AMDGPU_LDSBarrierOp : AMDGPU_Op<"lds_barrier"> {
   let summary = "Barrier that includes a wait for LDS memory operations.";
   let description = [{
@@ -794,7 +821,7 @@ def AMDGPU_GatherToLDSOp :
 
     The `$dst`, along with its indices, points to the memory location the subgroup of this thread
     will write to.
-  
+
     Note: only enabled for gfx942 and later.
   }];
   let assemblyFormat = [{
diff --git a/mlir/test/Dialect/AMDGPU/ops.mlir b/mlir/test/Dialect/AMDGPU/ops.mlir
index 665674f2a7873..16b3193d270cb 100644
--- a/mlir/test/Dialect/AMDGPU/ops.mlir
+++ b/mlir/test/Dialect/AMDGPU/ops.mlir
@@ -157,3 +157,10 @@ func.func @wmma(%arg0 : vector<16xf16>, %arg1 : vector<8xf16>) -> vector<8xf16>
   %0 = amdgpu.wmma %arg0 * %arg0 + %arg1 : vector<16xf16>, vector<16xf16>, vector<8xf16>
   func.return %0 : vector<8xf16>
 }
+
+// CHECK-LABEL: func @swizzle_bitmode
+func.func @swizzle_bitmode(%arg0 : f32) -> f32 {
+  // CHECK: amdgpu.swizzle_bitmode
+  %0 = amdgpu.swizzle_bitmode %arg0 1 2 4 : f32
+  func.return %0 : f32
+}

krzysz00

I'm curious about why swizzle_bitmode ... Oh, right, probably a special case of DS_SWIZZLE.

Minor comment, but I don't have any substantial objections to the design. Maybe worth noting that this covers one of the cases of ds_swizzle?

mlir/include/mlir/Dialect/AMDGPU/IR/AMDGPU.td

Hardcode84 · 2025-04-16T06:07:26Z

`ds_swizzle` can work in either QDMode or BitMode, selected by bit 15 of the offset (see https://gpuopen.com/learn/amd-gcn-assembly-cross-lane-operations/). I only need the BitMode variant for now.
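
For readers unfamiliar with BitMode, here is a minimal Python sketch of what the three masks do, based on the BitMode description in the GPUOpen article linked above. It is only an illustration, not the lowering from this PR (which comes later). The function names `pack_bitmode_offset`, `source_lane`, and `swizzle` are hypothetical, the 5-bit range check is an assumption (the op itself takes plain `I32Attr` masks), and inactive-lane handling is ignored.

```python
# Illustrative model of ds_swizzle BitMode, per the GPUOpen cross-lane article.
# All names here are hypothetical; this is not the MLIR lowering.

def pack_bitmode_offset(and_mask: int, or_mask: int, xor_mask: int) -> int:
    """Pack three 5-bit masks into the 16-bit offset; bit 15 stays 0 (BitMode)."""
    for name, mask in (("and_mask", and_mask),
                       ("or_mask", or_mask),
                       ("xor_mask", xor_mask)):
        # Assumption: each mask must fit in 5 bits, matching the offset fields.
        if not 0 <= mask <= 0x1F:
            raise ValueError(f"{name} must fit in 5 bits, got {mask}")
    # offset[4:0] = and_mask, offset[9:5] = or_mask, offset[14:10] = xor_mask
    return and_mask | (or_mask << 5) | (xor_mask << 10)


def source_lane(lane: int, and_mask: int, or_mask: int, xor_mask: int) -> int:
    """Lane that `lane` reads from; data is only shared within groups of 32 lanes."""
    within_group = (((lane & 0x1F) & and_mask) | or_mask) ^ xor_mask
    return within_group | (lane & 0x20)  # stay in the same half of a 64-lane wave


def swizzle(values: list, and_mask: int, or_mask: int, xor_mask: int) -> list:
    """Apply the swizzle to per-lane values (all lanes assumed active)."""
    if len(values) % 32 != 0:
        raise ValueError("expected a multiple of 32 lanes")
    return [values[source_lane(i, and_mask, or_mask, xor_mask)]
            for i in range(len(values))]


# The masks from the test in this PR (1, 2, 4) pack to 0x1041.
offset = pack_bitmode_offset(1, 2, 4)

# and=0x1F, or=0, xor=1 swaps each pair of neighbouring lanes.
swapped = swizzle(list(range(64)), 0x1F, 0, 1)
```

With the test's masks `1 2 4`, even lanes in the first group of 32 read from lane 6 and odd lanes read from lane 7, since `((lane & 1) | 2) ^ 4` is either 6 or 7. Keeping the masks as separate attributes on the op means users do not have to do this packing by hand.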

kuhar · 2025-04-16T14:23:39Z

mlir/test/Dialect/AMDGPU/ops.mlir

+  // CHECK: amdgpu.swizzle_bitmode
+  %0 = amdgpu.swizzle_bitmode %arg0 1 2 4 : f32


Should we add some negative tests for unsupported data types?

Hardcode84 requested a review from krzysz00 April 13, 2025 01:48

llvmbot added backend:AMDGPU mlir:gpu mlir mlir:amdgpu labels Apr 13, 2025

krzysz00 reviewed Apr 14, 2025

krzysz00 approved these changes Apr 16, 2025

kuhar reviewed Apr 16, 2025

Hardcode84 added 2 commits April 17, 2025 20:49

[mlir][amdgpu] Add amdgpu.swizzle_bitmode op

7bc22f9

High-level wrapper on top of `rocdl.ds_swizzle`. Also includes some DPP op cleanup while I'm at it.

invalid type test

4b5e10b

Hardcode84 force-pushed the amdgpu_swizzle branch from 5f82ca3 to 4b5e10b Compare April 17, 2025 21:49

Hardcode84 merged commit 0abf227 into llvm:main Apr 17, 2025
11 checks passed

Hardcode84 deleted the amdgpu_swizzle branch April 17, 2025 22:23

IanWood1 pushed a commit to IanWood1/llvm-project that referenced this pull request May 6, 2025

[mlir][amdgpu] Add amdgpu.swizzle_bitmode op (llvm#135513)

b3014ca

High-level wrapper on top of `rocdl.ds_swizzle`. Also includes some DPP op cleanup while I'm at it. Will do the lowering in a separate PR.

IanWood1 pushed a commit to IanWood1/llvm-project that referenced this pull request May 6, 2025

[mlir][amdgpu] Add amdgpu.swizzle_bitmode op (llvm#135513)

7cd6d4b

High-level wrapper on top of `rocdl.ds_swizzle`. Also includes some DPP op cleanup while I'm at it. Will do the lowering in a separate PR.
