[BACKEND] Fix crash in coalesce pass with blocked ptr #3866

etiotto · 2024-05-09T17:33:58Z

The setCoalescedEncoding function can handle operations that have a 'mem access ptr' with type RankedTensorType:

  void
  setCoalescedEncoding(ModuleAxisInfoAnalysis &axisInfoAnalysis, Operation *op,
                       int numWarps, int threadsPerWarp,
                       llvm::MapVector<Operation *, Attribute> &layoutMap) {
    Value ptr = getMemAccessPtr(op);
    auto refTensorType = cast<RankedTensorType>(ptr.getType());

Therefore the caller in runOnOperation should avoid calling it when the 'mem access ptr' does not have RankedTensorType (otherwise the cast in the callee will fail).

…ensorType' Signed-off-by: Tiotto, Ettore <[email protected]>

jlebar · 2024-05-09T18:10:56Z

Is it possible to write a test?

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto · 2024-05-10T21:49:04Z

Is it possible to write a test?

@jlebar I have added a lit test in coalesce.mlir

ThomasRaoux · 2024-05-10T21:57:37Z

test/TritonGPU/coalesce.mlir

+#mma = #triton_gpu.nvidia_mma<{warpsPerCTA = [8, 1], CTAsPerCGA = [1, 1], CTASplitNum = [1, 1], CTAOrder = [1, 0], instrShape = [16, 256, 16]}>
+
+// CHECK-LABEL: @fwd_kernel
+module attributes {"triton_gpu.num-warps" = 4 : i32, "triton_gpu.threads-per-warp" = 16 : i32} {
+ tt.func public @fwd_kernel(%arg0: !tt.ptr<f16> , %arg1: !tt.ptr<f16> , %arg2: !tt.ptr<f16> , %arg3: f32, %arg4: !tt.ptr<f32> , %arg5: !tt.ptr<f16> , %arg6: i32 , %arg7: i32 , %arg8: i32 , %arg9: i32 , %arg10: i32 , %arg11: i32 , %arg12: i32 , %arg13: i32 , %arg14: i32 , %arg15: i32 , %arg16: i32 , %arg17: i32 , %arg18: i32, %arg19: i32, %arg20: i32 , %arg21: i32) {
+ %cst = arith.constant dense<0.000000e+00> : tensor<128x16xf32, #mma> 
+ %cst_0 = arith.constant dense<0.000000e+00> : tensor<128xf32, #triton_gpu.slice<{dim = 1, parent = #mma}>> 


can you minimize the test?

The test was reduced (minimized) from a much larger test. I might be able to reduce it a bit more (remove some arguments). Let me try.

The test was reduced (minimized) from a much larger test.

If we accept patches with large testcases like this, our codebase quickly becomes difficult to maintain. Indeed I've lost many days dealing with failures in tests like this.

You may need to write a testcase by hand. Coming up with clear, small testcases is a critical part of contributing to Triton.

Done. Now the test contains just a couple of operations.

… fix_coalesce

Signed-off-by: Tiotto, Ettore <[email protected]>

Remove unused code: setCoalescedEncoding handles only ptr of 'RankedT…

88e9e24

…ensorType' Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested a review from ptillet as a code owner May 9, 2024 17:33

etiotto mentioned this pull request May 9, 2024

Modify pass pipeline to allow lowering tt.load to 2DBlockRead intel/intel-xpu-backend-for-triton#1061

Merged

etiotto added 2 commits May 10, 2024 21:46

Add lit test

b3dabd6

Signed-off-by: Tiotto, Ettore <[email protected]>

Merge branch 'main' into fix_coalesce

58e2c99

ThomasRaoux reviewed May 10, 2024

View reviewed changes

etiotto added 3 commits May 14, 2024 18:28

Merge remote-tracking branch 'upstream/main' into fix_coalesce

3b9ac7a

Merge branch 'fix_coalesce' of https://github.com/etiotto/triton into…

87cbcd2

… fix_coalesce

Reduce lit test

896e639

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested review from jlebar and ThomasRaoux May 14, 2024 19:11

etiotto mentioned this pull request May 14, 2024

Upstream fix for Coalesce pass intel/intel-xpu-backend-for-triton#1128

Closed

ThomasRaoux approved these changes May 15, 2024

View reviewed changes

ThomasRaoux enabled auto-merge (squash) May 15, 2024 17:35

ThomasRaoux disabled auto-merge May 15, 2024 17:44

ThomasRaoux changed the title ~~Fix CoalescePass~~ [BACKEND] Fix crash in coalesce pass with blocked ptr May 15, 2024

ThomasRaoux merged commit 25b4212 into triton-lang:main May 15, 2024
5 checks passed

etiotto deleted the fix_coalesce branch May 16, 2024 13:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BACKEND] Fix crash in coalesce pass with blocked ptr #3866

[BACKEND] Fix crash in coalesce pass with blocked ptr #3866

etiotto commented May 9, 2024

jlebar commented May 9, 2024

etiotto commented May 10, 2024

ThomasRaoux May 10, 2024

etiotto May 14, 2024

jlebar May 14, 2024

etiotto May 14, 2024

[BACKEND] Fix crash in coalesce pass with blocked ptr #3866

[BACKEND] Fix crash in coalesce pass with blocked ptr #3866

Conversation

etiotto commented May 9, 2024

jlebar commented May 9, 2024

etiotto commented May 10, 2024

ThomasRaoux May 10, 2024

Choose a reason for hiding this comment

etiotto May 14, 2024

Choose a reason for hiding this comment

jlebar May 14, 2024

Choose a reason for hiding this comment

etiotto May 14, 2024

Choose a reason for hiding this comment