Do not outline elementwise operations #954

newling · 2024-12-03T21:03:59Z

What observations motivate this PR?

I compared the input.opt.ll files with and without coalescing. One difference I noticed was in the number of calls to the elementwise outlined function. Specifically, grepping in the output directory of the run.py script after 2 runs creating directories no_coalescing and with_coalescing:

grep "call void @generic_elementwise" no_coalescing/matmul_truncf_128_256_bf16_f32/input.opt.ll | wc -l
16
grep "call void @generic_elementwise" with_coalescing/matmul_truncf_128_256_bf16_f32/input.opt.ll | wc -l
0

Above notice that there are no function calls to the elementwise func when coalescing is enabled. i.e. the calls must have been inlined. Note that for matmul the number of calls is the same:

grep "call void @generic_matmul" no_coalescing/matmul_truncf_128_256_bf16_f32/input.opt.ll | wc -l
128
grep "call void @generic_matmul" with_coalescing/matmul_truncf_128_256_bf16_f32/input.opt.ll | wc -l
128

Background fact: we go out of program memory quite badly without coalescing (we hit 30 kB, much more than the allowed 16 kB). With coalescing, we fit.

So it would seem that inlining the function calls dramatically reduces the memory required. I can't explain this.

With the above observation, this PR removes function outlining for the elementwise ops, so that they are effectively always inline. With this change, program fits in memory even without coalescing.

Abhishek-Varma

This seems like a very good find @newling ! Thank you!

Another supporting argument that adds to what you mentioned above: since we don't outline linalg.fill (a producer fused to matmul - part of a single loop - PROLOGUE), there shouldn't be any need to outline an elementwise (a consumer fused to matmul - part of a single loop - EPILOGUE).

LGTM % one review comment.

Abhishek-Varma · 2024-12-04T07:22:09Z

compiler/plugins/target/AMD-AIE/iree-amd-aie/Transforms/test/linalg_function_outlining.mlir

+// CHECK-NEXT: func.call @[[MATMUL_K6]](%[[A1]], %[[B1]], %[[C]])
+// CHECK-NEXT: amdaie.end
+// CHECK:      return
+func.func @matmul_example_2(%A0: memref<4x4xbf16>, %B0: memref<4x4xbf16>,


No need for this example. If required, you can add one linalg.matmul to an already existing test case above.

I like this test, I think it's good to have a sequence of tests which slowly change what's being tested from the preceding test, makes diagnosing failures easier. I should use a more descriptive name though...

newling marked this pull request as ready for review December 3, 2024 23:42

newling requested review from MaheshRavishankar, nirvedhmeshram, yzhang93, Abhishek-Varma and jtuyls as code owners December 3, 2024 23:42

newling removed the request for review from nirvedhmeshram December 3, 2024 23:42

Abhishek-Varma approved these changes Dec 4, 2024

View reviewed changes

newling added 2 commits December 4, 2024 08:48

squash

fc739d8

update

dce2ab1

newling force-pushed the do_not_outline_elmwise branch from ae7aa5a to dce2ab1 Compare December 4, 2024 16:52

better lit test function names

cc86c2b

newling enabled auto-merge (squash) December 4, 2024 17:16

newling changed the title ~~Do not outline elementwise operation~~ Do not outline elementwise operations Dec 4, 2024

Merge branch 'main' into do_not_outline_elmwise

0e04317

newling merged commit cbb17c9 into nod-ai:main Dec 4, 2024
7 checks passed

newling deleted the do_not_outline_elmwise branch December 12, 2024 23:08

newling mentioned this pull request Jan 28, 2025

[LinalgFunctionOutlining] Add none, all and balanced outlining strategies #1062

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not outline elementwise operations #954

Do not outline elementwise operations #954

newling commented Dec 3, 2024 •

edited

Loading

Abhishek-Varma left a comment

Abhishek-Varma Dec 4, 2024

newling Dec 4, 2024

Do not outline elementwise operations #954

Do not outline elementwise operations #954

Conversation

newling commented Dec 3, 2024 • edited Loading

Abhishek-Varma left a comment

Choose a reason for hiding this comment

Abhishek-Varma Dec 4, 2024

Choose a reason for hiding this comment

newling Dec 4, 2024

Choose a reason for hiding this comment

newling commented Dec 3, 2024 •

edited

Loading