
Use op.dtype to create aten.empty.memory_format during decomposition. #3941

Open · wants to merge 1 commit into main
Conversation

sahas3 (Member) commented Jan 6, 2025

Prior to the change in this PR, `torch-mlir-opt --convert-torch-to-linalg` failed with the following error:

```
error: 'tensor.cast' op operand type 'tensor<200x200x26xf32>' and result type 'tensor<200x200x26xf64>' are cast incompatible
    %1 = torch.aten.empty.memory_format %0, %none, %none, %none, %false, %none : !torch.list<int>, !torch.none, !torch.none, !torch.none, !torch.bool, !torch.none -> !torch.vtensor<[200,200,26],f64>
         ^
note: see current operation: %12 = "tensor.cast"(%11) : (tensor<200x200x26xf32>) -> tensor<200x200x26xf64>
```

This happens because, when the dtype argument of aten.empty.memory_format is none, f32 was selected by default as the element type of the resulting tensor, which does not match the actual element type of the result.
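A minimal plain-Python sketch of the faulty selection (illustrative only; this is not the torch-mlir C++ code, and `select_element_type` is a hypothetical name): when the dtype operand is none, the conversion fell back to f32 instead of consulting the declared result type, which then forces an element-type-changing `tensor.cast` that MLIR rejects.

```python
# Hypothetical model of the buggy element-type selection in the
# torch-to-linalg conversion (not actual torch-mlir code).
def select_element_type(dtype_operand, result_element_type):
    # Bug: when the dtype operand is none, fall back to f32
    # and ignore the element type of the declared result.
    if dtype_operand is None:
        return "f32"
    return dtype_operand

# Declared result is !torch.vtensor<[200,200,26],f64>:
chosen = select_element_type(None, "f64")
print(chosen)  # "f32" -> mismatch, producing the invalid cast f32 -> f64
```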

@sahas3 sahas3 requested a review from ramiro050 January 6, 2025 04:53
sahas3 (Member, Author) commented Jan 6, 2025

Hi @ramiro050, I see that I am reverting your change here. It's not clear to me why the resultElementType was changed in that commit. I verified that the test you added passes with my change. Can you please take a look at this PR? Thanks!

sahas3 (Member, Author) commented Jan 22, 2025

Hello @ramiro050 and @vivekkhandelwal1, can you please review this PR when you have a chance? Thanks!

Comment on lines 365 to 427
```mlir
func.func @torch.aten.empty.memory_format$noneDtype() -> !torch.vtensor<[200,200,26],f64> attributes {torch.assume_strict_symbolic_shapes} {
  %int200 = torch.constant.int 200
  %int26 = torch.constant.int 26
  %false = torch.constant.bool false
  %none = torch.constant.none
  %0 = torch.prim.ListConstruct %int200, %int200, %int26 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int>
  %1 = torch.aten.empty.memory_format %0, %none, %none, %none, %false, %none : !torch.list<int>, !torch.none, !torch.none, !torch.none, !torch.bool, !torch.none -> !torch.vtensor<[200,200,26],f64>
  return %1 : !torch.vtensor<[200,200,26],f64>
}
```
vivekkhandelwal1 (Collaborator) commented:

Hi @sahas3, the above test case is invalid, since the tensor element type is f32 when none is specified for the aten.empty.memory_format op.

I just tried it. See:

```python
>>> a = torch.ops.aten.empty.memory_format([2, 3])
>>> a
tensor([[-3.4054e+29,  4.2543e-41, -3.4085e+29],
        [ 4.2543e-41, -3.4086e+29,  4.2543e-41]])
>>> a.dtype
torch.float32
```

vivekkhandelwal1 (Collaborator) commented:

Also, the changes you've made are not correct, and hence this patch should not be merged.

sahas3 (Member, Author) commented Feb 5, 2025

Great flag, @vivekkhandelwal1. I missed that torch.ops.aten.empty.memory_format produces f32 for dtype=none in PyTorch, since I assumed the aten.empty.memory_format op produced during conversion of the ExportedProgram to the torch dialect was correct.

Upon further investigation, I think the bug is instead in the DecomposeComplexOps pass, where we decompose various torch ops to aten.empty.memory_format. For the empty_like op, if dtype is not specified, it defaults to the input's dtype per https://pytorch.org/docs/stable/generated/torch.empty_like.html. This was not being captured when decomposing to empty.memory_format.
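The empty_like semantics can be checked directly in PyTorch (assuming a standard install with the default dtype unchanged): with no explicit dtype, torch.empty_like inherits the input's dtype, while aten.empty.memory_format falls back to the global default.

```python
import torch

# empty_like with no dtype argument inherits the input tensor's dtype.
x = torch.ones(2, 3, dtype=torch.float64)
y = torch.empty_like(x)
print(y.dtype)  # torch.float64

# By contrast, aten.empty.memory_format with dtype=none uses the global
# default dtype, which is float32 unless changed via set_default_dtype.
z = torch.ops.aten.empty.memory_format([2, 3])
print(z.dtype)  # torch.float32
```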

I have pushed new changes that revert the original change and address this issue. Can you please take another look at these new changes? Thanks!

sahas3 (Member, Author) commented:

Hi @vivekkhandelwal1, can you please take another look at the changes following your initial feedback? Thanks!

@sahas3 sahas3 changed the title Use resultElementType for tensor creation for AtenMemoryFormatOp with none dtype. Use op.dtype to create aten.empty.memory_format during decomposition. Feb 5, 2025
Comment on lines +7068 to +7074

```cpp
FailureOr<Value> dtype = getDtypeFromOp(rewriter, op);
if (failed(dtype)) {
  return rewriter.notifyMatchFailure(
      op, "could not determine dtype from the op.");
}
```

vivekkhandelwal1 (Collaborator) commented:

Hi @sahas3, as you mentioned, in this case if the dtype is not present then it defaults to the input dtype. But this function returns the result dtype when the dtype arg is none. So I am not able to understand how your comment and the changes relate.

sahas3 (Member, Author) commented:

Hi @vivekkhandelwal1, my understanding is that dtype specifies the type of the tensor returned by any op. For ops like empty_like, this dtype, if not specified, defaults to that of the input tensor per https://pytorch.org/docs/stable/generated/torch.empty_like.html. So for the torch.empty_like op itself, the result tensor and the input tensor will have the same type when dtype is not specified.

On the other hand, for ops like empty_strided, dtype defaults to a global value per https://pytorch.org/docs/stable/generated/torch.empty_strided.html. In this case there is no input tensor, but the result tensor should already have the correct type set by the time it is in the torch dialect.

So for both cases, I think we can generalize the construction of AtenEmptyMemoryFormatOp to use the dtype of the result tensor whenever the dtype of the op being rewritten is not specified. We don't need to check the input tensor's dtype, though I can add an assert that the input and output tensor types are the same for ops like empty_like. The reason to generalize to the result tensor's dtype is that ops like empty_strided don't have an input tensor and hence have no op.getSelf() method.

I hope this clarifies the changes. I'll add a gist of this as comment as well for future reference.
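The resolution rule described above can be sketched in plain Python (the actual patch is C++ in torch-mlir; `resolve_dtype` here is a hypothetical mirror of the getDtypeFromOp contract, for illustration only): prefer the op's explicit dtype, otherwise fall back to the dtype already attached to the result tensor.

```python
# Illustrative plain-Python mirror of the dtype-resolution contract:
# use the op's dtype argument when given, else the result tensor's dtype.
def resolve_dtype(op_dtype, result_dtype):
    return op_dtype if op_dtype is not None else result_dtype

# empty_like case: the result dtype equals the input dtype, so falling
# back to the result dtype also preserves the input dtype.
print(resolve_dtype(None, "f64"))   # "f64"

# An explicit dtype argument always wins over the fallback.
print(resolve_dtype("f16", "f64"))  # "f16"
```

This covers the empty_strided case too, since the rule never needs to read an input tensor.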
