[DO NOT REVIEW][SMOKE TEST] Skip prims shape in grad transform #1704
base: main
Conversation
for more information, see https://pre-commit.ci
…backward_transform_dependency_fix
…backward_transform_dependency_fix
Co-authored-by: Masaki Kozuki <[email protected]>
for more information, see https://pre-commit.ci
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
@@ -123,7 +123,7 @@ def keep_or_swap(p):
         if not isinstance(p, NumberProxyInterface):
             return p
         if p.name in seen:
-            return p.value  # don't make it a duplicate
+            return None  # don't make it a duplicate
note for myself, I needed this to avoid an error in the rematerialization pass. I'll open a separate PR with a repro once I have one.
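For context, a minimal standalone sketch of the dedup behavior this hunk changes. `keep_or_swap`, `seen`, and `NumberProxyInterface` come from the diff above; everything else below is a simplified stand-in I wrote, not Thunder's actual implementation. As I read the change, a repeated NumberProxy now maps to None rather than to its concrete `.value`, so the duplicate slot stays symbolic instead of re-introducing a baked-in number.

# Simplified stand-in; not Thunder's real NumberProxyInterface.
class NumberProxyInterface:
    def __init__(self, name, value):
        self.name = name
        self.value = value

def dedup(proxies):
    # Mimics the keep_or_swap logic above: keep the first occurrence of each
    # NumberProxy, map later occurrences to None instead of their concrete .value.
    seen = set()
    out = []
    for p in proxies:
        if not isinstance(p, NumberProxyInterface):
            out.append(p)
        elif p.name in seen:
            out.append(None)  # don't make it a duplicate
        else:
            seen.add(p.name)
            out.append(p)
    return out

i0 = NumberProxyInterface("i0", 232)
print(dedup([i0, 3, i0]))  # second i0 becomes None, not 232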
errrrr... this one doesn't seem to be working any more... I'm seeing the assert in rematerialization.py again.
import thunder
import torch

def foo(a, b):
    return a + b

# Only `a` requires grad; `b` intentionally does not, so one gradient path is unused.
a = torch.randn(1, 32, 232, 232)
a.requires_grad_()
b = torch.randn(1, 1, 232, 232)
# b.requires_grad_()

jfoo = thunder.jit(foo, cache="symbolic values")
out = jfoo(a, b)
took me a while to get a repro here.
I think the issue is that saving-for-backward doesn't properly identify which gradient path isn't required, so after DCE kicks in, saved_for_backward ends up inconsistent... now I suspect I just missed a DCE call somewhere.
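To illustrate the failure mode described above with a simplified, standalone analogue (none of these names are Thunder's actual APIs; this only sketches the shape of the bug under that reading): if saved_for_backward is collected as though every gradient path were required, and DCE later prunes the ops feeding the unneeded path, the saved set can reference values the pruned trace no longer produces.

# Hypothetical toy model of the mismatch; not Thunder code.
fwd_ops = [
    {"output": "t0", "inputs": {"a", "b"}},  # t0 = a + b (the forward result)
    {"output": "t1", "inputs": {"b"}},       # intermediate needed only for grad of b
]

# Collected as if both gradients were required.
saved_for_backward = {"t0", "t1"}

# DCE keeps only what the forward output and the required grad (of `a`) need.
needed = {"t0"}
pruned_fwd = [op for op in fwd_ops if op["output"] in needed]

produced = {op["output"] for op in pruned_fwd}
print(saved_for_backward - produced)  # {'t1'}: a saved value with no producer left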
trying my luck with #1725
             else:
                 assert isinstance(new, ProxyInterface), (old, new)
-                swap_map[variableify(new)] = old
+                if variableify(old) != variableify(new):
+                    swap_map[variableify(new.primal)] = old
note for myself, this is a separate change; break it out into its own PR.
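For context on what this hunk touches, a hypothetical, self-contained sketch of the general pattern: a swap map keyed by a hashable "variable" view of each proxy, mapping new proxies back to the ones they replace, and skipping pairs where nothing actually changed. `swap_map` and `variableify` appear in the diff above; the `Proxy` class, `build_swap_map`, and the key format are stand-ins I made up, not Thunder's implementation (which additionally keys on `new.primal` here).

# Hypothetical, self-contained sketch; not Thunder's real helpers.
from dataclasses import dataclass

@dataclass(frozen=True)
class Proxy:
    name: str

def variableify(p):
    # Stand-in for a variableify-style helper: wrap a proxy in a hashable key.
    return ("var", p.name)

def build_swap_map(pairs):
    # Map each new proxy (by its variable key) back to the old proxy it replaces,
    # skipping pairs where old and new are effectively the same variable.
    swap_map = {}
    for old, new in pairs:
        if variableify(old) != variableify(new):
            swap_map[variableify(new)] = old
    return swap_map

old_a, new_a = Proxy("a"), Proxy("a_0")
old_b, new_b = Proxy("b"), Proxy("b")  # unchanged; no self-swap entry
print(build_swap_map([(old_a, new_a), (old_b, new_b)]))
# {('var', 'a_0'): Proxy(name='a')}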
No description provided.