[inductor] support linear & layer_norm unbacked by ColinPeppler · Pull Request #155267 · pytorch/pytorch

ColinPeppler · 2025-06-05T21:54:00Z

What

Use statically_known_true over guard_size_oblivious in cases where we're checking an optimization path. Otherwise, it will DDE and we can't take the safe/slower path.
For broadcast checks, use fallback=False if we encounter a DDE. Typically, unbackeds would be ≥2 and that falls inline with size-oblivious reasoning (i.e. when size_oblivious=True).

Example DDE

torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)).  (Size-like symbols: u0)

Caused by: (_inductor/lowering.py:488 in broadcast_symbolic_shapes)

torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)).  (Size-like symbols: u0)

Caused by: (_inductor/ir.py:2797 in create)

Stack from ghstack (oldest at bottom):

-> [inductor] support linear & layer_norm unbacked #155267

cc @ezyang @penguinwu @bobrenjc93 @voznesenskym @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

[ghstack-poisoned]

pytorch-bot · 2025-06-05T21:54:04Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155267

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

linux.aws.h100.8 instance is down, potentially longer queue on linux.aws.h100

✅ You can merge normally! (1 Unrelated Failure)

As of commit b37ff27 with merge base 25fbf09 ():

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / linux-jammy-py3_9-clang9-xla / test (xla, 1, 1, linux.12xlarge) (gh) (#158876)
sccache: error: couldn't connect to server

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 628173e Pull Request resolved: #155267

torch/_refs/__init__.py

eellison

cc @zou3519 , @bdhirsh - what's our policy for this ? How do we reconcile "strides wrong is a ubn" and now introducing meta functions where we are potentially not computing strides correctly.

zou3519 · 2025-06-17T02:48:59Z

cc @zou3519 , @bdhirsh - what's our policy for this ? How do we reconcile "strides wrong is a ubn" and now introducing meta functions where we are potentially not computing strides correctly.

We discussed this at composability sync. The conclusion was:

strides wrong for backed symints is a UBN
unbacked symints are explicitly wrong. There's no way to fulfill the requirements for unbacked symints and have correct strides at the same time. So unbacked symints being "wrong" is not a problem.

### What - Use `statically_known_true` over `guard_size_oblivious` in cases where we're checking an optimization path. Otherwise, it will DDE and we can't take the safe/slower path. - For broadcast checks, use fallback=False if we encounter a DDE. Typically, unbackeds would be ≥2 and that falls inline with size-oblivious reasoning. - In `coerce_tangent_and_suggest_memory_format`, use the torch._refs version of `contiguous` over Aten's version because Aten doesn't have a DDE-friendly path (yet). ### Example DDE ``` GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq(128*((u0//387)), 0) (unhinted: Eq(128*((u0//387)), 0)). (Size-like symbols: u0) Caused by: (_functorch/_aot_autograd/collect_metadata_analysis.py:84 in coerce_tangent_and_suggest_memory_format) ``` ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/lowering.py:488 in broadcast_symbolic_shapes) ``` ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/ir.py:2797 in create) ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: bdccfdc Pull Request resolved: #155267

### What - Use `statically_known_true` over `guard_size_oblivious` in cases where we're checking an optimization path. Otherwise, it will DDE and we can't take the safe/slower path. - For broadcast checks, use fallback=False if we encounter a DDE. Typically, unbackeds would be ≥2 and that falls inline with size-oblivious reasoning. - In `coerce_tangent_and_suggest_memory_format`, use the torch._refs version of `contiguous` over Aten's version because Aten doesn't have a DDE-friendly path (yet). ### Example DDE ``` GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq(128*((u0//387)), 0) (unhinted: Eq(128*((u0//387)), 0)). (Size-like symbols: u0) Caused by: (_functorch/_aot_autograd/collect_metadata_analysis.py:84 in coerce_tangent_and_suggest_memory_format) ``` ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/lowering.py:488 in broadcast_symbolic_shapes) ``` ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/ir.py:2797 in create) ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: 902c7d0 Pull Request resolved: #155267

laithsakka · 2025-07-21T20:59:49Z

@eellison the old used to used size_oblivious=True anyway so this is not changing existing behaviour for unbacked anwyay size_oblivious=True means assume unbacked>=2

laithsakka · 2025-07-21T21:00:18Z

torch/_inductor/lowering.py

                (
                    V.graph.sizevars.shape_env.evaluate_expr(
-                        sympy.Eq(a, 1), size_oblivious=True
+                        sympy.Eq(a, 1), size_oblivious=True, fallback_value=False


remove size_oblivious=True ditto for others.

### What - Use `statically_known_true` over `guard_size_oblivious` in cases where we're checking an optimization path. Otherwise, it will DDE and we can't take the safe/slower path. - For broadcast checks, use fallback=False if we encounter a DDE. Typically, unbackeds would be ≥2 and that falls inline with size-oblivious reasoning. ### Example DDE ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/lowering.py:488 in broadcast_symbolic_shapes) ``` ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/ir.py:2797 in create) ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: db1a366 Pull Request resolved: #155267

eellison

Looks reasonable

eellison · 2025-07-22T15:11:35Z

torch/_inductor/lowering.py

                    V.graph.sizevars.shape_env.evaluate_expr(
-                        sympy.Eq(a, 1), size_oblivious=True
+                        sympy.Eq(a, 1), fallback_value=False
                    )
                    and not V.graph.sizevars.shape_env.evaluate_expr(
-                        sympy.Eq(b, 1), size_oblivious=True
+                        sympy.Eq(b, 1), fallback_value=False
                    )
                )
                or (
                    not V.graph.sizevars.shape_env.evaluate_expr(
-                        sympy.Eq(a, 1), size_oblivious=True
+                        sympy.Eq(a, 1), fallback_value=False
                    )
                    and V.graph.sizevars.shape_env.evaluate_expr(
-                        sympy.Eq(b, 1), size_oblivious=True
+                        sympy.Eq(b, 1), fallback_value=False
                    )


Can we reuse the above broadcast_symbolic_shapes here instead of duplicating the logic ? (i know this was preexisting to your change)

this one would look a lil bit forced since broadcast_symbolic_shapes returns the broadcasted shape vs. here we just want to check for broadcast.

But agreed the code looks duplicated, I'm gonna find a more reusable way.

### What - Use `statically_known_true` over `guard_size_oblivious` in cases where we're checking an optimization path. Otherwise, it will DDE and we can't take the safe/slower path. - For broadcast checks, use `fallback=False` if we encounter a DDE. Typically, unbackeds would be ≥2 and that falls inline with size-oblivious reasoning (i.e. when `size_oblivious=True`). ### Example DDE ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/lowering.py:488 in broadcast_symbolic_shapes) ``` ``` torch._inductor.exc.InductorError: LoweringException: GuardOnDataDependentSymNode: Could not guard on data-dependent expression Eq((u0//387), 1) (unhinted: Eq((u0//387), 1)). (Size-like symbols: u0) Caused by: (_inductor/ir.py:2797 in create) ``` cc ezyang penguinwu bobrenjc93 voznesenskym EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben [ghstack-poisoned]

ghstack-source-id: 2c714f5 Pull Request resolved: #155267

ColinPeppler · 2025-07-23T02:30:12Z

@pytorchbot merge

pytorchmergebot · 2025-07-23T02:32:02Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

[inductor] support linear & layer_norm unbacked

9414ff8

[ghstack-poisoned]

ColinPeppler requested a review from bdhirsh as a code owner June 5, 2025 21:54

ColinPeppler mentioned this pull request Jun 5, 2025

[export] support linear & layer_norm unbacked #155260

Closed

pytorch-bot bot added the ciflow/inductor label Jun 5, 2025

ColinPeppler added a commit that referenced this pull request Jun 5, 2025

[inductor] support linear & layer_norm unbacked

da43098

ghstack-source-id: 628173e Pull Request resolved: #155267

pytorch-bot bot added the module: inductor label Jun 5, 2025

ColinPeppler added the topic: not user facing topic category label Jun 5, 2025

ColinPeppler commented Jun 5, 2025

View reviewed changes

torch/_refs/__init__.py Outdated Show resolved Hide resolved

eellison reviewed Jun 5, 2025

View reviewed changes

ColinPeppler added a commit that referenced this pull request Jul 18, 2025

[inductor] support linear & layer_norm unbacked

0199106

ghstack-source-id: bdccfdc Pull Request resolved: #155267

ColinPeppler added a commit that referenced this pull request Jul 18, 2025

[inductor] support linear & layer_norm unbacked

42e7b39

ghstack-source-id: 902c7d0 Pull Request resolved: #155267

ColinPeppler requested a review from laithsakka July 21, 2025 19:51

laithsakka reviewed Jul 21, 2025

View reviewed changes

ColinPeppler added a commit that referenced this pull request Jul 22, 2025

[inductor] support linear & layer_norm unbacked

75f4bcf

ghstack-source-id: db1a366 Pull Request resolved: #155267

ColinPeppler requested review from eellison and laithsakka July 22, 2025 02:08

eellison requested review from bobrenjc93 and ezyang July 22, 2025 14:59

eellison added the module: dynamic shapes label Jul 22, 2025

eellison approved these changes Jul 22, 2025

View reviewed changes

ColinPeppler added a commit that referenced this pull request Jul 22, 2025

[inductor] support linear & layer_norm unbacked

6098c69

ghstack-source-id: 2c714f5 Pull Request resolved: #155267

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 23, 2025

pytorchmergebot added the merging label Jul 23, 2025

pytorchmergebot closed this in a6b7bea Jul 23, 2025

pytorchmergebot added Merged and removed merging labels Jul 23, 2025

github-actions bot deleted the gh/ColinPeppler/72/head branch August 23, 2025 02:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[inductor] support linear & layer_norm unbacked#155267

[inductor] support linear & layer_norm unbacked#155267
ColinPeppler wants to merge 5 commits intogh/ColinPeppler/72/basefrom
gh/ColinPeppler/72/head

ColinPeppler commented Jun 5, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jun 5, 2025 •

edited

Loading

Uh oh!

Uh oh!

eellison left a comment

Uh oh!

zou3519 commented Jun 17, 2025

Uh oh!

laithsakka commented Jul 21, 2025

Uh oh!

laithsakka Jul 21, 2025 •

edited

Loading

Uh oh!

eellison left a comment

Uh oh!

eellison Jul 22, 2025

Uh oh!

ColinPeppler Jul 22, 2025

Uh oh!

ColinPeppler commented Jul 23, 2025

Uh oh!

pytorchmergebot commented Jul 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

ColinPeppler commented Jun 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Example DDE

Uh oh!

pytorch-bot bot commented Jun 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155267

❗ 1 Active SEVs

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

Uh oh!

eellison left a comment

Choose a reason for hiding this comment

Uh oh!

zou3519 commented Jun 17, 2025

Uh oh!

laithsakka commented Jul 21, 2025

Uh oh!

laithsakka Jul 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eellison left a comment

Choose a reason for hiding this comment

Uh oh!

eellison Jul 22, 2025

Choose a reason for hiding this comment

Uh oh!

ColinPeppler Jul 22, 2025

Choose a reason for hiding this comment

Uh oh!

ColinPeppler commented Jul 23, 2025

Uh oh!

pytorchmergebot commented Jul 23, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ColinPeppler commented Jun 5, 2025 •

edited

Loading

pytorch-bot bot commented Jun 5, 2025 •

edited

Loading

laithsakka Jul 21, 2025 •

edited

Loading