
[Inductor] Lower fallback nodes annotated with "should_fallback" #166339

Closed
chenmillie wants to merge 1 commit into pytorch:main from chenmillie:export-D85347587

Conversation

@chenmillie (Contributor) commented Oct 27, 2025

Summary:
This PR introduces an Inductor-level fallback mechanism that gives users control over which operations or subgraphs Inductor should lower and which should fall back to preexisting kernels. It has a similar motivation to #164776: providing the flexibility to selectively disable Inductor lowering for specific nodes.

The implementation adds a check for the `"should_fallback"` metadata annotation on FX graph nodes. If the annotation is set to `True`, the lowerer falls back before attempting the normal lowering path. Because these are user-directed fallbacks that depend on specific, customized conditions, the fallback is registered with `add_to_fallback_set=False` so that Inductor's global lowering/fallback rules are not permanently overwritten.

A simple example that marks a node for fallback based on a custom predicate:

```
from typing import Callable

import torch.fx

def should_fallback_predicate(node: torch.fx.Node, predicate: Callable[[torch.fx.Node], bool]) -> None:
    # Apply the predicate and mark the node for fallback if it matches
    if predicate(node):
        node.meta["should_fallback"] = True
```
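As an illustration of how such an annotation pass could be wired up over a captured graph, here is a minimal sketch; the `mark_fallback_nodes` helper and the `aten.mm` predicate are hypothetical, not part of this PR:

```
from typing import Callable

import torch
import torch.fx

def mark_fallback_nodes(gm: torch.fx.GraphModule, predicate: Callable[[torch.fx.Node], bool]) -> torch.fx.GraphModule:
    # Walk the captured graph and annotate matching call_function nodes so
    # Inductor falls back to the preexisting kernel instead of lowering them.
    for node in gm.graph.nodes:
        if node.op == "call_function" and predicate(node):
            node.meta["should_fallback"] = True
    return gm

# Example usage: fall back for every aten.mm node in the graph.
# mark_fallback_nodes(gm, lambda n: n.target is torch.ops.aten.mm.default)
```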

Test Plan: added a CI test

Differential Revision: D85347587

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

@pytorch-bot bot commented Oct 27, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166339

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ No Failures

As of commit 76cfd5d with merge base 5e7272b:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-codesync bot commented Oct 27, 2025

@chenmillie has exported this pull request. If you are a Meta employee, you can view the originating Diff in D85347587.

@BoyuanFeng (Contributor)

maybe compose with regional inductor compile #164776

cc @anijain2305

@BoyuanFeng (Contributor) commented Oct 27, 2025

could you try make_fallback?

Example: make_fallback(aten.randint)

Suggested by @angelayi
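For reference, a rough sketch of the op-level registration being suggested, assuming `make_fallback` is imported from `torch._inductor.lowering`:

```
import torch
from torch._inductor.lowering import make_fallback

# Register an op-level fallback: every aten.randint in every compiled graph
# uses the preexisting kernel instead of an Inductor lowering.
make_fallback(torch.ops.aten.randint)
```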

pytorch-bot bot pushed a commit that referenced this pull request Oct 28, 2025
@chenmillie (Contributor, Author) commented Oct 28, 2025

@BoyuanFeng @angelayi Thanks for taking a look. I did consider make_fallback at first, but it registers fallbacks at the op level by default, whereas we want fallback behavior at the node level, where the decision can depend on node-specific attributes. For that reason make_fallback may not be suitable for this feature.
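For example, a node-level decision can look at a node's arguments or traced values, which an op-level registration cannot express. A hypothetical predicate along these lines; the size threshold and the reliance on `node.meta["val"]` holding a traced FakeTensor are assumptions for illustration:

```
import torch
import torch.fx

def fallback_small_mm(node: torch.fx.Node) -> bool:
    # Fall back only for aten.mm nodes whose traced output is small;
    # larger matmuls of the same op are still lowered by Inductor.
    if node.target is not torch.ops.aten.mm.default:
        return False
    val = node.meta.get("val")
    return isinstance(val, torch.Tensor) and val.numel() < 4096
```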

chenmillie added a commit to chenmillie/pytorch that referenced this pull request Oct 28, 2025
chenmillie added a commit to chenmillie/pytorch that referenced this pull request Oct 28, 2025
@blaine-rister (Contributor) left a comment


LGTM! This is a very important feature for MTIA, where we want to customize the fallback logic based on node arguments and not only the node target. Past experience has shown FX passes to be a very convenient way to control fallbacks. Though I'll defer to @BoyuanFeng @eellison or @ezyang to comment on the Inductor implementation.

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 29, 2025
@facebook-github-bot (Contributor)

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

BoyuanFeng pushed a commit that referenced this pull request Oct 31, 2025

Pull Request resolved: #166339
Approved by: https://github.com/blaine-rister, https://github.com/eellison