[Inductor] Delay codegen for fallback arguments and improve typing by benjaminglass1 · Pull Request #154371 · pytorch/pytorch

benjaminglass1 · 2025-05-26T18:59:20Z

Delays code generation for arguments to fallback ops. This is inspired by #155642, and likely fixes similar memory leaks.
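As a rough illustration of what "delaying code generation" for arguments can mean, the sketch below stores the raw arguments on a call object and renders them to strings only when the call itself is emitted. This is a minimal sketch under stated assumptions, not Inductor's implementation: `DelayedFallbackCall`, `render_arg`, `render_log`, and the kernel name are all hypothetical names invented for this example.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Sequence

# Hypothetical log that records when each argument is actually rendered.
render_log: list[str] = []


def render_arg(arg: object) -> str:
    """Hypothetical renderer: turns a raw argument into a source-code string."""
    render_log.append(repr(arg))
    # Pretend integers name buffers; everything else is emitted as a literal.
    return f"buf_{arg}" if isinstance(arg, int) else repr(arg)


@dataclass
class DelayedFallbackCall:
    """Holds raw arguments and defers rendering until codegen() is called."""

    kernel: str
    raw_args: Sequence[object]
    render: Callable[[object], str] = render_arg

    def codegen(self) -> str:
        # Rendering happens here, at emission time, not at construction time.
        rendered = ", ".join(self.render(a) for a in self.raw_args)
        return f"{self.kernel}({rendered});"


# Constructing the call does not render anything yet.
call = DelayedFallbackCall("hypothetical_fallback_kernel", [0, 1, 2.5])
```

In this sketch nothing is rendered until `call.codegen()` runs, so any state that rendering creates exists only from the point where the call is emitted.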

Additionally, this PR prepares for the next PR in the stack by tightening the typing on a cpp_wrapper interface that is used in only one (well-typed) place, and by handling the downstream effects of that change. In particular, this enabled:

removing a number of now clearly unnecessary asserts
adding a few more targeted asserts to validate the code's current assumptions
removing some unneeded control flow in several functions
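The effect described in the list above can be sketched generically: once a parameter's type is narrowed, defensive asserts and branches in the callee become unnecessary, and a single targeted assert can document what the type cannot express. The function names and signatures below are hypothetical and are not taken from the cpp_wrapper code.

```python
from __future__ import annotations

from typing import Optional, Union


def emit_call_loose(kernel: str, args: Optional[Union[str, list[str]]]) -> str:
    """Loosely typed version: the body has to defend against every shape."""
    assert args is not None  # needed only because the type allows None
    if isinstance(args, str):  # extra control flow for the single-string case
        args = [args]
    return f"{kernel}({', '.join(args)});"


def emit_call_strict(kernel: str, args: list[str]) -> str:
    """Tightly typed version: the signature rules out None and bare strings."""
    # One targeted assert for an assumption the type cannot express.
    assert kernel, "kernel name must be non-empty"
    return f"{kernel}({', '.join(args)});"


# Both versions agree on well-formed input; the strict one is simply shorter.
loose = emit_call_loose("hypothetical_kernel", ["a", "b"])
strict = emit_call_strict("hypothetical_kernel", ["a", "b"])
```

A static type checker can then flag callers that pass `None` or a bare string to the strict version, instead of relying on a runtime assert inside the function.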

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

pytorch-bot · 2025-05-26T18:59:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154371

✅ You can merge normally! (1 Unrelated Failure)

As of commit 1d86eec with merge base 3819584:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / linux-jammy-py3.9-clang12 / test (dynamo_wrapped, 1, 3, linux.2xlarge) (gh) (similar failure)
'test/test_reductions.py::TestReductionsCPU::test_sum_all_cpu_float64'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

benjaminglass1 · 2025-05-27T19:10:08Z

@pytorchbot merge

pytorchmergebot · 2025-05-27T19:12:04Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

benjaminglass1 · 2025-05-27T20:37:23Z

@pytorchbot revert -c nosignal -m "Appears to have broken main"

pytorchmergebot · 2025-05-27T20:38:57Z

@pytorchbot successfully started a revert job. Check the current status here.

…I C-shim dispatching (#154371)" This reverts commit 6169ca0. Reverted #154371 on behalf of https://github.com/benjaminglass1 because it "appears to have broken main" (see the revert comment on #154371).

pytorchmergebot · 2025-05-27T20:39:12Z

@benjaminglass1 your PR has been successfully reverted.

benjaminglass1 · 2025-06-13T17:02:28Z

@henrylhtsang When CI finishes running (I fully believe it will pass, but just to be sure), I'm ready for you to test this PR internally. I've addressed the one plausible memory leak I can see by delaying code generation on some arguments that could theoretically get leaked.

henrylhtsang · 2025-06-13T17:55:21Z

@henrylhtsang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

henrylhtsang · 2025-06-13T17:55:58Z

umm do I need to re-import

benjaminglass1 · 2025-06-13T17:56:38Z

@henrylhtsang Yes, sorry, I caught a bug locally. Should be gtg now.

henrylhtsang · 2025-06-13T17:57:13Z

@henrylhtsang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

henrylhtsang · 2025-06-13T19:12:06Z

Yeah LGTM cc @desertfire

henrylhtsang · 2025-06-13T19:50:43Z

umm it seems to regress the latency by 1%. Any possible ideas?

benjaminglass1 · 2025-06-13T19:52:04Z

@henrylhtsang Which latency? Compile-time latency?

henrylhtsang · 2025-06-13T19:59:41Z

(Quoting benjaminglass1: "@henrylhtsang Which latency? Compile-time latency?")

runtime latency

benjaminglass1 · 2025-06-13T20:03:30Z

@henrylhtsang I've sent some questions to you offline; I'll put any conclusions we come to in this PR.

henrylhtsang · 2025-06-13T21:30:34Z

LGTM, false alarm

benjaminglass1 · 2025-06-13T21:40:28Z

@henrylhtsang Excellent! Once you reimport the (cosmetic) changes I made, we should be GTG!

henrylhtsang · 2025-06-14T00:33:33Z

@henrylhtsang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

benjaminglass1 · 2025-06-16T17:52:25Z

@pytorchbot merge

pytorchmergebot · 2025-06-16T17:54:15Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Update

d745b4b

[ghstack-poisoned]

benjaminglass1 mentioned this pull request May 26, 2025

[AOTInductor] Call most runtime fallback ops without calling into Python #154142

Closed

pytorch-bot bot added ciflow/inductor module: inductor release notes: inductor (aoti) labels May 26, 2025

pytorchbot added the open source label May 26, 2025

benjaminglass1 requested a review from desertfire May 26, 2025 19:58

benjaminglass1 marked this pull request as ready for review May 26, 2025 19:59

benjaminglass1 self-assigned this May 26, 2025

benjaminglass1 added the ciflow/trunk label (Trigger trunk jobs on your pull request) May 26, 2025

benjaminglass1 added 2 commits May 26, 2025 19:59

Update

8a6cee7

[ghstack-poisoned]

Update

465add3

[ghstack-poisoned]

desertfire approved these changes May 27, 2025

pytorchmergebot added the merging label May 27, 2025

pytorchmergebot closed this in 6169ca0 May 27, 2025

pytorchmergebot added Merged and removed merging labels May 27, 2025

pytorchmergebot added the Reverted and ci-no-td (Do not run TD on this PR) labels May 27, 2025

pytorchmergebot reopened this May 27, 2025

Update

e87c172

[ghstack-poisoned]

benjaminglass1 mentioned this pull request May 28, 2025

[cpp_wrapper] Build main and kernel code in separate threads #154551

Closed

benjaminglass1 mentioned this pull request Jun 13, 2025

[NOT FOR MERGE] Exploratory work on AOTInductor training #155877

Closed

5 tasks

Update

4db6554

[ghstack-poisoned]

benjaminglass1 changed the title from "[Inductor] Improve typing, and prepare for ABI-compatible AOTI C-shim dispatching" to "[Inductor] Delay codegen for fallback arguments and improve typing" Jun 13, 2025

benjaminglass1 requested a review from desertfire June 13, 2025 17:01

Update

764dbcb

[ghstack-poisoned]

Update

7f3cdc6

[ghstack-poisoned]

Update

1d86eec

[ghstack-poisoned]

pytorchmergebot added the merging label Jun 16, 2025

pytorchmergebot closed this in 42ff6a4 Jun 16, 2025

pytorchmergebot removed the merging label Jun 16, 2025

github-actions bot deleted the gh/benjaminglass1/85/head branch July 19, 2025 02:21

6 participants