[AOTInductor] Call most runtime fallback ops without calling into Python by benjaminglass1 · Pull Request #154142 · pytorch/pytorch

benjaminglass1 · 2025-05-22T19:22:58Z


Uses the new aoti_torch_call_dispatcher interface to call runtime fallback ops without calling back into Python. This supports a limited subset of input and output datatypes, but a significant majority of remaining fallback ATen ops are covered.

Fixes #150988
Fixes #153478
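As a rough illustration of the "limited subset of input and output datatypes" constraint described above, the sketch below chooses a call path per op based on its schema types. This is not the PR's code: `SUPPORTED_TYPES`, `can_use_dispatcher`, and `choose_call_path` are hypothetical names, the set of supported types is an assumption made up for the example, and it assumes ops outside that subset keep using the Python fallback.

```python
from typing import Iterable

# Hypothetical: schema type names the dispatcher-based path is assumed to
# handle. The real supported subset is defined by the PR, not by this list.
SUPPORTED_TYPES = frozenset({"Tensor", "int", "float", "bool"})


def can_use_dispatcher(arg_types: Iterable[str], return_types: Iterable[str]) -> bool:
    """Return True only if every input and output type is in the supported subset."""
    return all(t in SUPPORTED_TYPES for t in (*arg_types, *return_types))


def choose_call_path(arg_types: Iterable[str], return_types: Iterable[str]) -> str:
    """Pick the dispatcher path when possible, else keep the Python fallback."""
    return "dispatcher" if can_use_dispatcher(arg_types, return_types) else "python"
```

With this shape, an op taking `("Tensor", "int")` and returning `("Tensor",)` would take the dispatcher path, while an op with any type outside the assumed set would still call into Python.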

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov


pytorch-bot · 2025-05-22T19:23:02Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154142


✅ No Failures

As of commit 06bfb9c with merge base 2625c70:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.


benjaminglass1 · 2025-06-17T20:53:31Z

Benchmarking failures appear to be present on main at this time; please ignore them.

desertfire

So what was the story about the previous memory leak?

benjaminglass1 · 2025-06-18T17:14:39Z

> So what was the story about the previous memory leak?

@desertfire I was never able to definitively determine what happened, but the leak went away when we delayed codegen for fallback kernel arguments until the last possible second (since we don't need to codegen in some cases). I suspect the issue was that we codegen'ed the arguments, but then in some circumstances never deallocated them (since we didn't actually use them).
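The idea of delaying argument codegen can be sketched as follows. This is a minimal illustration under stated assumptions, not the actual Inductor implementation: `LazyArgCodegen` and `emit_fallback_call` are hypothetical names, and the emitted strings (`box`, `release`, `call_dispatcher`, `*_shim`) are made-up placeholders rather than real generated code. The point is only that argument temporaries are emitted at the moment they are known to be needed, so no unused temporary is left without a matching release.

```python
from typing import Callable, Optional


class LazyArgCodegen:
    """Hypothetical helper: defers emitting wrapper code for fallback-op arguments."""

    def __init__(self, emit: Callable[[], list[str]]) -> None:
        self._emit = emit
        self._lines: Optional[list[str]] = None

    def lines(self) -> list[str]:
        # Generate the argument code on first use only, then cache it.
        if self._lines is None:
            self._lines = self._emit()
        return self._lines

    @property
    def materialized(self) -> bool:
        return self._lines is not None


def emit_fallback_call(
    op_name: str, arg_names: list[str], has_direct_shim: bool, out: list[str]
) -> LazyArgCodegen:
    """Append placeholder wrapper lines for one fallback op to `out`."""

    def emit_args() -> list[str]:
        # Each boxed argument is a temporary that must be released later.
        return [f"auto boxed_{a} = box({a});" for a in arg_names]

    lazy = LazyArgCodegen(emit_args)
    if has_direct_shim:
        # Boxed arguments are not needed here, so they are never generated.
        out.append(f"{op_name}_shim({', '.join(arg_names)});")
    else:
        out.extend(lazy.lines())
        out.append(f'call_dispatcher("{op_name}");')
        out.extend(f"release(boxed_{a});" for a in arg_names)
    return lazy
```

In the eager alternative, `emit_args()` would run unconditionally and its lines would be written even on the branch that never uses or releases them, which matches the kind of leak suspected above.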


benjaminglass1 · 2025-06-18T19:38:04Z

@pytorchbot merge

pytorchmergebot · 2025-06-18T19:40:31Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).


pytorchmergebot · 2025-06-18T23:42:46Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

pull / linux-jammy-py3_9-clang9-xla / test (xla, 1, 1, linux.12xlarge)


Failing merge rule: Core Maintainers

benjaminglass1 · 2025-06-19T15:19:38Z

@pytorchbot merge

pytorchmergebot · 2025-06-19T15:21:34Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).


Update 1676672

pytorch-bot bot added ciflow/inductor module: inductor release notes: inductor (aoti) labels May 22, 2025

benjaminglass1 self-assigned this May 22, 2025

pytorchbot added the open source label May 22, 2025

benjaminglass1 mentioned this pull request May 22, 2025

Support C shim for customized OP #150988

Closed

Update 58fe94b

Update c1c3d28

benjaminglass1 mentioned this pull request May 26, 2025

[Inductor] Delay codegen for fallback arguments and improve typing #154371

Closed

benjaminglass1 requested a review from desertfire May 26, 2025 19:58

benjaminglass1 marked this pull request as ready for review May 26, 2025 19:58

benjaminglass1 added the ciflow/trunk label May 26, 2025

Update 6933273

Update ed45456

desertfire reviewed May 27, 2025

torch/_inductor/codegen/cpp_wrapper_cpu.py (two review threads, resolved)

Update f9e4adb

benjaminglass1 mentioned this pull request May 28, 2025

[cpp_wrapper] Build main and kernel code in separate threads #154551

Closed

benjaminglass1 requested a review from desertfire May 28, 2025 20:07

desertfire reviewed May 29, 2025

torch/_inductor/codegen/cpp_wrapper_cpu.py (one review thread, resolved)

benjaminglass1 and others added 3 commits May 29, 2025 15:19

Update 00aef17

Update 3dad7a1

Update ee85a22

desertfire approved these changes Jun 2, 2025


benjaminglass1 added 3 commits June 9, 2025 15:55

Update 663253b

Update bb19cc0

Update 2a38b85

benjaminglass1 mentioned this pull request Jun 13, 2025

[NOT FOR MERGE] Exploratory work on AOTInductor training #155877

Closed

5 tasks

benjaminglass1 added 6 commits June 13, 2025 16:58

Update 95371af

Update d63d7ad

Update e5ef577

Update 230d9f2

Update 61a35ae

Update 0987675

benjaminglass1 requested review from desertfire and janeyx99 June 17, 2025 20:51

desertfire reviewed Jun 18, 2025


Update 06bfb9c

pytorchmergebot added the merging label Jun 18, 2025

pytorchmergebot removed the merging label Jun 18, 2025

pytorchmergebot added the merging label Jun 19, 2025

pytorchmergebot added the Merged label Jun 19, 2025

pytorchmergebot closed this in c9afcff Jun 19, 2025

pytorchmergebot removed the merging label Jun 19, 2025

benjaminglass1 mentioned this pull request Jun 19, 2025

RFC: Use torch.compile to reduce Python overhead mingfeima/sglang#73

Closed

github-actions bot deleted the gh/benjaminglass1/84/head branch July 20, 2025 02:21

benjaminglass1 wants to merge 20 commits into gh/benjaminglass1/84/base from gh/benjaminglass1/84/head


6 participants
