[precompile] Integrate AOTI as a backend. by zhxchen17 · Pull Request #167338 · pytorch/pytorch

zhxchen17 · 2025-11-07T18:06:58Z

Fixes #ISSUE_NUMBER

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @Lucaskabela

pytorch-bot · 2025-11-07T18:07:01Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/167338

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (4 Unrelated Failures)

As of commit 62bea25 with merge base d8384e2 ():

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

inductor / inductor-cpu-test / test (cpu_inductor_torchbench, 2, 2, linux.2xlarge.amx) (gh) (trunk failure)
stable_diffusion_unet
inductor / inductor-cpu-test / test (dynamic_cpu_inductor_torchbench, 2, 2, linux.2xlarge.amx) (gh) (trunk failure)
stable_diffusion_unet
inductor / inductor-test / test (inductor_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu) (gh) (trunk failure)
stable_diffusion_unet

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

trunk / linux-jammy-py3-clang12-executorch / test (executorch, 1, 1, linux.2xlarge, unstable) (gh) (#166072)
backends/xnnpack/test/recipes/test_xnnpack_recipes.py::TestXnnpackRecipes::test_int8_static_quant_recipe

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jamesjwu · 2025-11-07T19:54:05Z

test/dynamo/test_aot_compile.py

+        with torch.device("cuda"):
+            from torch._dynamo.hooks import Hooks
+
+            mod = SimpleLinearModule()


Instead of testing it this way, let's test it with actual aot_compile_module, passing in the AOTI backend with ModelInput.

jamesjwu

Let's get tests passing and add some tests around aot_compile_module. Otherwise, though, this looks good, hopefully we can land it quickly.

jamesjwu · 2025-11-10T17:38:49Z

torch/__init__.py

+
+        fake_mode = detect_fake_mode(inputs_)
+        ctx = (
+            mock.patch.object(fake_mode, "allow_non_fake_inputs", True)


Why do we need to do this? Also isn't it equivalent to fake_mode.allow_non_fake_inputs?

I think normally aoti assumes this flag has been set from upper layer of the call stack, but in our case we are a new toplevel function so we need to set this properly.

jamesjwu · 2025-11-10T17:39:53Z

torch/__init__.py

                reset_cudagraph_trees()


+class _TorchCompileAOTInductorWrapper(_TorchCompileInductorWrapper):


Let's say I'm a random user with torch.compile, how do I use this? Do I have to pass in this specific backend to fullgraph_compile? Should we make it lower friction somehow?

I.e. backend="inductor" + some config = AOTI
backend="inductor" without config = Python Wrapper

zhxchen17 · 2025-11-10T21:04:28Z

Updated with:

unittest fixes
aot_compile_module() test
A toplevel option to use aoti torch.compile(options={'use_aoti': True})

zhxchen17 · 2025-11-12T18:11:54Z

@pytorchbot merge

pytorchmergebot · 2025-11-12T18:13:51Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-11-12T18:35:19Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / macos-py3-arm64 / test (mps, 1, 1, macos-m2-15)

Details for Dev Infra team

Raised by workflow job

desertfire · 2025-11-12T19:05:58Z

torch/_inductor/output_code.py

+        if self.device_type.startswith("cuda"):
+            current_callable = (
+                torch._C._aoti.AOTIModelContainerRunnerCuda(  # type: ignore[call-arg]
+                    current_callable, 1, self.device_type


One perf trick here is to set run_single_threaded to True, otherwise it won't compose with cudagraphs, see #148601 for more backgrounds.

zhxchen17 · 2025-11-12T19:53:51Z

@pytorchbot merge

zhxchen17 · 2025-11-12T21:31:29Z

@pytorchbot merge

pytorchmergebot · 2025-11-12T21:33:31Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

jeanschmidt · 2025-11-13T17:37:26Z

@pytorchbot revert -m "seems to be breaking internal tests and builds, see D86919103" -c ghfirst

pytorchmergebot · 2025-11-13T17:39:01Z

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

This reverts commit 273babe. Reverted #167338 on behalf of https://github.com/jeanschmidt due to seems to be breaking internal tests and builds, see D86919103 ([comment](#167338 (comment)))

pytorchmergebot · 2025-11-13T17:39:06Z

@zhxchen17 your PR has been successfully reverted.

zhxchen17 · 2025-11-14T15:24:53Z

@pytorchbot merge

pytorchmergebot · 2025-11-14T15:27:21Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#167338 Approved by: https://github.com/jamesjwu

This reverts commit 273babe. Reverted pytorch#167338 on behalf of https://github.com/jeanschmidt due to seems to be breaking internal tests and builds, see D86919103 ([comment](pytorch#167338 (comment)))

Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#167338 Approved by: https://github.com/jamesjwu

zhxchen17 requested a review from bdhirsh as a code owner November 7, 2025 18:06

pytorch-bot bot added ciflow/inductor module: dynamo module: inductor labels Nov 7, 2025

zhxchen17 requested a review from jamesjwu November 7, 2025 18:07

zhxchen17 force-pushed the zhxchen17/precompile/aoti branch from 929d9b9 to 13b69e4 Compare November 7, 2025 18:18

zhxchen17 added the topic: not user facing topic category label Nov 7, 2025

jamesjwu reviewed Nov 7, 2025

View reviewed changes

jamesjwu requested a review from desertfire November 10, 2025 16:32

jamesjwu mentioned this pull request Nov 10, 2025

Add option for AOTI backend for AOT precompile meta-pytorch/attention-gym#177

Merged

jamesjwu suggested changes Nov 10, 2025

View reviewed changes

zhxchen17 force-pushed the zhxchen17/precompile/aoti branch from 13b69e4 to cc82255 Compare November 10, 2025 21:03

zhxchen17 requested a review from jamesjwu November 10, 2025 21:36

mlazos self-requested a review November 11, 2025 10:00

jamesjwu approved these changes Nov 12, 2025

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 12, 2025

pytorchmergebot added the merging label Nov 12, 2025

pytorchmergebot removed the merging label Nov 12, 2025

desertfire reviewed Nov 12, 2025

View reviewed changes

zhxchen17 force-pushed the zhxchen17/precompile/aoti branch 2 times, most recently from 35425f9 to ffff272 Compare November 12, 2025 19:18

pytorch-bot bot added the release notes: inductor (aoti) label Nov 12, 2025

zhxchen17 removed the release notes: inductor (aoti) label Nov 12, 2025

pytorchmergebot added the merging label Nov 12, 2025

pytorch-bot bot added the release notes: inductor (aoti) label Nov 12, 2025

zhxchen17 removed the release notes: inductor (aoti) label Nov 12, 2025

pytorchmergebot added the merging label Nov 12, 2025

pytorchmergebot added the Merged label Nov 13, 2025

pytorchmergebot closed this in 273babe Nov 13, 2025

pytorchmergebot removed the merging label Nov 13, 2025

pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Nov 13, 2025

pytorchmergebot reopened this Nov 13, 2025

zhxchen17 force-pushed the zhxchen17/precompile/aoti branch from 44985e1 to e9cc1a8 Compare November 13, 2025 18:12

pytorch-bot bot added the release notes: inductor (aoti) label Nov 13, 2025

[precompile] Integrate AOTI as a backend.

62bea25

zhxchen17 force-pushed the zhxchen17/precompile/aoti branch from e9cc1a8 to 62bea25 Compare November 13, 2025 20:46

pytorchmergebot added the merging label Nov 14, 2025

pytorchmergebot closed this in b657061 Nov 14, 2025

pytorchmergebot removed the merging label Nov 14, 2025

Silv3S pushed a commit to Silv3S/pytorch that referenced this pull request Nov 18, 2025

[precompile] Integrate AOTI as a backend. (pytorch#167338)

9ed1d69

Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#167338 Approved by: https://github.com/jamesjwu

Silv3S pushed a commit to Silv3S/pytorch that referenced this pull request Nov 18, 2025

[precompile] Integrate AOTI as a backend. (pytorch#167338)

18c9da7

Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#167338 Approved by: https://github.com/jamesjwu

github-actions bot deleted the zhxchen17/precompile/aoti branch December 15, 2025 02:21

		reset_cudagraph_trees()


		class _TorchCompileAOTInductorWrapper(_TorchCompileInductorWrapper):

Conversation

zhxchen17 commented Nov 7, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/167338

✅ You can merge normally! (4 Unrelated Failures)

Uh oh!

jamesjwu Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

jamesjwu left a comment

Choose a reason for hiding this comment

Uh oh!

jamesjwu Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

zhxchen17 Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

jamesjwu Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

zhxchen17 commented Nov 10, 2025

Uh oh!

zhxchen17 commented Nov 12, 2025

Uh oh!

pytorchmergebot commented Nov 12, 2025

Merge started

Uh oh!

pytorchmergebot commented Nov 12, 2025

Merge failed

Uh oh!

desertfire Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

zhxchen17 commented Nov 12, 2025

Uh oh!

zhxchen17 commented Nov 12, 2025

Uh oh!

pytorchmergebot commented Nov 12, 2025

Merge started

Uh oh!

jeanschmidt commented Nov 13, 2025

Uh oh!

pytorchmergebot commented Nov 13, 2025

Uh oh!

pytorchmergebot commented Nov 13, 2025

Uh oh!

zhxchen17 commented Nov 14, 2025

Uh oh!

pytorchmergebot commented Nov 14, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

zhxchen17 commented Nov 7, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Nov 7, 2025 •

edited

Loading