Inline dispatch_and_compile into its call site. by ezyang · Pull Request #158150 · pytorch/pytorch

ezyang · 2025-07-11T21:19:16Z

Stack from ghstack (oldest at bottom):

Signed-off-by: Edward Z. Yang ezyang@meta.com

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames

[ghstack-poisoned]

pytorch-bot · 2025-07-11T21:19:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158150

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 35389cc with merge base 4b9a6f7 ():

NEW FAILURE - The following job has failed:

pull / linux-jammy-py3-clang12-mobile-build / build (gh)
Final attempt failed. Child_process exited with error code 1

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / cuda12.8-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu, unstable) (gh) (#153987)
MISSING REGRESSION TEST

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

ezyang · 2025-07-14T17:15:47Z

@pytorchbot merge

pytorchmergebot · 2025-07-14T17:17:35Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-07-14T17:36:01Z

Starting merge as part of PR stack under #158176

pytorchmergebot · 2025-07-14T18:36:59Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / macos-py3-arm64 / test (default, 1, 3, macos-m1-stable)

Details for Dev Infra team

Raised by workflow job

[ghstack-poisoned]

ezyang · 2025-07-15T19:01:18Z

@pytorchbot merge -i

pytorchmergebot · 2025-07-15T19:03:02Z

Merge started

Your change will be merged while ignoring the following 2 checks: pull / linux-jammy-py3-clang12-mobile-build / build, pull / cuda12.8-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu, unstable)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-07-15T19:03:50Z

Starting merge as part of PR stack under #158176

pytorchmergebot · 2025-07-15T19:04:19Z

Starting merge as part of PR stack under #158213

pytorchmergebot · 2025-07-15T19:04:54Z

Starting merge as part of PR stack under #158251

pytorchmergebot · 2025-07-15T19:05:26Z

Starting merge as part of PR stack under #158319

Two main things of note: - Review this diff without whitespace changes - To ensure that context managers correctly propagate to later pipeline stages, I am using the ExitStack trick: there is an ExitStack which is in scope for the entire pipeline, and inside of the individual pipeline stages we push context managers onto this stack when we want them to survive into the next pipeline stage. This is not obviously what the best final form of the code is, but create_aot_dispatcher_function is called from multiple locations so I can't just inline the context managers into the call site. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #158173 Approved by: https://github.com/jamesjwu, https://github.com/wconstab ghstack dependencies: #158149, #158150

…8176) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #158176 Approved by: https://github.com/jamesjwu ghstack dependencies: #158149, #158150, #158173

The starting point for this refactor is that I need access to the fully general joint graph representation in an export-like interface, but I then subsequently need a way to feed this joint graph into the rest of the compilation pipeline so I can get an actual callable that I can run once I've finished modifying it. Previously, people had added export capabilities to AOTAutograd by having an export flag that toggled what exactly the functions return and triggering aot_dispatch to go to a different "export" implementation, but I've found this difficult to understand and has lead to a bit of duplicate code for the export path. So the idea here is to reorganize the structure of the function calls in AOTAutograd. Here, it is helpful to first describe how things used to work: * Start with aot_autograd.py top level functions like aot_function, _aot_export_function and aot_module_simplified. These call: * create_aot_dispatcher_function. This does a bunch of stuff (forward metadata collection) and adds many context managers. This calls: * One of aot_dispatch_base, aot_dispatch_export or aot_dispatch_autograd, which: * Call aot_dispatch_autograd_graph or aot_dispatch_base_graph to actually do the graph capture * Do some base/export/autograd specific post-processing on the graph Notice the pattern of nested function invocations means that there is no way to easily get the graph capture result from the autograd case; furthermore, the export path is "bolted" on to force the entire chain of functions to have a different return result than normal, and no way to *resume* the rest of the post-processing to actually get a callable. Here is the new structure: * Start with aot_autograd.py top level functions like aot_function, _aot_export_function and aot_module_simplified. These now orchestrate this top level flow: * Start a context manager (stack); this stateful context block takes care of all of the nested context managers which originally necessitated the nested call structure * Call create_aot_state to do initial setup and setup all the context managers on stack. These context managers do NOT exit upon return of this. * Call aot_stage1_graph_capture to do the graph capture * Call aot_stage2_compile or aot_stage2_export depending on what postprocessing you want With this new structure, it's now possible (although not done in this PR) to return the graph after aot_stage1_graph_capture and do something with it, before running aot_stage2_compile to finish the job. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #158213 Approved by: https://github.com/jamesjwu ghstack dependencies: #158149, #158150, #158173, #158176

…nd functions to frontend_utils (#158251) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #158251 Approved by: https://github.com/jamesjwu ghstack dependencies: #158149, #158150, #158173, #158176, #158213

Also a small amount of extra code cleanup. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #158319 Approved by: https://github.com/jingsh ghstack dependencies: #158149, #158150, #158173, #158176, #158213, #158251

Signed-off-by: Edward Z. Yang <ezyang@meta.com> ghstack-source-id: 3fbda2b Pull-Request: pytorch/pytorch#158150

Update

7668cf6

[ghstack-poisoned]

ezyang requested a review from bdhirsh as a code owner July 11, 2025 21:19

ezyang mentioned this pull request Jul 11, 2025

Avoid AOTAutogradCache.load in stack trace on cache miss path #158149

Closed

pytorch-bot bot added ciflow/inductor release notes: AO frontend labels Jul 11, 2025

github-actions bot requested review from SherlockNoMad, albanD, antoniojkim and miladm July 11, 2025 21:19

Update

fc4b4f5

[ghstack-poisoned]

This was referenced Jul 12, 2025

Pipeline _create_aot_dispatcher_function #158173

Closed

Hoist choose_dispatcher to top level, remove unnecessary returns #158176

Closed

albanD removed their request for review July 12, 2025 06:43

ezyang mentioned this pull request Jul 14, 2025

Introduce stages to aot_dispatch #158213

Closed

ezyang added the topic: not user facing topic category label Jul 14, 2025

ezyang requested review from jamesjwu and zhxchen17 July 14, 2025 14:33

jamesjwu approved these changes Jul 14, 2025

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 14, 2025

pytorchmergebot added the merging label Jul 14, 2025

ezyang mentioned this pull request Jul 14, 2025

Move functions from torch._functorch.aot_autograd that are not frontend functions to frontend_utils #158251

Closed

pytorchmergebot removed the merging label Jul 14, 2025

wconstab approved these changes Jul 14, 2025

View reviewed changes

ezyang mentioned this pull request Jul 15, 2025

Extract out prepare_aot_module_simplified for use in next PR #158319

Closed

Update

35389cc

[ghstack-poisoned]

pytorch-bot bot added the module: dynamo label Jul 15, 2025

pytorchmergebot added the merging label Jul 15, 2025

ezyang mentioned this pull request Jul 15, 2025

Add aot_export_joint_opaque and aot_compile_joint_opaque #158363

Closed

pytorchmergebot added the Merged label Jul 15, 2025

pytorchmergebot closed this in 7afb834 Jul 15, 2025

pytorchmergebot removed the merging label Jul 15, 2025

github-actions bot deleted the gh/ezyang/3098/head branch August 15, 2025 02:20

Khanaksahu pushed a commit to Khanaksahu/pytorch that referenced this pull request Nov 17, 2025

Inline dispatch_and_compile into its call site.

cac5880

Signed-off-by: Edward Z. Yang <ezyang@meta.com> ghstack-source-id: 3fbda2b Pull-Request: pytorch/pytorch#158150

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inline dispatch_and_compile into its call site.#158150

Inline dispatch_and_compile into its call site.#158150
ezyang wants to merge 3 commits intogh/ezyang/3098/basefrom
gh/ezyang/3098/head

ezyang commented Jul 11, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 11, 2025 •

edited

Loading

Uh oh!

ezyang commented Jul 14, 2025

Uh oh!

pytorchmergebot commented Jul 14, 2025

Uh oh!

pytorchmergebot commented Jul 14, 2025

Uh oh!

pytorchmergebot commented Jul 14, 2025

Uh oh!

ezyang commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ezyang commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158150

❌ 1 New Failure, 1 Unrelated Failure

Uh oh!

ezyang commented Jul 14, 2025

Uh oh!

pytorchmergebot commented Jul 14, 2025

Merge started

Uh oh!

pytorchmergebot commented Jul 14, 2025

Uh oh!

pytorchmergebot commented Jul 14, 2025

Merge failed

Uh oh!

ezyang commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Merge started

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

pytorchmergebot commented Jul 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ezyang commented Jul 11, 2025 •

edited

Loading

pytorch-bot bot commented Jul 11, 2025 •

edited

Loading