Add cross-process AOT autograd cache hit test #177397
frgossen wants to merge 5 commits into gh/frgossen/9/base
Conversation
Test that the AOT autograd cache persists across separate processes by running two subprocesses with a shared cache directory and verifying the second one gets a cache hit. [ghstack-poisoned]
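The pattern described above (two subprocesses sharing a cache directory, with the second expected to hit) can be sketched in miniature. This is an illustrative, self-contained model of the cross-process caching pattern, not PyTorch's actual test harness; the child script, the `CACHE_DIR` environment variable, and the `cache_hit`/`cache_miss` markers are all hypothetical stand-ins for the real AOT autograd cache and its counters.

```python
import os
import subprocess
import sys
import tempfile

# Hypothetical child script standing in for a torch.compile run: it
# "compiles" once and caches the result in a directory taken from the
# environment, reporting whether the cache was already populated.
CHILD = r"""
import os
cache_file = os.path.join(os.environ["CACHE_DIR"], "entry.txt")
if os.path.exists(cache_file):
    print("cache_hit")
else:
    with open(cache_file, "w") as f:
        f.write("result")
    print("cache_miss")
"""

def run_child(cache_dir):
    # Each invocation is a fresh process; only the cache directory is shared.
    env = dict(os.environ, CACHE_DIR=cache_dir)
    out = subprocess.run(
        [sys.executable, "-c", CHILD],
        env=env, capture_output=True, text=True, check=True,
    )
    return out.stdout.strip()

with tempfile.TemporaryDirectory() as cache_dir:
    first = run_child(cache_dir)   # fresh cache directory: expect a miss
    second = run_child(cache_dir)  # same directory, new process: expect a hit

print(first, second)  # cache_miss cache_hit
```

The key point the test exercises is that the hit is observed in a different process than the one that populated the cache, so nothing can be satisfied from in-memory state.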
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/177397
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure as of commit 7a1e4f8 with merge base 6a461fe. The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Test that the AOT autograd cache persists across separate processes by running two subprocesses with a shared cache directory and verifying the second one gets a cache hit. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx kadeng chauhang amjames Lucaskabela jataylo [ghstack-poisoned]
Adding @zhxchen17 as well since he does quite a bit of work on AOT Autograd caching
Starting merge as part of PR stack under #177428
@pytorchbot merge -i
Merge started. Your change will be merged while ignoring the following 1 check: pull / linux-jammy-cpu-py3.10-gcc11-bazel-test / build-and-test (default, 1, 1, lf.linux.4xlarge). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
pre_grad_custom_pass was the only custom pass config without UUID-based cache key integration. It was excluded from config serialization but not handled specially via UUID extraction, so its effect was captured only indirectly through the resulting FX graph. This meant two different passes producing the same graph could incorrectly share a cache entry.

Align pre_grad_custom_pass with the post-grad and joint passes: change its type to CustomGraphPassType, add it to _cache_config_ignore_prefix (so the UUID is extracted explicitly), include it in FxGraphHashDetails, and validate it in _check_can_cache.

Pull Request resolved: #177403
Approved by: https://github.com/aorenste, https://github.com/zou3519
ghstack dependencies: #177397
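The reasoning behind folding the pass UUID into the cache key can be shown with a minimal sketch. This is a hedged illustration of the idea, not inductor's actual hashing code: `CustomPass`, `cache_key`, and the SHA-256 scheme here are hypothetical stand-ins for the real CustomGraphPass protocol and FxGraphHashDetails.

```python
import hashlib

class CustomPass:
    """Hypothetical stand-in for a custom graph pass with a stable UUID."""
    def __init__(self, uuid: str):
        self._uuid = uuid

    def uuid(self) -> str:
        # A stable identifier for the pass's behavior; changing the pass
        # implementation should change this value.
        return self._uuid

def cache_key(graph_src: str, pre_grad_custom_pass=None) -> str:
    # Hash the (pre-pass) graph source.
    h = hashlib.sha256(graph_src.encode())
    if pre_grad_custom_pass is not None:
        # Fold the pass UUID into the key so two different passes that
        # happen to produce the same graph do not share a cache entry.
        h.update(pre_grad_custom_pass.uuid().encode())
    return h.hexdigest()

same_graph = "def forward(x): return x + 1"
key_a = cache_key(same_graph, CustomPass("pass-a"))
key_b = cache_key(same_graph, CustomPass("pass-b"))
```

With only the graph in the key, `key_a` and `key_b` would collide; with the UUID mixed in, each pass gets its own entry, which is exactly the failure mode the PR closes for pre_grad_custom_pass.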
Pull Request resolved: #177428
Approved by: https://github.com/mlazos, https://github.com/zou3519
ghstack dependencies: #177397, #177403
…77429) Add a pre_grad_pass_timing config ("early", "late", or "default") that controls when pre-grad passes run relative to the AOT autograd cache lookup.

- "early": passes run before cache lookup, so they execute on every compile (including cache hits) and the cache key reflects the already-transformed graph.
- "late": passes run after cache lookup (only on a cache miss); requires custom passes to provide a UUID for the cache key.
- "default": automatically resolves to "late" when possible (no custom pass, or a custom pass with a UUID), and falls back to "early" when the custom pass has no UUID.

Explicitly setting "late" with a UUID-less custom pass now raises a RuntimeError instead of silently bypassing the cache. The existing test_pre_grad_passes_called_on_cache_miss_only test is renamed and pinned to "late" timing, and new tests cover early timing, both default timing branches, and the error case.

Pull Request resolved: #177429
Approved by: https://github.com/aorenste, https://github.com/zou3519
ghstack dependencies: #177397, #177403, #177428
Stack from ghstack (oldest at bottom):
Test that the AOT autograd cache persists across separate processes by
running two subprocesses with a shared cache directory and verifying the
second one gets a cache hit.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @kadeng @chauhang @amjames @Lucaskabela @jataylo