Add UUID-based cache key support for pre-grad custom passes#177403

Closed
frgossen wants to merge 5 commits into gh/frgossen/10/base from gh/frgossen/10/head

Conversation

@frgossen (Contributor)

@frgossen frgossen commented Mar 13, 2026

Stack from ghstack (oldest at bottom):

pre_grad_custom_pass was the only custom pass config without UUID-based
cache key integration. It was excluded from config serialization but not
handled specially via UUID extraction, so its effect was only captured
indirectly through the resulting FX graph. This meant two different
passes producing the same graph could incorrectly share a cache entry.

Align pre_grad_custom_pass with post-grad and joint passes: change its
type to CustomGraphPassType, add it to _cache_config_ignore_prefix (so
the UUID is extracted explicitly), include it in FxGraphHashDetails, and
validate it in _check_can_cache.
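To participate in this scheme, a custom pass must expose a stable UUID that changes whenever its behavior changes. Below is a minimal pure-Python sketch of the pattern: a class mimicking the `CustomGraphPass` protocol (the real base class lives in `torch._inductor.custom_graph_pass`), plus a hypothetical `cache_key_details` helper illustrating how a hash-details structure like `FxGraphHashDetails` can fold the UUID into the cache key rather than relying on the transformed graph alone. The class and helper names here are illustrative, not the actual torch internals.

```python
import hashlib


class MyPreGradPass:
    """Illustrative stand-in for a CustomGraphPass subclass.

    The real protocol requires __call__(graph) to mutate the FX
    graph in place and uuid() to return a stable identifier.
    """

    def __call__(self, graph):
        # ... mutate the FX graph in place ...
        pass

    def uuid(self):
        # Stable key: hash of a version string the author bumps
        # whenever the pass's behavior changes (hypothetical scheme;
        # hashing the pass's source file is another common choice).
        return hashlib.sha256(b"my_pre_grad_pass-v1").hexdigest()


def cache_key_details(config_hash: str, pre_grad_pass) -> tuple:
    # Sketch: fold the pass UUID into the cache key explicitly, so
    # two different passes cannot share an entry even if they happen
    # to produce the same graph. None means no custom pass is set.
    pass_uuid = pre_grad_pass.uuid() if pre_grad_pass is not None else None
    return (config_hash, pass_uuid)
```

With this shape, a UUID-less pass (`uuid()` returning `None`) is what `_check_can_cache` would reject, since its effect could not be distinguished in the key.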

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @jataylo @Lucaskabela

@pytorch-bot

pytorch-bot bot commented Mar 13, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/177403

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 6361bb2 with merge base 6a461fe:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

frgossen added a commit that referenced this pull request Mar 13, 2026
ghstack-source-id: 09ddb53
Pull Request resolved: #177403
@pytorch-bot

pytorch-bot bot commented Mar 13, 2026

This PR needs a release notes: label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@pytorchmergebot (Collaborator)

Starting merge as part of PR stack under #177428

pytorchmergebot pushed a commit that referenced this pull request Mar 18, 2026
…77429)

Add a pre_grad_pass_timing config ("early", "late", or "default") that
controls when pre-grad passes run relative to the AOT autograd cache lookup.

- "early": passes run before cache lookup, so they execute on every compile
(including cache hits) and the cache key reflects the already-transformed
graph.
- "late": passes run after cache lookup (only on cache miss); requires
custom passes to provide a UUID for the cache key.
- "default": automatically resolves to "late" when possible (no custom pass,
or a custom pass with a UUID), and falls back to "early" when the custom
pass has no UUID.

Explicitly setting "late" with a UUID-less custom pass now raises a
RuntimeError instead of silently bypassing the cache. The existing
test_pre_grad_passes_called_on_cache_miss_only test is renamed and
pinned to "late" timing, and new tests cover early timing, both default
timing branches, and the error case.
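The resolution rules above can be sketched as a small function. This is an illustrative reconstruction of the logic described in the commit message, not the actual torch internals; the function name and the shape of the custom-pass argument are assumptions.

```python
def resolve_pre_grad_pass_timing(timing: str, custom_pass) -> str:
    """Sketch of how pre_grad_pass_timing resolves (illustrative).

    timing      -- "early", "late", or "default"
    custom_pass -- object with a uuid() method, or None
    """
    has_uuid = custom_pass is not None and custom_pass.uuid() is not None

    if timing == "late":
        # Explicit "late" with a UUID-less custom pass is an error
        # rather than a silent cache bypass.
        if custom_pass is not None and not has_uuid:
            raise RuntimeError(
                "pre_grad_pass_timing='late' requires the custom pass "
                "to provide a UUID for the cache key"
            )
        return "late"

    if timing == "early":
        return "early"

    # "default": prefer "late" when it is safe (no custom pass, or a
    # custom pass with a UUID); otherwise fall back to "early".
    return "late" if (custom_pass is None or has_uuid) else "early"
```

Under "early" timing the passes run before the cache lookup on every compile, so the key already reflects the transformed graph and no UUID is needed; "late" skips the passes entirely on a cache hit, which is why the UUID must stand in for them in the key.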

Pull Request resolved: #177429
Approved by: https://github.com/aorenste, https://github.com/zou3519
ghstack dependencies: #177397, #177403, #177428
ryanzhang22 pushed a commit to ryanzhang22/pytorch that referenced this pull request Mar 19, 2026
…177403)

Pull Request resolved: pytorch#177403
Approved by: https://github.com/aorenste, https://github.com/zou3519
ghstack dependencies: pytorch#177397
ryanzhang22 pushed a commit to ryanzhang22/pytorch that referenced this pull request Mar 19, 2026
…torch#177429)

Pull Request resolved: pytorch#177429
Approved by: https://github.com/aorenste, https://github.com/zou3519
ghstack dependencies: pytorch#177397, pytorch#177403, pytorch#177428