add and fix OpInfo tests for the default partitioner #165372

Closed

bdhirsh wants to merge 6 commits into gh/bdhirsh/674/base from gh/bdhirsh/674/head

Conversation


@bdhirsh bdhirsh commented Oct 13, 2025

I noticed the default partitioner was breaking in some dynamic shape tests, so before turning off functionalization I want to tweak it to pass all of our OpInfo tests.

Stack from ghstack (oldest at bottom):


pytorch-bot bot commented Oct 13, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/165372

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 90343a5 with merge base e787d53:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@albanD albanD removed their request for review October 14, 2025 14:56
elif (
    "tensor_meta" not in node.meta
    and node.op == "call_function"
    and not isinstance(node.meta.get("val"), torch._subclasses.FakeTensor)
):
Contributor

why not just test Tensor?
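To make the condition under review concrete, here is a torch-free sketch of the guard. `Node`, `FakeTensor`, and the `needs_tensor_meta` name are stand-in mocks for illustration, not the actual partitioner code; the real check inspects `torch.fx` nodes and `torch._subclasses.FakeTensor`.

```python
class FakeTensor:  # stand-in for torch._subclasses.FakeTensor
    pass


class Node:  # stand-in for a torch.fx.Node
    def __init__(self, op, meta):
        self.op = op
        self.meta = meta


def needs_tensor_meta(node):
    # Mirrors the elif condition above: a call_function node with no
    # cached "tensor_meta" whose "val" entry is not a FakeTensor.
    return (
        "tensor_meta" not in node.meta
        and node.op == "call_function"
        and not isinstance(node.meta.get("val"), FakeTensor)
    )


missing = Node("call_function", {"val": object()})
cached = Node("call_function", {"tensor_meta": object(), "val": FakeTensor()})
placeholder = Node("placeholder", {})

print(needs_tensor_meta(missing))      # True
print(needs_tensor_meta(cached))       # False
print(needs_tensor_meta(placeholder))  # False
```

The reviewer's question is about the third clause: testing for `FakeTensor` specifically, rather than for `Tensor` in general.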


bdhirsh commented Oct 14, 2025

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 14, 2025
@pytorchmergebot

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here.

zhudada0120 pushed a commit to zhudada0120/pytorch that referenced this pull request Oct 15, 2025
I noticed the default partitioner was breaking in some dynamic shape tests, so prior to turning off functionalization I want to tweak it to pass all of our OpInfo tests

Pull Request resolved: pytorch#165372
Approved by: https://github.com/ezyang
ghstack dependencies: pytorch#165327

malfet commented Oct 15, 2025

@pytorchbot revert -m "Looks like it broke slow jobs, see https://hud.pytorch.org/hud/pytorch/pytorch/331b7cc054415210ec73f4e7e4571f8a0c21ed62/1?per_page=50&name_filter=slow&mergeEphemeralLF=true" -c nosignal

@pytorchmergebot

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

@pytorchmergebot

@bdhirsh your PR has been successfully reverted.

@pytorchmergebot pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Oct 15, 2025
before:
    decorator=toleranceOverride({torch.float32: tol(atol=1e-05, rtol=1e-05)}),
after:
    # This delta is coming entirely from the clone() on tangents
    # in AOTDispatcher to make them contiguous
    decorator=toleranceOverride({torch.float32: tol(atol=4e-05, rtol=1e-05)}),
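A minimal sketch of why bumping `atol` from 1e-05 to 4e-05 matters, using the allclose-style bound `|a - b| <= atol + rtol * |b|` (the same shape of check `torch.allclose` documents). The drift value here is hypothetical, chosen only to fall between the two bounds.

```python
def within_tol(actual, expected, atol, rtol):
    # allclose-style elementwise tolerance check
    return abs(actual - expected) <= atol + rtol * abs(expected)


eager = 0.5
aot_eager = eager + 2.5e-05  # hypothetical drift from the contiguous clone()

print(within_tol(aot_eager, eager, atol=1e-05, rtol=1e-05))  # False: old bound
print(within_tol(aot_eager, eager, atol=4e-05, rtol=1e-05))  # True: new bound
```

With the old settings the allowed error is 1e-05 + 1e-05 * 0.5 = 1.5e-05, so a 2.5e-05 drift fails; the new `atol` raises the bound to 4.5e-05 and the sample passes.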
Collaborator Author

fyi the slow-test failure that caused the revert was interesting:

(1) this test wobbled tolerance a bit:

 PYTORCH_OPINFO_SAMPLE_INPUT_INDEX=32 PYTORCH_TEST_WITH_SLOW=1 PYTORCH_TEST_SKIP_FAST=1 python test/functorch/test_aotdispatch.py TestEagerFusionOpInfoCPU.test_aot_autograd_symbolic_default_partition_exhaustive_linalg_pinv_singular_cpu_float32

(2) I used @SherlockNoMad's handy DebugMode to find the kernels that differ between eager and aot_eager, and I found the difference comes entirely from AOTDispatcher emitting a clone() on tangents in the backward (presumably this causes us to call linalg.pinv's backward with different striding, and the op's numerics are sensitive to strides).

We're going to have to deal with this in the "bitwise equality" workstream. I don't want to deal with it in this PR, but we might want to consider not running the clone at all and instead raising an error if we got the strides of our tangents wrong, i.e. requiring the user to tell us what striding they want for the tangents.
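The stride effect described above can be sketched with numpy: `np.ascontiguousarray` plays the role of AOTDispatcher's contiguity-forcing clone(). The values are identical, but the memory layout (strides) changes, which is exactly the kind of difference a stride-sensitive backward kernel can turn into small numeric drift. The variable names are illustrative, not from the PR.

```python
import numpy as np

# A transposed view is non-contiguous: same data, permuted strides.
tangent = np.arange(12.0).reshape(3, 4).T

# Analogue of clone(memory_format=torch.contiguous_format) on a tangent.
contiguous = np.ascontiguousarray(tangent)

print(np.array_equal(tangent, contiguous))    # True: identical values
print(tangent.strides == contiguous.strides)  # False: different layout
print(tangent.flags["C_CONTIGUOUS"], contiguous.flags["C_CONTIGUOUS"])
```

A kernel that walks memory in stride order may accumulate partial sums in a different order for the two layouts, so results can differ by a few ULPs even though the inputs are equal.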

@pytorchmergebot

Starting merge as part of PR stack under #164577

pytorchmergebot pushed a commit that referenced this pull request Oct 16, 2025
I'm cleaning this PR up as a proper way of disabling functionalization via config in AOTDispatcher. I removed the non-functionalization-related changes from the original version:

(1) preventing proxy mode (and functionalization) from incorrectly decomposing CIA ops (Ed has a PR for it here: #164939)

(2) preventing python-dispatcher-based decomps above autograd from running. I'm not doing this for now, will likely do it in a followup

Pull Request resolved: #164577
Approved by: https://github.com/ezyang
ghstack dependencies: #165372
Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Oct 21, 2025
Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Oct 21, 2025
Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Oct 21, 2025
zhudada0120 pushed a commit to zhudada0120/pytorch that referenced this pull request Oct 22, 2025
zhudada0120 pushed a commit to zhudada0120/pytorch that referenced this pull request Oct 22, 2025
@github-actions github-actions bot deleted the gh/bdhirsh/674/head branch November 16, 2025 02:21

Labels

ci-no-td (Do not run TD on this PR), ciflow/inductor, ciflow/slow, ciflow/trunk (Trigger trunk jobs on your pull request), Merged, release notes: composability (release notes category), Reverted

4 participants