[invoke_subgraph] Don't run the graph twice when autograd enabled #167245
angelayi wants to merge 9 commits into gh/angelayi/133/base
Conversation
In the [previous PR](https://github.com/pytorch/pytorch/pull/167231/files#diff-e2b74af5d8b538a7d07d18507d27010703742ddad5f819992b55f5abc6d9a502R964-R966) we found that the eager autograd implementation of invoke_subgraph calls the subgraph twice. If the subgraph contains effects, those effects run twice, which is incorrect. This PR fixes the issue by reading the output metadata from `subgraph`'s `node.meta` when it exists, instead of running the subgraph an extra time.
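The idea behind the fix can be sketched without PyTorch. This is a minimal illustration, not the actual implementation: `FakeNode`, `invoke_with_autograd`, and the `"val"` key layout are stand-ins (FX nodes do cache fake-tensor output values under `node.meta["val"]`, but the real code paths differ).

```python
# Hedged sketch of the fix's idea, using plain Python stand-ins.
# Before the fix, the autograd wrapper ran `subgraph` an extra time just to
# discover the outputs' metadata. The fix reads that metadata from the FX
# node's `meta["val"]` when available, so the subgraph runs only once.

class FakeNode:
    """Stand-in for an FX node carrying cached output metadata."""
    def __init__(self, meta):
        self.meta = meta

run_count = 0

def subgraph(x):
    global run_count
    run_count += 1          # a side effect: running twice would repeat it
    return x * 2

def invoke_with_autograd(subgraph, node, x):
    # Prefer cached output metadata over an extra trial run.
    if node is not None and "val" in node.meta:
        out_meta = node.meta["val"]       # reuse cached metadata
    else:
        out_meta = subgraph(x)            # fallback: one extra run
    result = subgraph(x)                  # the single "real" run
    return result, out_meta

node = FakeNode(meta={"val": "cached-output-metadata"})
result, meta = invoke_with_autograd(subgraph, node, 3)
print(run_count)  # the subgraph ran exactly once
```

With the metadata cached on the node, any effects inside the subgraph fire once per invocation rather than twice.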
@pytorchbot merge

@pytorchbot merge -f "failure looks unrelated"

Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).
@pytorchbot revert -m "the base pr is broken internal tests in the stack" -c ghfirst

@pytorchbot successfully started a revert job.
Revert "[invoke_subgraph] Don't run the graph twice when autograd enabled (#167245)"

This reverts commit 789240b. Reverted #167245 on behalf of https://github.com/yangw-dev because the base PR broke internal tests in the stack.
@angelayi your PR has been successfully reverted.
Differential Revision: [D87392740](https://our.internmc.facebook.com/intern/diff/D87392740)
@angelayi has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Starting merge as part of PR stack under #167363
Updates the implementation of `unlift_tokens` to handle unlifting invoke_subgraph.
For context on `unlift_tokens`: currently, tokens are threaded as inputs and outputs of the top-level graph produced by AOTAutograd. However, we don't want the Inductor-traced graph to have any notion of effects/tokens, only the extra dependency behavior that tokens introduce. So we unlift the tokens from the top-level graph: instead of placeholder nodes, the tokens come from a `_make_token` call, and instead of being returned as outputs, all tokens are sunk into `_sink_tokens`.
Similarly, we don't want the invoke_subgraph subgraph to have any notion of tokens, so we also remove the tokens from its inputs. However, we still need a way to mark the invoke_subgraph call as effectful at the top-level module, to prevent invoke_subgraph calls from being reordered, so the invoke_subgraph is wrapped with `with_effects`.
Before:
```
def forward(self, token, x):
    repeated_subgraph0 = self.repeated_subgraph0
    invoke_subgraph = torch.ops.higher_order.invoke_subgraph(repeated_subgraph0, 'subgraph_0', token, x)
    getitem = invoke_subgraph[0]  # output token
    getitem_1 = invoke_subgraph[1]
    return (getitem, getitem_1)

def repeated_subgraph(self, token, x):
    with_effects = torch.ops.higher_order.with_effects(token, torch.ops.mylib.record_memory.default, 'forward', 'N')
    getitem = with_effects[0]  # output token
    add = torch.ops.aten.add(x, x)
    return (getitem, add)
```
After:
```
def forward(self, x):
    token = torch.ops.prims._make_token.default()
    repeated_subgraph0 = self.repeated_subgraph0
    invoke_subgraph = torch.ops.higher_order.with_effects(
        token, torch.ops.higher_order.invoke_subgraph, repeated_subgraph0, 'subgraph_0', x
    )
    getitem = invoke_subgraph[0]  # output token
    getitem_1 = invoke_subgraph[1]
    _ = torch.ops.prims._sink_tokens.default([getitem])
    return (getitem_1,)

def repeated_subgraph(self, x):
    token = torch.ops.prims._make_token.default()
    with_effects = torch.ops.higher_order.with_effects(token, torch.ops.mylib.record_memory.default, 'forward', 'N')
    getitem = with_effects[0]  # output token
    add = torch.ops.aten.add(x, x)
    _ = torch.ops.prims._sink_tokens.default([getitem])
    return (add,)
```
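The transformation above can be sketched as a small standalone pass. This is an illustrative model only, not the real `unlift_tokens` implementation: the graph is modeled as plain input/output name lists, and the `_make_token` / `_sink_tokens` names simply echo the example above.

```python
# Hedged sketch of the unlifting idea on a toy graph representation:
# a graph is just (input names, output names). Token placeholders are
# dropped from the inputs and replaced by a `_make_token` call in a
# prologue; token outputs are routed into `_sink_tokens` in an epilogue
# instead of being returned to the caller.

def unlift_tokens(inputs, outputs, token_names):
    # Drop token placeholders from the graph inputs...
    new_inputs = [i for i in inputs if i not in token_names]
    # ...and materialize each token with a call at the start of the body.
    prologue = [("_make_token", name) for name in token_names]
    # Token outputs are sunk rather than returned.
    sunk = [o for o in outputs if o in token_names]
    new_outputs = [o for o in outputs if o not in token_names]
    epilogue = [("_sink_tokens", sunk)] if sunk else []
    return new_inputs, prologue, new_outputs, epilogue

# Mirrors the Before -> After example: forward(token, x) -> forward(x).
new_in, pro, new_out, epi = unlift_tokens(
    inputs=["token", "x"], outputs=["token", "add"], token_names=["token"]
)
```

Applied to both the top-level graph and the subgraph, this leaves no token plumbing in the signatures, while the `with_effects` wrapper at the call site preserves the ordering constraint.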
Differential Revision: [D87668981](https://our.internmc.facebook.com/intern/diff/D87668981)
Pull Request resolved: #167363
Approved by: https://github.com/fxdawnn
ghstack dependencies: #167231, #167245
[invoke_subgraph] Don't run the graph twice when autograd enabled (#167245)

Differential Revision: [D87392740](https://our.internmc.facebook.com/intern/diff/D87392740)
Pull Request resolved: #167245
Approved by: https://github.com/anijain2305
ghstack dependencies: #167231