[DTensor] single-dim foreach strategy by pianpwk · Pull Request #170631 · pytorch/pytorch

pianpwk · 2025-12-17T00:51:24Z

Stack from ghstack (oldest at bottom):

-> [DTensor] single-dim foreach strategy #170631

[ghstack-poisoned]

pytorch-bot · 2025-12-17T00:51:28Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/170631

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit b2ec35b with merge base 61622da:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

inductor / unit-test / inductor-pallas-cpu-test / test (inductor-pallas-cpu, 1, 1, linux.12xlarge) (gh) (similar failure)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 55e0aff Pull-Request: #170631

[ghstack-poisoned]

ghstack-source-id: a22949b Pull-Request: #170631

wconstab · 2025-12-17T03:08:49Z

torch/distributed/tensor/_ops/single_dim_strategy.py

+        return OpSchema(
+            target_op,  # type: ignore[arg-type]
+            args_schema=tuple(target_args),
+            kwargs_schema=op_schema.kwargs_schema,


note: I don't know if there are any cases where kwargs_schema could contain a TupleStrategy; if there are, it would also need to be translated.

just changed to tree_map instead
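
A rough sketch of the idea behind the tree_map change, for readers following the thread: rather than translating only the positional args, every leaf in both args and kwargs is visited, so a list-valued strategy nested anywhere is replaced by its element at the chosen index. Everything named here (`ToyTupleStrategy`, `toy_tree_map`, `translate_for_index`) is a hypothetical stand-in written for illustration. It is not the DTensor `TupleStrategy`, not PyTorch's pytree utility, and not the helper in this PR.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class ToyTupleStrategy:
    """Hypothetical stand-in for a strategy holding one child per list element."""
    children: tuple


def toy_tree_map(fn: Callable[[Any], Any], tree: Any) -> Any:
    """Apply fn to every leaf of nested tuples/lists/dicts (simplified tree_map)."""
    if isinstance(tree, dict):
        return {k: toy_tree_map(fn, v) for k, v in tree.items()}
    if isinstance(tree, (list, tuple)):
        return type(tree)(toy_tree_map(fn, v) for v in tree)
    return fn(tree)


def translate_for_index(args: tuple, kwargs: dict, index: int):
    """Replace each ToyTupleStrategy leaf with its child at `index`."""
    def pick(leaf: Any) -> Any:
        if isinstance(leaf, ToyTupleStrategy):
            return leaf.children[index]
        return leaf  # scalars and non-list strategies pass through unchanged
    return toy_tree_map(pick, args), toy_tree_map(pick, kwargs)


# Two list-valued positional inputs plus a list-valued kwarg.
args = (ToyTupleStrategy(("a0", "a1")), ToyTupleStrategy(("b0", "b1")))
kwargs = {"alpha": 2.0, "other": ToyTupleStrategy(("c0", "c1"))}
new_args, new_kwargs = translate_for_index(args, kwargs, 1)
```

In this toy version the kwarg `other` is translated by the same pass as the positional inputs, which is the case the note above was worried about.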

wconstab · 2025-12-17T03:09:44Z

torch/distributed/tensor/_ops/single_dim_strategy.py

+        child_strategies: list[StrategyType] = []
+        for tensorlist_i in range(tensorlist_len):
+            per_index_schema = _translate_foreach_op_schema(op_schema, tensorlist_i)
+            per_index_strategy = _expanded_strategy(per_index_schema)


Wondering if it is worth adding an LRU cache to this helper. It seems like a very common case to have hundreds or thousands of tensors in the foreach list, with all of them sharing the same placements or only a handful of placement patterns.

ah, I tried this but it seems like it needs more work; the OpSchema._comparison_key that's used for caching doesn't include placements, so out of the box it over-caches and returns incorrect results

I added a todo there

wconstab

this looks pretty good to me! thanks!

i think we can land it asap, i will work on landing the earlier PRs in the stack

[ghstack-poisoned]

ghstack-source-id: 83854c3 Pull-Request: #170631

wconstab · 2025-12-19T22:44:42Z

torch/distributed/tensor/_ops/single_dim_strategy.py

+
+    # TODO maybe this could be helped by adding a new 'tag' to the OpOverload?
+    # Also, i'm guessing that i'll need more info from the registration callsite
+    # about which inputs are expected to be lists vs tensors. But maybe I can just infer it all from the runtime


i guess you already handled this part (Also..) so you can delete it

wconstab

lgtm! i think we can land it and continue to iterate on the todos.

up to you whether to land my pointwise PR too, which I've rebased recently and made safe to land, or just copy its contents into your test for now

This PR adds a dummy version of a single-dim pointwise strategy. It shouldn't be used for real yet, but it is being used by unit tests in #170631, so it is useful to land.

Pull Request resolved: #168115
Approved by: https://github.com/pianpwk
Co-authored-by: Pian Pawakapan <pianpwk@meta.com>

[ghstack-poisoned]

ghstack-source-id: 06a7151 Pull-Request: #170631

pianpwk · 2025-12-22T12:46:05Z

@pytorchbot merge

pytorchmergebot · 2025-12-22T12:49:09Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

Pull Request resolved: pytorch#170631 Approved by: https://github.com/wconstab

Update

e93bd4a

[ghstack-poisoned]

pytorch-bot bot added the ciflow/inductor label Dec 17, 2025

pianpwk mentioned this pull request Dec 17, 2025

[DTensor] Single Dim Strategy infra #167677

Closed

pianpwk added a commit that referenced this pull request Dec 17, 2025

foreach single-dim strategy

242e0d0

ghstack-source-id: 55e0aff Pull-Request: #170631

This was referenced Dec 17, 2025

[DTensor] Add single-dim registration infra #170359

Closed

[DTensor] Add Dijkstra-based single-dim strategy search #169438

Closed

[DTensor] single-dim pointwise strategy #168115

Closed

pianpwk marked this pull request as draft December 17, 2025 00:53

Update

8c9d3e5

[ghstack-poisoned]

pianpwk added a commit that referenced this pull request Dec 17, 2025

foreach single-dim strategy

0f25232

ghstack-source-id: a22949b Pull-Request: #170631

wconstab reviewed Dec 17, 2025

Update

e35a81c

[ghstack-poisoned]

pianpwk added a commit that referenced this pull request Dec 18, 2025

foreach single-dim strategy

ae7dc62

ghstack-source-id: 83854c3 Pull-Request: #170631

pianpwk changed the title ~~foreach single-dim strategy~~ [DTensor] single-dim foreach strategy Dec 18, 2025

pianpwk added the release notes: distributed (dtensor) release notes category label Dec 18, 2025

pianpwk marked this pull request as ready for review December 18, 2025 19:15

wconstab reviewed Dec 19, 2025

wconstab approved these changes Dec 19, 2025

Update

b2ec35b

[ghstack-poisoned]

pianpwk added a commit that referenced this pull request Dec 21, 2025

foreach single-dim strategy

16382f4

ghstack-source-id: 06a7151 Pull-Request: #170631

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 22, 2025

pytorchmergebot added the merging label Dec 22, 2025

pytorchmergebot added the Merged label Dec 22, 2025

pytorchmergebot closed this in ecf79f3 Dec 22, 2025

pytorchmergebot removed the merging label Dec 22, 2025

pianpwk mentioned this pull request Jan 7, 2026

[DTensor] LRU cachable OpStrategy #171223

Closed

krastogi-in pushed a commit to krastogi-in/pytorch that referenced this pull request Jan 9, 2026

[DTensor] single-dim foreach strategy (pytorch#170631)

af1a233

Pull Request resolved: pytorch#170631 Approved by: https://github.com/wconstab

github-actions bot deleted the gh/pianpwk/46/head branch January 22, 2026 02:20

[DTensor] single-dim foreach strategy #170631

pianpwk wants to merge 4 commits into gh/pianpwk/46/base from gh/pianpwk/46/head

3 participants
