[LocalTensor] Cache DeviceMesh.get_coordinate results in LocalTensorMode #173836
wconstab wants to merge 7 commits into gh/wconstab/511/base
Conversation
The get_coordinate method was being called repeatedly with the same DeviceMesh during operations like DTensor.from_local, recomputing the same coordinate mapping each time. This PR adds a per-mode cache keyed by mesh id to avoid the redundant computation. In profiling of sharding rule validation, get_coordinate accounted for ~86% of from_local call time; with caching, from_local latency dropped from 4.55 ms to 0.76 ms (an 83% reduction). Authored with Claude.
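A minimal sketch of the caching idea described above, using stand-in classes rather than the real LocalTensorMode and DeviceMesh (FakeMesh, LocalTensorModeSketch, and _compute_coordinate are hypothetical names for illustration): the coordinate is computed once per mesh and then served from a dict owned by the mode, so the cache's lifetime is tied to the mode itself.

```python
class FakeMesh:
    """Stand-in for DeviceMesh; computing the coordinate is assumed expensive."""

    def __init__(self, coord):
        self._coord = coord
        self.compute_calls = 0  # count how often the expensive path runs

    def _compute_coordinate(self):
        self.compute_calls += 1  # expensive in the real DeviceMesh
        return self._coord


class LocalTensorModeSketch:
    """Sketch of a mode object holding a per-mode coordinate cache."""

    def __init__(self):
        # Cache keyed by mesh identity; discarded when the mode goes away,
        # so stale entries cannot outlive the mode that created them.
        self._coord_cache = {}

    def get_coordinate(self, mesh):
        key = id(mesh)  # per-mode cache keyed by mesh id
        if key not in self._coord_cache:
            self._coord_cache[key] = mesh._compute_coordinate()
        return self._coord_cache[key]


mode = LocalTensorModeSketch()
mesh = FakeMesh((1, 2))
for _ in range(100):
    mode.get_coordinate(mesh)
print(mesh.compute_calls)  # prints 1: computed once, then cached
```

Keying by the mesh object's identity (rather than its contents) keeps lookups cheap and avoids requiring DeviceMesh to be hashable by value; the trade-off is that the cache must not outlive the meshes it references, which holding it on the mode addresses.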
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/173836
Note: Links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit 1ad1fb9 with merge base 4b0f7fb.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot merge |
Merge failed. Reason: This PR needs a label before it can be merged. To add a label, you can comment to pytorchbot. Raised by workflow job.
@pytorchbot merge |
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
…ode (pytorch#173836) Pull Request resolved: pytorch#173836 Approved by: https://github.com/dzmitry-huba