Skip to content

port distributed tensor test files for Intel GPU#161604

Closed
wincent8 wants to merge 15 commits intopytorch:mainfrom
wincent8:wliao2/add_tensor_2
Closed

port distributed tensor test files for Intel GPU#161604
wincent8 wants to merge 15 commits intopytorch:mainfrom
wincent8:wliao2/add_tensor_2

Conversation

@wincent8
Copy link
Contributor

@wincent8 wincent8 commented Aug 27, 2025

In this pr, we port test/distributed/tensor test filesfor Intel GPU
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @pragupta @tianyu-l @XilunWu

@pytorch-bot pytorch-bot bot added oncall: distributed Add this issue/PR to distributed oncall triage queue topic: not user facing topic category labels Aug 27, 2025
@pytorch-bot
Copy link

pytorch-bot bot commented Aug 27, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161604

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit af92cf8 with merge base 6737e2c (image):

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@daisyden daisyden changed the title port distributed tensor test files for Intel GPU [WIP]port distributed tensor test files for Intel GPU Aug 28, 2025
@wincent8 wincent8 changed the title [WIP]port distributed tensor test files for Intel GPU port distributed tensor test files for Intel GPU Sep 1, 2025
@pytorch-bot
Copy link

pytorch-bot bot commented Sep 2, 2025

To add the ciflow label ciflow/h100-distributed please first approve the workflows that are awaiting approval (scroll to the bottom of this page).

This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows.

@daisyden daisyden added module: dtensor distributed tensor tag release notes: distributed (dtensor) release notes category ciflow/h100-distributed keep-going Don't stop on first failure, keep running tests until the end labels Sep 2, 2025
Copy link
Collaborator

@guangyey guangyey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@guangyey guangyey added the ciflow/xpu Run XPU CI tasks label Sep 2, 2025
@guangyey guangyey requested a review from d4l3k September 2, 2025 03:06
@wincent8 wincent8 force-pushed the wliao2/add_tensor_2 branch from efe6fad to 3315b58 Compare September 2, 2025 05:44
@guangyey guangyey added the ciflow/xpu Run XPU CI tasks label Sep 2, 2025
@pytorch-bot pytorch-bot bot removed the ciflow/xpu Run XPU CI tasks label Sep 2, 2025
@guangyey guangyey added the ciflow/xpu Run XPU CI tasks label Sep 2, 2025
Copy link
Member

@d4l3k d4l3k left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@wincent8
Copy link
Contributor Author

wincent8 commented Sep 3, 2025

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 3, 2025
@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@pytorchmergebot
Copy link
Collaborator

Merge failed

Reason: 1 jobs have failed, first few of them are: xpu / linux-jammy-xpu-n-py3.10 / test (default, 5, 8, linux.idc.xpu)

Details for Dev Infra team Raised by workflow job

@etaf
Copy link
Collaborator

etaf commented Sep 4, 2025

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

pytorchmergebot pushed a commit that referenced this pull request Sep 12, 2025
it's another pr to port distributed tensor test for Intel GPU, while the other pr is #161604
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: #161703
Approved by: https://github.com/guangyey, https://github.com/d4l3k
markc-614 pushed a commit to markc-614/pytorch that referenced this pull request Sep 17, 2025
In this pr, we port test/distributed/tensor test filesfor Intel GPU
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: pytorch#161604
Approved by: https://github.com/guangyey, https://github.com/d4l3k
markc-614 pushed a commit to markc-614/pytorch that referenced this pull request Sep 17, 2025
it's another pr to port distributed tensor test for Intel GPU, while the other pr is pytorch#161604
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: pytorch#161703
Approved by: https://github.com/guangyey, https://github.com/d4l3k
mansiag05 pushed a commit to mansiag05/pytorch that referenced this pull request Sep 22, 2025
In this pr, we port test/distributed/tensor test filesfor Intel GPU
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: pytorch#161604
Approved by: https://github.com/guangyey, https://github.com/d4l3k
mansiag05 pushed a commit to mansiag05/pytorch that referenced this pull request Sep 22, 2025
it's another pr to port distributed tensor test for Intel GPU, while the other pr is pytorch#161604
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: pytorch#161703
Approved by: https://github.com/guangyey, https://github.com/d4l3k
cleonard530 pushed a commit to cleonard530/pytorch that referenced this pull request Sep 22, 2025
it's another pr to port distributed tensor test for Intel GPU, while the other pr is pytorch#161604
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: pytorch#161703
Approved by: https://github.com/guangyey, https://github.com/d4l3k
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
In this pr, we port test/distributed/tensor test filesfor Intel GPU
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: pytorch#161604
Approved by: https://github.com/guangyey, https://github.com/d4l3k
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
it's another pr to port distributed tensor test for Intel GPU, while the other pr is pytorch#161604
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: pytorch#161703
Approved by: https://github.com/guangyey, https://github.com/d4l3k
pytorchmergebot pushed a commit that referenced this pull request Nov 18, 2025
…61703)

it's another pr to port distributed tensor test for Intel GPU, while the other pr is #161604
We could enable Intel GPU with following methods and try the best to keep the original code styles:

Use torch.accelerator for general gpu
Skip the case if running on xpu which has known issues

Pull Request resolved: #161703
Approved by: https://github.com/guangyey, https://github.com/d4l3k, https://github.com/albanD
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk Trigger trunk jobs on your pull request ciflow/xpu Run XPU CI tasks keep-going Don't stop on first failure, keep running tests until the end Merged module: dtensor distributed tensor tag oncall: distributed Add this issue/PR to distributed oncall triage queue open source release notes: distributed (dtensor) release notes category topic: not user facing topic category

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

7 participants