Skip to content

[ROCm] test_hip_device_count safely runs on 1 GPU systems#156398

Closed
BLOrange-AMD wants to merge 3 commits intopytorch:mainfrom
ROCm:hip_device_count_upstream_fix
Closed

[ROCm] test_hip_device_count safely runs on 1 GPU systems#156398
BLOrange-AMD wants to merge 3 commits intopytorch:mainfrom
ROCm:hip_device_count_upstream_fix

Conversation

@BLOrange-AMD
Copy link
Contributor

@BLOrange-AMD BLOrange-AMD commented Jun 19, 2025

Fixes test_cuda.py::TestCuda::test_hip_device_count on single gpu scenario

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd

@pytorch-bot
Copy link

pytorch-bot bot commented Jun 19, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/156398

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 385b504 with merge base 77518d1 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copy link
Collaborator

@jeffdaily jeffdaily left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is only a problem if the testing systems has 1 GPU. Either we make this change or decorate the test to require 2 GPUs.

@BLOrange-AMD
Copy link
Contributor Author

@malfet @atalman Could you help to review this PR? Thanks.

@jeffdaily jeffdaily changed the title Added index 0 for ROCR_VISIBLE_DEVICES [ROCm] test_hip_device_count safely runs on 1 GPU systems Jun 26, 2025
@pytorch-bot pytorch-bot bot added ciflow/rocm Trigger "default" config CI on ROCm module: rocm AMD GPU support for Pytorch labels Jun 26, 2025
@pytorch-bot pytorch-bot bot removed the ciflow/rocm Trigger "default" config CI on ROCm label Jun 27, 2025
@jeffdaily jeffdaily added the ciflow/rocm Trigger "default" config CI on ROCm label Jun 27, 2025
@jeffdaily
Copy link
Collaborator

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jun 28, 2025
@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/rocm Trigger "default" config CI on ROCm ciflow/trunk Trigger trunk jobs on your pull request Merged module: rocm AMD GPU support for Pytorch open source topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants