Skip to content

[SymmMem] Find NVSHMEM from system installation#157755

Closed
pytorchbot wants to merge 1 commit intorelease/2.8from
cherry-pick-157513-by-pytorch_bot_bot_
Closed

[SymmMem] Find NVSHMEM from system installation#157755
pytorchbot wants to merge 1 commit intorelease/2.8from
cherry-pick-157513-by-pytorch_bot_bot_

Conversation

@pytorchbot
Copy link
Collaborator

@pytorchbot pytorchbot commented Jul 8, 2025

Stack from ghstack (oldest at bottom):

Previously we only search for NVSHMEM from pip install location.
This PR adds search in system locations deemed default by CMake.
Related: #157453 untars NVSHMEM into /usr/local on our CI machines.

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @pragupta @ezyang @msaroufim @Skylion007

Previously we only search for NVSHMEM from pip install location.
This PR adds search in system locations deemed default by CMake.
Related: #157453 untars NVSHMEM into `/usr/local` on our CI machines.

Pull Request resolved: #157513
Approved by: https://github.com/atalman, https://github.com/Skylion007

(cherry picked from commit 99c1a6b)
@pytorch-bot
Copy link

pytorch-bot bot commented Jul 8, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/157755

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 644ab89 with merge base 3a7ff82 (image):

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added ciflow/h100-symm-mem oncall: distributed Add this issue/PR to distributed oncall triage queue release notes: distributed (c10d) release notes category labels Jul 8, 2025
@Skylion007 Skylion007 requested a review from malfet July 8, 2025 02:35
@github-actions
Copy link
Contributor

github-actions bot commented Sep 6, 2025

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions github-actions bot added the Stale label Sep 6, 2025
@github-actions github-actions bot closed this Oct 6, 2025
@github-actions github-actions bot deleted the cherry-pick-157513-by-pytorch_bot_bot_ branch November 6, 2025 02:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/h100-symm-mem oncall: distributed Add this issue/PR to distributed oncall triage queue open source release notes: distributed (c10d) release notes category Stale

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants