Turn on compile with NVSHMEM #154538
We'll need https://pypi.nvidia.com/nvidia-nvshmem-cu12/ added to the release builds at some point if they aren't already.
Thanks @Skylion007. Thinking the same. Do you know how I can do that?
At the very least, you need to add it here: and have @atalman or someone with write access upload a binary to S3 so the nightly builds do not fail (you need to regenerate the YAMLs after editing this file). Trigger CI Flows binary
I added the extra dependency in #154568.
Before: `USE_NVSHMEM=1` needs to be explicitly set in the build environment. After: `USE_NVSHMEM=1` is the default for CUDA/ROCm on Linux.
If we want binaries to have nvshmem, we need to make sure it's installed and discoverable for binary builds?
@pytorchbot merge
Before: `USE_NVSHMEM=1` needs to be explicitly set in the build environment.
After: `USE_NVSHMEM=1` is the default for CUDA/ROCm on Linux.
Pull Request resolved: pytorch#154538
Approved by: https://github.com/ngimel
NVSHMEM 3.2.5 (released Mar 2025) has both cu11 and cu12 builds. See:
https://pypi.nvidia.com/nvidia-nvshmem-cu12/
https://pypi.nvidia.com/nvidia-nvshmem-cu11/
Pull Request resolved: #154568
Approved by: https://github.com/atalman
ghstack dependencies: #154538
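For anyone who wants to check locally whether one of these wheels is present before building, here is a minimal sketch. It assumes the installed distribution names match the URL paths above (`nvidia-nvshmem-cu12`, `nvidia-nvshmem-cu11`); `installed_nvshmem_wheels` is a hypothetical helper, not something in the PyTorch tree. It only inspects Python package metadata and says nothing about whether the build system can discover the library.

```python
from importlib.metadata import PackageNotFoundError, version

# Assumption: distribution names mirror the pypi.nvidia.com URL paths.
NVSHMEM_WHEELS = ("nvidia-nvshmem-cu12", "nvidia-nvshmem-cu11")


def installed_nvshmem_wheels(candidates=NVSHMEM_WHEELS):
    """Return {distribution name: version} for each candidate that is installed.

    Hypothetical helper for illustration only.
    """
    found = {}
    for name in candidates:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # Not installed in this environment; skip it.
            continue
    return found


if __name__ == "__main__":
    wheels = installed_nvshmem_wheels()
    if wheels:
        for name, ver in wheels.items():
            print(f"{name}=={ver}")
    else:
        print("no NVSHMEM wheel found in this environment")
```

An empty result means neither wheel is visible to the current interpreter.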
This reverts commit 3685b10.
Stack from ghstack (oldest at bottom):
Before: `USE_NVSHMEM=1` needs to be explicitly set in the build environment.
After: `USE_NVSHMEM=1` is the default for CUDA/ROCm on Linux.
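The before/after behavior can be sketched as a small decision function. This is an illustration of the described default, not the actual build logic in this PR: `resolve_use_nvshmem` and its parameters are hypothetical, and it assumes an explicit `USE_NVSHMEM` value in the environment still takes precedence over the new default.

```python
import os
import sys

# Assumption: these spellings count as "on"; the real build may accept others.
_TRUTHY = {"1", "ON", "TRUE", "YES"}


def resolve_use_nvshmem(env, platform, use_cuda, use_rocm):
    """Decide whether NVSHMEM support should be compiled in.

    Hypothetical helper mirroring the PR description:
    - an explicit USE_NVSHMEM setting wins (assumed);
    - otherwise default to on for CUDA/ROCm builds on Linux.
    """
    explicit = env.get("USE_NVSHMEM")
    if explicit is not None:
        return explicit.strip().upper() in _TRUTHY
    return bool(platform.startswith("linux") and (use_cuda or use_rocm))


if __name__ == "__main__":
    # Example: a CUDA build on the current machine with the current environment.
    enabled = resolve_use_nvshmem(os.environ, sys.platform, use_cuda=True, use_rocm=False)
    print(f"USE_NVSHMEM resolved to {int(enabled)}")
```

Under this sketch, opting out would mean setting `USE_NVSHMEM=0` in the build environment.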