Add CUDA 12.6 Linux Builds to Binaries Matrix #138899
tinglvv wants to merge 20 commits into pytorch:main
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138899
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New Failure as of commit d55064c with merge base ea0f60e. This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
Removing 12.1 for the nightly binary build per suggestion. |
|
Not sure whether we should remove 12.1 from LINUX_BINARY_SMOKE_WORKFLOWS; removing it temporarily due to the error below. |
| "nvidia-cuda-nvrtc-cu12==12.6.77; platform_system == 'Linux' and platform_machine == 'x86_64' | " | ||
| "nvidia-cuda-runtime-cu12==12.6.77; platform_system == 'Linux' and platform_machine == 'x86_64' | " | ||
| "nvidia-cuda-cupti-cu12==12.6.80; platform_system == 'Linux' and platform_machine == 'x86_64' | " | ||
| "nvidia-cudnn-cu12==9.1.0.70; platform_system == 'Linux' and platform_machine == 'x86_64' | " |
This might be a good time to update CUDNN as well anyway?
No, let's not mix different updates (CUDA and cuDNN) into the same PR, but follow up separately.
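For readers unfamiliar with the format of the pinned dependencies in the diff above, here is a minimal sketch of how such a string can be taken apart. It assumes the entries are PEP 508-style requirements with environment markers, joined by `" | "` into one string, as the diff lines suggest. The helper name `split_requirements` and the constant `EXTRA_REQUIREMENTS` are hypothetical and are not part of the PyTorch build scripts.

```python
# Hypothetical helper, not taken from the PR: splits a " | "-joined
# requirements string into (name, version, marker) tuples.
EXTRA_REQUIREMENTS = (
    "nvidia-cuda-nvrtc-cu12==12.6.77; platform_system == 'Linux' and platform_machine == 'x86_64' | "
    "nvidia-cuda-runtime-cu12==12.6.77; platform_system == 'Linux' and platform_machine == 'x86_64' | "
    "nvidia-cuda-cupti-cu12==12.6.80; platform_system == 'Linux' and platform_machine == 'x86_64' | "
    "nvidia-cudnn-cu12==9.1.0.70; platform_system == 'Linux' and platform_machine == 'x86_64'"
)


def split_requirements(joined):
    """Return a list of (package, pinned_version, environment_marker)."""
    parsed = []
    for entry in joined.split(" | "):
        # Everything after the first ';' is the environment marker.
        spec, _, marker = entry.partition(";")
        # The pins in the diff all use the exact-match '==' operator.
        name, _, version = spec.strip().partition("==")
        parsed.append((name, version, marker.strip()))
    return parsed


for name, version, _marker in split_requirements(EXTRA_REQUIREMENTS):
    print(f"{name} -> {version}")
```

Seen this way, the cuDNN pin (9.1.0.70) is one independent entry among the CUDA 12.6 pins, which is why it can be bumped in a separate follow-up PR.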
atalman left a comment:
Add an exception in generate_conda_matrix to not include any 12.6 builds. We don't want to add new conda builds for 12.6
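One way the requested exception could look is sketched below. Only the function name `generate_conda_matrix` comes from the comment above; the function body, the `CUDA_ARCHES` list, the `CONDA_EXCLUDED_CUDA` set, and the dictionary keys are assumptions made for illustration and may not match the actual build-matrix script.

```python
# Illustrative sketch only; names and structure are assumptions.
CUDA_ARCHES = ["11.8", "12.4", "12.6"]  # assumed list of CUDA versions
CONDA_EXCLUDED_CUDA = {"12.6"}          # no new conda builds for these


def generate_conda_matrix_sketch(python_versions, cuda_arches=CUDA_ARCHES):
    """Build a conda matrix that skips the excluded CUDA versions."""
    matrix = []
    for python_version in python_versions:
        for arch_version in cuda_arches:
            if arch_version in CONDA_EXCLUDED_CUDA:
                # Skip 12.6: it is added for wheels, not for conda.
                continue
            matrix.append(
                {
                    "python_version": python_version,
                    "gpu_arch_version": arch_version,
                }
            )
    return matrix


matrix = generate_conda_matrix_sketch(["3.11", "3.12"])
print(sorted({entry["gpu_arch_version"] for entry in matrix}))
```

Keeping the excluded versions in a single set means the exception is easy to find and remove if conda builds for a given CUDA version are wanted later.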
|
Error for windows-binary-wheel might be due to #138458 which set 12.4 as default |
|
linux aarch64 failures should be resolved after correcting build script for aarch64. |
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Rebase failed due to Command Raised by https://github.com/pytorch/pytorch/actions/runs/11750112212 |
Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>
| "nvidia-curand-cu12==10.3.7.77; platform_system == 'Linux' and platform_machine == 'x86_64' | " | ||
| "nvidia-cusolver-cu12==11.7.1.2; platform_system == 'Linux' and platform_machine == 'x86_64' | " | ||
| "nvidia-cusparse-cu12==12.5.4.2; platform_system == 'Linux' and platform_machine == 'x86_64' | " | ||
| "nvidia-cusparselt-cu12==0.6.3; platform_system == 'Linux' and platform_machine == 'x86_64' | " |
We seem to be bumping cusparselt here as well. Watch for unit test failures that https://hud.pytorch.org/pytorch/pytorch/pull/138175 is currently facing.
| ) | ||
| # Special build building to use on Colab. Python 3.11 for 12.1 CUDA | ||
| if python_version == "3.11" and arch_version == "12.1": | ||
| # Special build building to use on Colab. Python 3.11 for 12.4 CUDA |
This seems to be dependent on what Colab's support matrix is, e.g. does it support CUDA 12.4?
It may well support it, but it would be good to double-check.
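To make the Colab dependency explicit, the condition in the diff could be expressed against named constants, so the assumed Colab versions live in one place and are easy to re-verify. This is a sketch only: `COLAB_PYTHON_VERSION`, `COLAB_CUDA_VERSION`, and `is_colab_special_build` are hypothetical names, and the value "12.4" mirrors the updated comment in the diff, not a confirmed fact about Colab's runtime.

```python
# Hypothetical names; the values mirror the diff and are NOT verified
# against Colab's actual support matrix.
COLAB_PYTHON_VERSION = "3.11"
COLAB_CUDA_VERSION = "12.4"  # assumption: double-check before relying on it


def is_colab_special_build(python_version, arch_version):
    """Return True for the one Python/CUDA pair that gets the Colab build."""
    return (
        python_version == COLAB_PYTHON_VERSION
        and arch_version == COLAB_CUDA_VERSION
    )


for py, cuda in [("3.11", "12.4"), ("3.11", "12.6"), ("3.12", "12.4")]:
    print(py, cuda, is_colab_special_build(py, cuda))
```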
|
@pytorchmergebot merge -f "lint failure is expected" |
Merge started: Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). |
Fixes lint after #138899; the failure was due to a land race. Run ``./regenerate.sh``. Pull Request resolved: #140446. Approved by: https://github.com/wdvr, https://github.com/huydhn, https://github.com/seemethere, https://github.com/malfet
Summary: X-link: facebookresearch/FBGEMM#486
- Upgrade gcc version to support newer libstdc++, which is required now that pytorch/pytorch#141035 has landed
- Deprecate support for CUDA 12.1 and add support for 12.6, per changes in pytorch/pytorch#138899
Pull Request resolved: #3398
Reviewed By: sryap
Differential Revision: D66277492
Pulled By: q10
fbshipit-source-id: 24817efb5c07c1985ab3beeb1610879edbd81acc
|
Which version is it in the end: 12.6, 12.6.2, or 12.6.3? |
|
Hi @johnnynunez
For x86 nightly builds, it is 12.6.3 now (#141433). For Windows builds, it is 12.6.2, as the Windows AMI takes time to build and may not make it before the 2.6.0 code freeze. cc @atalman |
Related to pytorch#138440 Issue tracker: pytorch#138609 Version based on https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html Pull Request resolved: pytorch#138899 Approved by: https://github.com/atalman Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>
Related to #138440
Issue tracker: #138609
Version based on https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html
cc @albanD @ptrblck @atalman @malfet @nWEIdia