Upgrade ROCm wheels to manylinux2_28 - 1a of 2 (docker images)#140681
Upgrade ROCm wheels to manylinux2_28 - 1a of 2 (docker images)#140681jithunnair-amd wants to merge 11 commits intopytorch:mainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140681
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New FailureAs of commit 6642fb5 with merge base cb8c956 ( NEW FAILURE - The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
@huydhn @atalman Got this error for the rocm6.1 docker build job: |
|
I have created the missing ECR record |
Thanks, can you please do it for rocm6.2 as well? That will also need an ECR record. |
@huydhn Also, I suppose this means we will need to request you to create a new ECR record for every ROCm upgrade? |
That's the current process. Also, due to #140958 earlier today, I haven't been able to deploy the ECR change yet. I will need to check with @jeanschmidt when we can resume infra deployment. We could use |
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
b0b44ca to
1a87ee6
Compare
1a87ee6 to
3dd5cb0
Compare
I just create it, starting the infra deployment now, so give it about 15 minutes to finish |
3dd5cb0 to
2a4560f
Compare
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
2a4560f to
cb09ef8
Compare
434d09a to
6642fb5
Compare
|
@pytorchbot merge -f "Manylinux/libtorch/CI docker image builds for ROCm completed with build duration 40-45min; other CI failures unrelated" |
Merge startedYour change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
…er images) (#141609) Upgrade gcc version from 9 to 11 on ROCm manylinux images. Needed for #141423 since almalinux8-based manylinux2_28 images for ROCm (#140681) installs gcc-toolset-9, which installs [gcc 9.2.1](https://pkgs.org/download/gcc-toolset-9-gcc-c++). However, PyTorch CMakeLists.txt enforces a [minimum gcc version of 9.3](https://github.com/pytorch/pytorch/blob/5318bf8baf19fecda365c185cd81196e3cfb08e3/CMakeLists.txt#L61). Pull Request resolved: #141609 Approved by: https://github.com/jeffdaily Co-authored-by: Jithun Nair <jithun.nair@amd.com>
Depends on #140681 and #141609 Highlights: * Upgrade binaries to ROCm6.2.4 to use latest docker images * Remove pre-cxx11 builds for libtorch on ROCm * Use manylinux2_28 docker images for ROCm * Set `DESIRED_DEVTOOLSET=cxx-abi` (and hence `_GLIBCXX_USE_CXX11_ABI=1`) for ROCm manylinux2_28 wheels (ROCm RHEL8 packages also have GCC_ABI=1, so it keeps it consistent) Pull Request resolved: #141423 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com> Co-authored-by: Pruthvi Madugundu <pruthvigithub@gmail.com>
…h#140681) Fixes pytorch#140631 Highlights: * Use `cpu_final` base for ROCm in `.ci/docker/manywheel/Dockerfile_2_28` * Cleans up install_miopen.sh to remove old ROCm references * Install `gcc-gfortran` package to build magma for ROCm on almalinux Needs builder PR pytorch/builder#2043 (merged) so that GCC_ABI expected value is updated. Pull Request resolved: pytorch#140681 Approved by: https://github.com/jeffdaily
…er images) (pytorch#141609) Upgrade gcc version from 9 to 11 on ROCm manylinux images. Needed for pytorch#141423 since almalinux8-based manylinux2_28 images for ROCm (pytorch#140681) installs gcc-toolset-9, which installs [gcc 9.2.1](https://pkgs.org/download/gcc-toolset-9-gcc-c++). However, PyTorch CMakeLists.txt enforces a [minimum gcc version of 9.3](https://github.com/pytorch/pytorch/blob/5318bf8baf19fecda365c185cd81196e3cfb08e3/CMakeLists.txt#L61). Pull Request resolved: pytorch#141609 Approved by: https://github.com/jeffdaily Co-authored-by: Jithun Nair <jithun.nair@amd.com>
) Depends on pytorch#140681 and pytorch#141609 Highlights: * Upgrade binaries to ROCm6.2.4 to use latest docker images * Remove pre-cxx11 builds for libtorch on ROCm * Use manylinux2_28 docker images for ROCm * Set `DESIRED_DEVTOOLSET=cxx-abi` (and hence `_GLIBCXX_USE_CXX11_ABI=1`) for ROCm manylinux2_28 wheels (ROCm RHEL8 packages also have GCC_ABI=1, so it keeps it consistent) Pull Request resolved: pytorch#141423 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com> Co-authored-by: Pruthvi Madugundu <pruthvigithub@gmail.com>
) Depends on pytorch#140681 and pytorch#141609 Highlights: * Upgrade binaries to ROCm6.2.4 to use latest docker images * Remove pre-cxx11 builds for libtorch on ROCm * Use manylinux2_28 docker images for ROCm * Set `DESIRED_DEVTOOLSET=cxx-abi` (and hence `_GLIBCXX_USE_CXX11_ABI=1`) for ROCm manylinux2_28 wheels (ROCm RHEL8 packages also have GCC_ABI=1, so it keeps it consistent) Pull Request resolved: pytorch#141423 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com> Co-authored-by: Pruthvi Madugundu <pruthvigithub@gmail.com>
Fixes #140631
Highlights:
cpu_finalbase for ROCm in.ci/docker/manywheel/Dockerfile_2_28gcc-gfortranpackage to build magma for ROCm on almalinuxNeeds builder PR pytorch/builder#2043 (merged) so that GCC_ABI expected value is updated.
cc @jeffdaily @sunway513 @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd