Skip to content

Use gcc13 in Manylinux 2.28 images#152825

Closed
atalman wants to merge 12 commits intopytorch:mainfrom
atalman:fixes_gcc14
Closed

Use gcc13 in Manylinux 2.28 images#152825
atalman wants to merge 12 commits intopytorch:mainfrom
atalman:fixes_gcc14

Conversation

@atalman
Copy link
Copy Markdown
Contributor

@atalman atalman commented May 5, 2025

Related to: #152426

@atalman atalman requested a review from jeffdaily as a code owner May 5, 2025 14:31
@pytorch-bot
Copy link
Copy Markdown

pytorch-bot Bot commented May 5, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152825

Note: Links to docs will display an error until the docs builds have been completed.

❌ 9 New Failures, 5 Pending, 1 Unrelated Failure

As of commit 6778233 with merge base 2f09e79 (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot Bot added the topic: not user facing topic category label May 5, 2025
@atalman atalman added the ciflow/binaries Trigger all binary build and upload jobs on the PR label May 5, 2025
@tinglvv
Copy link
Copy Markdown
Collaborator

tinglvv commented May 6, 2025

@pytorchbot rebase -b main

@pytorchmergebot
Copy link
Copy Markdown
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

@pytorchmergebot
Copy link
Copy Markdown
Collaborator

Successfully rebased fixes_gcc14 onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via git checkout fixes_gcc14 && git pull --rebase)

@atalman atalman changed the title Use gcc14 in Manylinux 2.28 images Use gcc13 in Manylinux 2.28 images May 6, 2025
@atalman atalman requested a review from a team as a code owner May 7, 2025 13:09
DOCKER_GPU_BUILD_ARG=" --build-arg DEVTOOLSET_VERSION=13 --build-arg NINJA_VERSION=1.12.1"
MANY_LINUX_VERSION="2_28_aarch64"
;;
manylinuxcxx11-abi-builder:cpu-cxx11-abi)
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want to do anything about this build ? Looks like its still using gcc=9

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like currently it is using gcc11? https://github.com/pytorch/pytorch/blob/main/.ci/docker/manywheel/Dockerfile_2_28_aarch64#L4
We may need a PR to test the docker image before merging 🤔

Copy link
Copy Markdown
Contributor

@malfet malfet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, though I though that manylinux standard mandates minimum compler version, isn't it?

@atalman
Copy link
Copy Markdown
Contributor Author

atalman commented May 7, 2025

Hi @malfet as far as I know its not mandated, glibc version is. I believe you commended on the issue earlier: #114232 (comment)

@atalman
Copy link
Copy Markdown
Contributor Author

atalman commented May 8, 2025

@pytorchmergebot merge -f "all required tests and lint are green"

@pytorchmergebot
Copy link
Copy Markdown
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

pytorchmergebot pushed a commit that referenced this pull request Jul 15, 2025
…el for GCC 12 and above (#158117)

This PR disables `strict-aliasing` GCC C++ optimization flag on all AArch64 cpus for GCC versions 12 and above.

Pull Request #152825 upgraded gcc version from 11 to 13 in manywheel which caused several segmentation faults in unit tests ( not visible in CI workflows because the jammy gcc version has not been updated yet ).

We Identified the problem also exists in GCC12 hence the ` __GNUC__ >= 12`

Fixes #157626

fixes these tests failures when pytorch is built in GCC12 and above
```
test_ops.py::TestCommonCPU::test_noncontiguous_samples_grid_sampler_2d_cpu_float32 Fatal Python error: Segmentation fault
test_ops.py::TestCommonCPU::test_dtypes_grid_sampler_2d_cpu Fatal Python error: Segmentation fault
test_ops.py::TestMathBitsCPU::test_neg_view_nn_functional_grid_sample_cpu_float64 free(): invalid next size (fast)
test_ops.py::TestCompositeComplianceCPU::test_backward_grid_sampler_2d_cpu_float32 Fatal Python error: Segmentation fault
test_ops.py::TestCommonCPU::test_dtypes_nn_functional_grid_sample_cpu Fatal Python error: Segmentation fault

```

Pull Request resolved: #158117
Approved by: https://github.com/malfet
fadara01 added a commit that referenced this pull request Nov 3, 2025
This is needed because manylinux uses GCC-13 since #152825
As a result of the current compiler version mismatches, we've seen tests
passing jammy-aarch64 pre-commit CI, but failing for wheels built in manylinux
Related to: #166736


ghstack-source-id: a68de47
Pull-Request: #166849
fadara01 added a commit that referenced this pull request Nov 3, 2025
This is needed because manylinux uses GCC-13 since #152825
As a result of the current compiler version mismatches, we've seen tests
passing jammy-aarch64 pre-commit CI, but failing for wheels built in manylinux
Related to: #166736

ghstack-source-id: 9d18d85
Pull-Request: #166849
fadara01 added a commit that referenced this pull request Nov 3, 2025
This is needed because manylinux uses GCC-13 since #152825
As a result of the current compiler version mismatches, we've seen tests
passing jammy-aarch64 pre-commit CI, but failing for wheels built in manylinux
Related to: #166736

ghstack-source-id: 36be71d
Pull-Request: #166849
pytorchmergebot pushed a commit that referenced this pull request Nov 6, 2025
This is needed because manylinux uses GCC-13 since #152825
As a result of the current compiler version mismatches, we've seen tests passing jammy-aarch64 pre-commit CI, but failing for wheels built in manylinux
Related to: #166736

Pull Request resolved: #166849
Approved by: https://github.com/robert-hardwick, https://github.com/malfet, https://github.com/Skylion007, https://github.com/atalman
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/binaries Trigger all binary build and upload jobs on the PR Merged topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants