Skip to content

Fix compilation on aarch64 with gcc#124511

Closed
malfet wants to merge 1 commit intomainfrom
malfet-patch-28
Closed

Fix compilation on aarch64 with gcc#124511
malfet wants to merge 1 commit intomainfrom
malfet-patch-28

Conversation

@malfet
Copy link
Copy Markdown
Contributor

@malfet malfet commented Apr 19, 2024

Which is more stringent than clang when equivalently sized NEON registers are cast to each other. In particular, at one point uint16x4_t were cast to int16x4_t, which gcc does not allow. Added vreinterpret_s16_u16 (which is a no-op) to solve this and tested in https://godbolt.org/z/sYb4ThM6M

Test plan: Build aarch64 wheels

Which is more stringent than clang when equivalently sized NEON registers are cast to each other. In particular, at one point `uint16x4_t` were cast to `int16x4_t`, which gcc does not allow. Added `vreinterpret_s16_u16` (which is a no-op) to solve this and tested in https://godbolt.org/z/sYb4ThM6M
@pytorch-bot
Copy link
Copy Markdown

pytorch-bot bot commented Apr 19, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/124511

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit cf2f8d5 with merge base 25c65d6 (image):

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Apr 19, 2024
@malfet malfet added topic: not user facing topic category ciflow/binaries_wheel Trigger binary build and upload jobs for wheel on the PR and removed module: cpu CPU specific problem (e.g., perf, algorithm) labels Apr 19, 2024
@malfet malfet requested a review from a team April 19, 2024 18:53
Copy link
Copy Markdown
Contributor

@mikekgfb mikekgfb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!

@malfet
Copy link
Copy Markdown
Contributor Author

malfet commented Apr 19, 2024

@pytorchbot merge -f "aarch64 builds are green again, same as Mac builds"

@pytorchmergebot
Copy link
Copy Markdown
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@malfet malfet deleted the malfet-patch-28 branch April 19, 2024 22:38
sanketpurandare pushed a commit to sanketpurandare/pytorch that referenced this pull request Apr 22, 2024
Which is more stringent than clang when equivalently sized NEON registers are cast to each other. In particular, at one point `uint16x4_t` were cast to `int16x4_t`, which gcc does not allow. Added `vreinterpret_s16_u16` (which is a no-op) to solve this and tested in https://godbolt.org/z/sYb4ThM6M

Test plan: Build aarch64 wheels
Pull Request resolved: pytorch#124511
Approved by: https://github.com/mikekgfb
pytorch-bot bot pushed a commit that referenced this pull request May 3, 2024
Which is more stringent than clang when equivalently sized NEON registers are cast to each other. In particular, at one point `uint16x4_t` were cast to `int16x4_t`, which gcc does not allow. Added `vreinterpret_s16_u16` (which is a no-op) to solve this and tested in https://godbolt.org/z/sYb4ThM6M

Test plan: Build aarch64 wheels
Pull Request resolved: #124511
Approved by: https://github.com/mikekgfb
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/binaries_wheel Trigger binary build and upload jobs for wheel on the PR Merged topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants