Fix compilation on aarch64 with gcc by malfet · Pull Request #124511 · pytorch/pytorch

malfet · 2024-04-19T18:42:03Z

gcc is more stringent than clang when equivalently sized NEON registers are cast to each other. In particular, at one point a `uint16x4_t` was cast to `int16x4_t`, which gcc does not allow. Added `vreinterpret_s16_u16` (which is a no-op) to solve this and tested in https://godbolt.org/z/sYb4ThM6M

Test plan: Build aarch64 wheels
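
For readers unfamiliar with the intrinsic, here is a minimal sketch of the kind of change described above. It is not the code from this PR: the helper name `reinterpret_u16x4_as_s16x4` and the scalar fallback branch are assumptions added for illustration, so that the snippet also builds on machines without NEON.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

#if defined(__ARM_NEON)
#include <arm_neon.h>
#endif

// Hypothetical helper (not from the PR): view four unsigned 16-bit lanes
// as four signed 16-bit lanes without changing any bits.
void reinterpret_u16x4_as_s16x4(const std::uint16_t* in, std::int16_t* out) {
  assert(in != nullptr && out != nullptr);
#if defined(__ARM_NEON)
  uint16x4_t u = vld1_u16(in);  // load 4 x uint16 into a 64-bit register
  // int16x4_t s = u;           // the kind of conversion the PR reports gcc rejecting
  int16x4_t s = vreinterpret_s16_u16(u);  // explicit reinterpretation, same bits
  vst1_s16(out, s);             // store 4 x int16
#else
  // Fallback for non-NEON targets (an assumption of this sketch):
  // copying the raw bytes is the scalar equivalent of a bit reinterpretation.
  std::memcpy(out, in, 4 * sizeof(std::uint16_t));
#endif
}
```

Because only the type changes, a lane holding `0xFFFF` reads back as `-1` and a lane holding `0x8000` reads back as `-32768`; small values such as `0` and `1` are unchanged.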


pytorch-bot · 2024-04-19T18:42:06Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/124511


✅ You can merge normally! (2 Unrelated Failures)

As of commit cf2f8d5 with merge base 25c65d6:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / linux-focal-py3.11-clang10 / test (default, 3, 3, linux.2xlarge): RuntimeError: profiler/test_profiler 1/1 failed
pull / linux-focal-py3.12-clang10 / test (default, 1, 3, linux.2xlarge): RuntimeError: profiler/test_profiler 1/1 failed

This comment was automatically generated by Dr. CI and updates every 15 minutes.

mikekgfb

Thank you!

malfet · 2024-04-19T19:51:07Z

@pytorchbot merge -f "aarch64 builds are green again, same as Mac builds"

pytorchmergebot · 2024-04-19T19:53:10Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.


Which is more stringent than clang when equivalently sized NEON registers are cast to each other. In particular, at one point `uint16x4_t` were cast to `int16x4_t`, which gcc does not allow. Added `vreinterpret_s16_u16` (which is a no-op) to solve this and tested in https://godbolt.org/z/sYb4ThM6M

Test plan: Build aarch64 wheels

Pull Request resolved: pytorch#124511
Approved by: https://github.com/mikekgfb


pytorch-bot bot added the `module: cpu` label Apr 19, 2024

malfet added the `topic: not user facing` and `ciflow/binaries_wheel` labels and removed the `module: cpu` label Apr 19, 2024

malfet requested a review from a team April 19, 2024 18:53

This was referenced Apr 19, 2024

Speedup int4mm_kernel with NEON #124257 (Closed)

[NEON] Remove implicit type conversions in tinygemm_kernel #124508 (Closed)

mikekgfb approved these changes Apr 19, 2024


pytorchmergebot added the merging label Apr 19, 2024

pytorchmergebot added the Merged label Apr 19, 2024

pytorchmergebot closed this in e6a788a Apr 19, 2024

pytorchmergebot removed the merging label Apr 19, 2024

malfet deleted the malfet-patch-28 branch April 19, 2024 22:38

Fix compilation on aarch64 with gcc #124511
malfet wants to merge 1 commit into main from malfet-patch-28
