[PyTorch] add NEON half2float fmadd/fmsub by swolchok · Pull Request #137723 · pytorch/pytorch

swolchok · 2024-10-10T19:35:14Z

Stack from ghstack (oldest at bottom):

NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface.

Differential Revision: D64197048

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]

pytorch-bot · 2024-10-10T19:35:18Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137723

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit de8a7a9 with merge base de4c2a3 ():

NEW FAILURES - The following jobs have failed:

Check mergeability of ghstack PR / ghstack-mergeability-check (gh)
RuntimeError: Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x b5a5706 returned non-zero exit code 1
trunk / macos-py3-arm64 / build (gh)
/Users/ec2-user/runner/_work/pytorch/pytorch/aten/src/ATen/cpu/vec/vec128/vec128_half_neon.h:694:56: error: no function template matches function template specialization 'fmsub'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-10-10T19:35:27Z

This pull request was exported from Phabricator. Differential Revision: D64197048

NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]

facebook-github-bot · 2024-10-10T19:50:39Z

This pull request was exported from Phabricator. Differential Revision: D64197048

NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]

facebook-github-bot · 2024-10-10T21:11:21Z

This pull request was exported from Phabricator. Differential Revision: D64197048

Pull Request resolved: #137723 NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. ghstack-source-id: 247383137 @exported-using-ghexport Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/)

…d/fmsub" NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]

facebook-github-bot · 2024-10-10T21:38:59Z

This pull request was exported from Phabricator. Differential Revision: D64197048

Pull Request resolved: #137723 NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. ghstack-source-id: 247393404 @exported-using-ghexport Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/)

swolchok · 2024-10-11T23:39:10Z

this is getting folded into a forthcoming PR

[PyTorch] add NEON half2float fmadd/fmsub

b7072d1

NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]

pytorch-bot Bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Oct 10, 2024

swolchok mentioned this pull request Oct 10, 2024

[PyTorch] Port ExecuTorch bfdot improvement back to ATen BlasKernel, Try #2 #137377

Closed

facebook-github-bot added the fb-exported label Oct 10, 2024

Update on "[PyTorch] add NEON half2float fmadd/fmsub"

de17f26

NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]

Update on "[PyTorch] add NEON half2float fmadd/fmsub"

ca27a27

NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]

swolchok added the release notes: cpp release notes category label Oct 11, 2024

swolchok requested review from jgong5, kimishpatel and malfet October 11, 2024 01:26

jgong5 approved these changes Oct 11, 2024

View reviewed changes

pytorch-bot Bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 11, 2024

swolchok closed this Oct 11, 2024

github-actions Bot deleted the gh/swolchok/653/head branch November 11, 2024 02:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PyTorch] add NEON half2float fmadd/fmsub#137723

[PyTorch] add NEON half2float fmadd/fmsub#137723
swolchok wants to merge 4 commits intogh/swolchok/653/basefrom
gh/swolchok/653/head

swolchok commented Oct 10, 2024 •

edited by pytorch-bot Bot

Loading

Uh oh!

pytorch-bot Bot commented Oct 10, 2024 •

edited

Loading

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

swolchok commented Oct 11, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

swolchok commented Oct 10, 2024 • edited by pytorch-bot Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Oct 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137723

❌ 2 New Failures

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

facebook-github-bot commented Oct 10, 2024

Uh oh!

swolchok commented Oct 11, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

swolchok commented Oct 10, 2024 •

edited by pytorch-bot Bot

Loading

pytorch-bot Bot commented Oct 10, 2024 •

edited

Loading