Use templates instead of macro when defining Vec256<BFloat16> bin operators by xuhdev · Pull Request #35844 · pytorch/pytorch

xuhdev · 2020-04-01T22:57:42Z

Stack from ghstack:

Add comparison operators to Vec256<BFloat16> #36106 Add comparison operators to Vec256
Use templates instead of macro when defining Vec256<BFloat16> bin operators #35844 Use templates instead of macro when defining Vec256 bin operators

Also, bitwise operators can operate on the underlying __m256i
representation directly instead of making expensive conversions to
float16.

Differential Revision: D20927639

…rators Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. [ghstack-poisoned]

…rators Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. ghstack-source-id: 83304a6 Pull Request resolved: #35844

dr-ci · 2020-04-01T22:58:56Z

💊 CircleCI build failures summary and remediations

As of commit a2e7aa8 (more details on the Dr. CI page):

✅ None of the build failures appear to be your fault 💚

2/2 broken upstream at merge base 8afa001 since Apr 07
Please rebase on the viable/strict branch (expand for instructions)

If your commit is newer than viable/strict, you can try basing on an older, stable commit:
```
git fetch https://github.com/pytorch/pytorch viable/strict
git rebase --onto FETCH_HEAD $(git merge-base origin/master HEAD)
```
If your commit is older than viable/strict:
```
git fetch https://github.com/pytorch/pytorch viable/strict
git rebase FETCH_HEAD
```
Check out the recency history of this "viable master" tracking branch.

🚧 2 upstream failures:

These were probably caused by upstream breakages:

pytorch_bazel_test from Apr 06 until Apr 07 (15 commits; 2e8f954 - ebf743a)
- 🔁 rerun
pytorch_cpp_doc_push since Apr 07
- 🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions on the GitHub issue tracker.

See how this bot performed.

This comment has been revised 24 times.

xuhdev · 2020-04-01T23:05:13Z

-  auto o2 = func(a_hi, b_hi);                                                               \
-  return cvtfp32_bf16(o1, o2);                                                              \
+template<typename Op>
+Vec256<BFloat16> inline bfloat16_binary_op_as_fp32(const Vec256<BFloat16>& a, const Vec256<BFloat16>& b, Op op) {


@XiaobingSuper Do you think this function can also be used for implementing operators >, <, >=, and <=? Now #35117 should be waiting on these operators.

Yes, there a PR #35092 doing this.

…16> bin operators" Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. [ghstack-poisoned]

ngimel · 2020-04-07T20:40:05Z

+}
+
+Vec256<BFloat16> inline operator&(const Vec256<BFloat16>& a, const Vec256<BFloat16>& b) {
+  return _mm256_and_si256(a, b);


this is instruction for signed integers, not for floats? It used to be _mm256_and_ps, which is indeed instruction for floats. Ah, nm, I see what you are doing.

Yes. The point is that it is not necessary to convert to float in this case, because bitwise operators have the same effects. There are two different instructions for integers and float because they can be directly applied to different data types (__m256i and __m256).

facebook-github-bot · 2020-04-09T00:13:34Z

@ngimel merged this pull request in 0bc17dd.

…rators (pytorch#35844) Summary: Pull Request resolved: pytorch#35844 Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. Test Plan: Imported from OSS Differential Revision: D20927639 Pulled By: ngimel fbshipit-source-id: 148c503df090580c8504f0df8d6ed2648d614120

Use templates instead of macro when defining Vec256<BFloat16> bin ope…

695c28a

…rators Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. [ghstack-poisoned]

xuhdev requested a review from XiaobingSuper April 1, 2020 22:58

pytorchbot added the open source label Apr 1, 2020

xuhdev commented Apr 1, 2020

View reviewed changes

XiaobingSuper requested a review from ngimel April 3, 2020 01:32

zou3519 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Apr 6, 2020

xuhdev requested a review from ezyang April 6, 2020 22:10

Update on "Use templates instead of macro when defining Vec256<BFloat…

056e73b

…16> bin operators" Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. [ghstack-poisoned]

xuhdev mentioned this pull request Apr 6, 2020

Add comparison operators to Vec256<BFloat16> #36106

Closed

Update on "Use templates instead of macro when defining Vec256<BFloat…

7d2108b

…16> bin operators" Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. [ghstack-poisoned]

xuhdev mentioned this pull request Apr 7, 2020

bfloat16: vectorized unary ops #35092

Closed

xuhdev added 2 commits April 6, 2020 19:25

Update on "Use templates instead of macro when defining Vec256<BFloat…

53d9eca

…16> bin operators" Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. [ghstack-poisoned]

Update on "Use templates instead of macro when defining Vec256<BFloat…

a2e7aa8

…16> bin operators" Also, bitwise operators can operate on the underlying __m256i representation directly instead of making expensive conversions to float16. [ghstack-poisoned]

ngimel reviewed Apr 7, 2020

View reviewed changes

ngimel approved these changes Apr 7, 2020

View reviewed changes

facebook-github-bot closed this in 0bc17dd Apr 9, 2020

facebook-github-bot added the merged label Apr 9, 2020

xuhdev deleted the gh/xuhdev/69/head branch April 9, 2020 21:03

mruberry added the Merged label Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use templates instead of macro when defining Vec256<BFloat16> bin operators#35844

Use templates instead of macro when defining Vec256<BFloat16> bin operators#35844
xuhdev wants to merge 5 commits intogh/xuhdev/69/basefrom
gh/xuhdev/69/head

xuhdev commented Apr 1, 2020 •

edited by ngimel

Loading

Uh oh!

dr-ci Bot commented Apr 1, 2020 •

edited

Loading

Uh oh!

xuhdev Apr 1, 2020

Uh oh!

XiaobingSuper Apr 2, 2020

Uh oh!

ngimel Apr 7, 2020 •

edited

Loading

Uh oh!

xuhdev Apr 7, 2020

Uh oh!

facebook-github-bot commented Apr 9, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

xuhdev commented Apr 1, 2020 • edited by ngimel Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dr-ci Bot commented Apr 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CircleCI build failures summary and remediations

🚧 2 upstream failures:

Uh oh!

xuhdev Apr 1, 2020

Choose a reason for hiding this comment

Uh oh!

XiaobingSuper Apr 2, 2020

Choose a reason for hiding this comment

Uh oh!

ngimel Apr 7, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xuhdev Apr 7, 2020

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Apr 9, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

xuhdev commented Apr 1, 2020 •

edited by ngimel

Loading

dr-ci Bot commented Apr 1, 2020 •

edited

Loading

ngimel Apr 7, 2020 •

edited

Loading