hardsigmoid: add cuda kernels#36351

Closed
vkuzo wants to merge 5 commits into gh/vkuzo/27/base from gh/vkuzo/27/head
Conversation


@vkuzo vkuzo commented Apr 9, 2020

Stack from ghstack:

Summary:

Adds CUDA kernels for hardsigmoid, to enable its use in training.

Note: the update to the cpu backward pass is to keep the cpu vs cuda
logic consistent, no change in functionality.

Test Plan:

add CI for the forward pass
run this for the backward pass:
https://gist.github.com/vkuzo/95957d365600f9ad10d25bd20f58cc1a
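For reference, the math these kernels compute can be sketched in a few lines of pure Python. This is a scalar sketch of hardsigmoid's forward and backward semantics under PyTorch's definition, clamp(x/6 + 1/2, 0, 1); the function names are illustrative, not the PR's actual kernel code:

```python
def hardsigmoid(x):
    # forward: clamp(x / 6 + 1 / 2, 0, 1), equivalent to relu6(x + 3) / 6
    return min(max(x / 6.0 + 0.5, 0.0), 1.0)

def hardsigmoid_backward(grad_out, x):
    # backward: the slope is 1/6 on the open interval (-3, 3), zero elsewhere
    return grad_out * (1.0 / 6.0) if -3.0 < x < 3.0 else 0.0
```

The function saturates at x = -3 and x = 3, so the gradient is exactly zero outside that interval, which is what makes a dedicated backward kernel straightforward.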

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D20955589

vkuzo added a commit that referenced this pull request Apr 9, 2020
ghstack-source-id: 598c203
Pull Request resolved: #36351
dr-ci Bot commented Apr 9, 2020

💊 Build failures summary and remediations

As of commit 1a27431 (more details on the Dr. CI page):


  • 1/1 failures introduced in this PR

XLA failure

Job pytorch_xla_linux_xenial_py3_6_clang7_test is failing. Please create an issue with title prefixed by [PT_BREAK] in pytorch/xla and link to this PR. If you have questions, please reach out to @ailzhang / @dlibenzi / @JackCaoG.

This comment was automatically generated by Dr. CI.

@vkuzo vkuzo self-assigned this Apr 9, 2020
@vkuzo vkuzo added the oncall: quantization Quantization support in PyTorch label Apr 9, 2020
vkuzo added a commit that referenced this pull request Apr 10, 2020
ghstack-source-id: 24645f5
Pull Request resolved: #36351
}

void hardsigmoid_kernel(TensorIterator& iter) {
AT_DISPATCH_FLOATING_TYPES_AND2(at::ScalarType::Half, at::ScalarType::BFloat16, iter.dtype(), "hardsigmoid_cuda", [&]() {
TensorFlow uses a different definition (https://www.tensorflow.org/api_docs/python/tf/keras/activations/hard_sigmoid).
We do not have a use case for that definition, so let's stick with this one.
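To make the contrast concrete, here is a sketch of the two definitions side by side. The TF/Keras variant follows the linked docs as of this PR (0.2*x + 0.5 clipped to [0, 1], an assumption worth double-checking against current docs); function names are hypothetical:

```python
def hardsigmoid_pytorch(x):
    # PyTorch: clamp(x / 6 + 1 / 2, 0, 1); saturates at x = -3 and x = 3
    return min(max(x / 6.0 + 0.5, 0.0), 1.0)

def hard_sigmoid_keras(x):
    # TF/Keras (per the linked docs at the time): clamp(0.2 * x + 0.5, 0, 1);
    # steeper slope, saturates at x = -2.5 and x = 2.5
    return min(max(0.2 * x + 0.5, 0.0), 1.0)
```

Both agree at x = 0 but diverge elsewhere because of the different slopes, so the two are not interchangeable in a trained model.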

@facebook-github-bot

This pull request has been merged in 16e90eb.

facebook-github-bot pushed a commit that referenced this pull request Apr 17, 2020
Summary:
#36351 made `hardsigmoid_backward` use TensorIterator, but that can be done only after proper device dispatch.
Pull Request resolved: #36704

Differential Revision: D21068126

Pulled By: ailzhang

fbshipit-source-id: 6a6a74216f2b50fa7d15f692cd1583d3d233580a
@facebook-github-bot facebook-github-bot deleted the gh/vkuzo/27/head branch April 19, 2020 14:17