[jit] Fuse tensor-scalar ops when scalar is constant #10511
zou3519 wants to merge 8 commits into pytorch:master
Conversation
Force-pushed 2267979 to a235f9d
zdevito
left a comment
This looks good. I have some small comments. I am surprised that we didn't need to modify the fusion_compiler; did we already have code to emit constants?
@zdevito Yes, there was already code to emit constants inlined into the body of the FusionGroup. In particular, the graph_fuser inlines the
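For readers unfamiliar with the mechanism being discussed: emitting a constant means the scalar appears as a literal in the generated kernel source, rather than being passed in as a kernel argument. A minimal standalone sketch of that idea (the function and naming here are illustrative, not the actual fusion_compiler API):

```cpp
#include <sstream>
#include <string>

// Illustrative sketch: build the body of a fused elementwise kernel where a
// constant scalar operand is inlined as a literal in the source text instead
// of being threaded through as a kernel argument. Hypothetical names, not
// the real PyTorch fuser code.
std::string emitFusedAddConstant(const std::string& input, double scalar) {
    std::ostringstream body;
    // The constant is baked directly into the kernel source.
    body << "out[i] = " << input << "[i] + " << scalar << ";";
    return body.str();
}
```

Because the constant is part of the kernel text, no extra runtime plumbing is needed to pass it in, which is why the fusion_compiler did not need changes.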
Force-pushed 389d846 to 0616b4a
apaszke
left a comment
Mostly LGTM, but I'd like to remove the hacky scalar checks that don't even try to see which overloads we handle. Please use the matching syntax, or we'll end up with a lot of bugs again.
facebook-github-bot
left a comment
zou3519 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
apaszke
left a comment
Looks great now! Some minor comments.
facebook-github-bot
left a comment
zou3519 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Force-pushed 221c95f to c79b24a
- Use WithInsertPoint instead of insertAfter
- Make the compatible devices logic more explicit by adding a Device struct and a DeviceType enum. The possible Devices are `Unknown | AnyDevice | CPU | CUDA i`
- Use ->matches instead of hacky allowing-numbers-in-type-checks
- Rewrite bool compatibleDevices(Node * consumer, Value * producer) to be more readable
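The device-compatibility change described above can be pictured roughly like this. This is a sketch reconstructed from the changelog text (`Unknown | AnyDevice | CPU | CUDA i`), not the actual PyTorch source; the member names and the compatibility rule are assumptions:

```cpp
// Sketch of the Device struct / DeviceType enum mentioned in the changelog.
// "AnyDevice" models values (e.g. scalar constants) that can live anywhere;
// "CUDA i" carries a device index. Illustrative only.
enum class DeviceType { Unknown, AnyDevice, CPU, CUDA };

struct Device {
    DeviceType type;
    int index;  // only meaningful for CUDA devices ("CUDA i")

    Device(DeviceType t = DeviceType::Unknown, int i = -1)
        : type(t), index(i) {}

    // Two devices are plausibly fusion-compatible if either side is
    // AnyDevice, or both refer to the same concrete device.
    bool compatibleWith(const Device& other) const {
        if (type == DeviceType::AnyDevice || other.type == DeviceType::AnyDevice)
            return true;
        return type == other.type && index == other.index;
    }
};
```

Making the device a small value type like this lets compatibleDevices read as a single explicit rule instead of scattered special cases.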
Force-pushed c79b24a to 9d943a6
facebook-github-bot
left a comment
zou3519 is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
This is on the way to resolving #9940.
Fixes #10501
This PR modifies graph fuser to fuse operations that have constant
scalar arguments. These constant scalar arguments are directly inlined
into the kernel body.
The context for this is that LSTM backward (in particular, sigmoid
backward) has many add(x, 1.) operations. This PR should be sufficient for
LSTM backward to get fused by the graph fuser.
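To see where those constant-scalar adds come from: the derivative of sigmoid is y * (1 - y) for y = sigmoid(x), so every sigmoid in the LSTM backward pass produces a tensor-scalar op with the literal constant 1. A scalar sketch of the math (plain C++, not JIT-generated code):

```cpp
#include <cmath>

// Forward: y = 1 / (1 + exp(-x))
double sigmoid(double x) { return 1.0 / (1.0 + std::exp(-x)); }

// Backward: grad_input = grad_output * y * (1 - y).
// The (1 - y) term is exactly the kind of tensor-scalar op with a constant
// operand (the literal 1.) that this PR lets the graph fuser inline into
// the fused kernel body.
double sigmoid_backward(double grad_output, double y) {
    return grad_output * y * (1.0 - y);
}
```

Before this change, each such constant kept the surrounding op out of the FusionGroup; with constants inlined, the whole elementwise chain can fuse.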
cc @apaszke @zdevito