Boxed variable dispatch by smessmer · Pull Request #29934 · pytorch/pytorch

smessmer · 2019-11-15T23:07:41Z

Stack from ghstack:

#29934 Boxed variable dispatch

Previously, when doing boxed dispatch (e.g. custom ops), the dispatcher manually removed the VariableTensorId flag before dispatching
because custom ops don't have variable kernels.
This is one of the blockers that prevented us from using the boxed dispatch mechanism for ops from native_functions.yaml because they define variable kernels and need them to be called for autograd.

This PR changes that. The dispatcher doesn't remove the VariableTensorId flag anymore.
Instead, to make custom ops work, we implement a variable fallback kernel that is called whenever no other variable kernel was found.
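To make the mechanism above concrete, here is a minimal toy model in plain C++. It is only a sketch: `ToyDispatcher`, `Key`, `SkipVariableGuard`, `runDemo`, and the op names are all hypothetical stand-ins invented for illustration, and none of this is the real c10 dispatcher API. It assumes the fallback sets a thread-local flag and re-dispatches, which is the approach described in the review thread.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Toy model only: these types are NOT the real c10 dispatcher API.
enum class Key { CPU, Variable };
using Stack = std::vector<double>;
using BoxedKernel = std::function<void(const std::string& op, Stack*)>;

// Thread-local switch standing in for what AutoNonVariableTypeMode toggles.
thread_local bool g_skip_variable = false;

// RAII guard: skip variable kernels for the current scope, then restore.
struct SkipVariableGuard {
  bool prev;
  SkipVariableGuard() : prev(g_skip_variable) { g_skip_variable = true; }
  ~SkipVariableGuard() { g_skip_variable = prev; }
};

class ToyDispatcher {
 public:
  void registerKernel(const std::string& op, Key key, BoxedKernel k) {
    kernels_[op][key] = std::move(k);
  }
  void setVariableFallback(BoxedKernel k) { variable_fallback_ = std::move(k); }

  // is_variable models the VariableTensorId flag; it is no longer stripped.
  void callBoxed(const std::string& op, Stack* stack, bool is_variable) {
    auto& table = kernels_.at(op);
    if (is_variable && !g_skip_variable) {
      auto it = table.find(Key::Variable);
      if (it != table.end()) {
        it->second(op, stack);          // op defines its own variable kernel
      } else {
        variable_fallback_(op, stack);  // no variable kernel: use the fallback
      }
      return;
    }
    table.at(Key::CPU)(op, stack);      // backend kernel
  }

 private:
  std::map<std::string, std::map<Key, BoxedKernel>> kernels_;
  BoxedKernel variable_fallback_;
};

// Runs one custom-style op and one native-style op; returns the call trace.
std::vector<std::string> runDemo() {
  std::vector<std::string> trace;
  ToyDispatcher d;
  // Fallback: set the guard and re-dispatch so the backend kernel runs.
  d.setVariableFallback([&](const std::string& op, Stack* s) {
    trace.push_back("variable_fallback");
    SkipVariableGuard guard;
    d.callBoxed(op, s, /*is_variable=*/true);
  });
  // Like a custom op: only a backend kernel is registered.
  d.registerKernel("custom::double", Key::CPU, [&](const std::string&, Stack* s) {
    trace.push_back("cpu:custom::double");
    s->back() *= 2;
  });
  // Like a native_functions.yaml op: backend and variable kernels.
  d.registerKernel("native::neg", Key::CPU, [&](const std::string&, Stack* s) {
    trace.push_back("cpu:native::neg");
    s->back() = -s->back();
  });
  d.registerKernel("native::neg", Key::Variable, [&](const std::string& op, Stack* s) {
    trace.push_back("variable:native::neg");
    SkipVariableGuard guard;
    d.callBoxed(op, s, /*is_variable=*/true);
  });
  Stack s{3.0};
  d.callBoxed("custom::double", &s, /*is_variable=*/true);
  d.callBoxed("native::neg", &s, /*is_variable=*/true);
  trace.push_back(s.back() == -6.0 ? "result_ok" : "result_bad");
  return trace;
}
```

In this sketch, calling `runDemo()` routes the custom-style op through the fallback and the native-style op through its own variable kernel; in both cases the backend kernel runs only after the guard is set.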

See https://fb.quip.com/czCUA1NzMeEt for the detailed plan.

Differential Revision: D18542342

ezyang · 2019-11-18T14:58:24Z

aten/src/ATen/core/VariableFallbackKernel.cpp

+
+void variable_fallback_kernel(const OperatorHandle& op, Stack* stack) {
+    at::AutoNonVariableTypeMode _var_guard(true);
+    Dispatcher::singleton().callBoxed(op, stack);


ezyang (Nov 18, 2019): OK, this isn't really what I've been advocating for, but I'll let it slide for now.

smessmer (Nov 18, 2019): It's not? IIRC, we said we're setting the thread-local flag and re-dispatching. That's what I'm doing here. What would you do instead?

ezyang (Nov 18, 2019): We also need to set up grad_fn on the outputs so that they error if you try to backprop through them! Otherwise you'll just get requires_grad=False outputs, which will silently fail to backprop if someone uses them.

smessmer (Nov 18, 2019): Ah, that's what you mean. Yes, we should do that. This doesn't regress behavior, though; it's just as broken as it was before. If I remember correctly, there were some issues with adding the grad_fn, which means we can't do it right now, but I'll look at it in a separate PR.

ezyang (Nov 19, 2019): Yep, that's why I approved.
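The follow-up discussed above (attaching a grad_fn that errors on backprop) can be sketched with a toy model. This is not the real autograd API: `ToyTensor`, `callWithoutAutograd`, and `backwardFails` are hypothetical names, and the sketch only assumes that an output derived from a requires_grad input should fail loudly on backward instead of silently carrying no history.

```cpp
#include <cassert>
#include <functional>
#include <stdexcept>
#include <string>

// Toy stand-in for a tensor's autograd metadata; NOT the real at::Tensor.
struct ToyTensor {
  double value = 0.0;
  bool requires_grad = false;
  std::function<void()> grad_fn;  // empty means "no autograd history"
};

// Hypothetical helper: run a kernel that has no autograd support, then mark
// the output so that any attempt to backprop through it throws.
ToyTensor callWithoutAutograd(const std::string& op_name,
                              const ToyTensor& input,
                              const std::function<double(double)>& kernel) {
  ToyTensor out;
  out.value = kernel(input.value);
  if (input.requires_grad) {
    out.requires_grad = true;
    out.grad_fn = [op_name]() {
      throw std::runtime_error("derivative for " + op_name +
                               " is not implemented");
    };
  }
  return out;
}

// Returns true if backward raised an error, false if it silently did nothing.
bool backwardFails(const ToyTensor& t) {
  if (!t.grad_fn) {
    return false;  // the silent no-op that the review comment warns about
  }
  try {
    t.grad_fn();
  } catch (const std::runtime_error&) {
    return true;
  }
  return false;
}
```

With this shape, an input that requires grad yields an output for which `backwardFails` returns true, while an input that does not require grad behaves as before.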

Summary:
Pull Request resolved: pytorch#29934

Previously, when doing boxed dispatch (e.g. custom ops), the dispatcher manually removed the VariableTensorId flag before dispatching because custom ops don't have variable kernels. This is one of the blockers that prevented us from using the boxed dispatch mechanism for ops from native_functions.yaml because they define variable kernels and need them to be called for autograd.

This PR changes that. The dispatcher doesn't remove the VariableTensorId flag anymore. Instead, to make custom ops work, we implement a variable fallback kernel that is called whenever no other variable kernel was found.

ghstack-source-id: 94618474

Test Plan: unit tests

Differential Revision: D18542342

fbshipit-source-id: a30ae35d98f89f7ae507151f55c42cfbed54a451

smessmer requested review from dzhulgakov and ezyang November 15, 2019 23:10

ezyang reviewed Nov 18, 2019

ezyang approved these changes Nov 18, 2019

This was referenced Nov 22, 2019

[pytorch][PR] Convert KernelTable to a flat-indexed array rather than a hashtable. #30332

Closed

[pytorch][PR] Remove LeftRight from OperatorEntry and DispatchTable. #30333

Closed

Make Dispatcher::backendFallbackKernels_ an array #30340

Closed

smessmer mentioned this pull request Nov 23, 2019

[wip] Use codegen'ed unboxing wrappers #30370

Closed

smessmer added 3 commits November 24, 2019 13:43

facebook-github-bot closed this in d2336ed Nov 27, 2019

smessmer mentioned this pull request Dec 2, 2019

Binding Variable methods/functions in native_functions.yaml is awkward due to AutoNonVariableTypeMode #30102

Closed

facebook-github-bot deleted the gh/smessmer/117/head branch December 10, 2019 15:20

ezyang mentioned this pull request Apr 27, 2020

Pytorch 1.5.0 requires_grad being automatically set to false in C++ registered operators #37306

Closed

smessmer wants to merge 12 commits into gh/smessmer/117/base from gh/smessmer/117/head