
Make FakeTensors return meta device within kernel invocation, add FakeTensor op tests#78972

Closed
eellison wants to merge 16 commits intogh/eellison/306/basefrom
gh/eellison/306/head

Conversation

@eellison
Contributor

@eellison eellison commented Jun 6, 2022

Stack from ghstack (oldest at bottom):

Add FakeTensor Op tests, as well as necessary changes to make the tests pass.

  • Previously, FakeTensor(cpu, ...) would always return cpu when device was queried. While this is the behavior you want in userland code, within the kernel itself you want to compute as if everything is a meta tensor. For instance, TensorIterator contains a number of is_meta() checks that before this PR would return False, so it would proceed as if the FakeTensors were on CPU instead of meta, which led to segfaults among other incorrect behavior. Another example is in _linalg_check_errors. This PR changes FakeTensorMode to track whether we are inside a kernel invocation and, while that is True, report meta.

  • Extends the error checking in VariableType_* to also ignore Tensors with torch_dispatch defined, in addition to the existing check for when a torch_mode is defined.

  • A few small miscellaneous changes (adding a couple of new ops to be special-handled in FakeTensors, and a couple of meta registrations)
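The device-reporting behavior described in the first bullet can be sketched roughly as follows. This is a hypothetical, heavily simplified toy (the names mirror the PR's `FakeTensor`/`FakeTensorMode`, but none of this is the real PyTorch implementation): userland sees the faked device, while code running inside a kernel invocation sees `meta`.

```python
# Hypothetical sketch of the "report meta inside kernel invocations" idea.
# Not real PyTorch code: FakeTensor/FakeTensorMode here are toy stand-ins.
from contextlib import contextmanager


class FakeTensorMode:
    def __init__(self):
        # True only while we are executing inside a kernel
        # (e.g. while TensorIterator is running is_meta()-style checks).
        self.in_kernel_invocation = False

    @contextmanager
    def kernel_invocation(self):
        prev = self.in_kernel_invocation
        self.in_kernel_invocation = True
        try:
            yield
        finally:
            self.in_kernel_invocation = prev


class FakeTensor:
    def __init__(self, mode, fake_device):
        self.mode = mode
        self._fake_device = fake_device  # e.g. "cpu" or "cuda:0"

    @property
    def device(self):
        # Userland sees the faked device; kernels see "meta" so that
        # meta-device code paths are taken during computation.
        if self.mode.in_kernel_invocation:
            return "meta"
        return self._fake_device


mode = FakeTensorMode()
t = FakeTensor(mode, "cpu")
print(t.device)              # cpu  (userland view)
with mode.kernel_invocation():
    print(t.device)          # meta (inside the kernel)
print(t.device)              # cpu  (restored on exit)
```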

@facebook-github-bot
Contributor

facebook-github-bot commented Jun 6, 2022

🔗 Helpful links

✅ No Failures (0 Pending)

As of commit 37852f4 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.


@eellison eellison requested review from ezyang and samdow and removed request for mruberry, ngimel and soulitzer June 6, 2022 21:39
eellison added a commit that referenced this pull request Jun 6, 2022
… op tests

ghstack-source-id: d8c02c8
Pull Request resolved: #78972
@eellison eellison changed the title Make FakeTensors return meta within kerenl invocation, add FakeTensor op tests Make FakeTensors return meta device within kernel invocation, add FakeTensor op tests Jun 6, 2022
@ezyang
Contributor

ezyang commented Jun 7, 2022

This doesn't invalidate the pull request, but I want to make a higher-level point: we are going to need support for dynamic shapes soon, which means we are not actually going to be using the C++ meta function implementations; for example, we are not going to make TensorIterator symbolic-shape aware. Instead we will be using Python-side implementations to get dynamic-shape-aware meta tensor support. As a result it's not that important to make sure fake tensor works with TensorIterator (although it certainly is nice not to segfault).



class ComplexInputException(Exception):
    pass
Contributor


Did you run black on this file? It would be nice to have the formatting changes on their own.

if run_impl_check(func):
    return op_impl(self, func, *args, **kwargs)

self.in_kernel_invocation = True
Contributor


Naïvely I would've expected these to pair up with the `no_dispatch` call

Contributor Author


Do you mean put it inside the no_dispatch, or use _DisableTorchDispatch instead of self.in_kernel_invocation? I haven't thought too hard about it... it might be a good cleanup, although I still need to do a little more thinking about composability.
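One possible cleanup along the lines the reviewer raises (hypothetical, not the PR's actual code) is to pair flipping `in_kernel_invocation` with the dispatch guard in a single context manager, so the flag can never be left set if the kernel raises. `in_kernel` and the `SimpleNamespace` mode object below are illustrative stand-ins.

```python
# Hypothetical cleanup sketch: tie the flag's lifetime to a context manager
# so set/restore can never get out of sync with the dispatch guard.
from contextlib import contextmanager
from types import SimpleNamespace


@contextmanager
def in_kernel(mode):
    prev = mode.in_kernel_invocation
    mode.in_kernel_invocation = True
    try:
        # with no_dispatch():  # the real dispatch guard would wrap here
        yield
    finally:
        # Restored even if the kernel body raises.
        mode.in_kernel_invocation = prev


# Usage: the flag is always reset, including on error paths.
mode = SimpleNamespace(in_kernel_invocation=False)
try:
    with in_kernel(mode):
        assert mode.in_kernel_invocation
        raise RuntimeError("kernel failed")
except RuntimeError:
    pass
print(mode.in_kernel_invocation)  # False
```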

@eellison
Contributor Author

eellison commented Jun 7, 2022

@ezyang good point. There were a lot of issues not just within TensorIterator, though, such as copy_/clone, the linalg test failures, Distributions error checking, etc.

Other seemingly innocuous code like aten::outer could also lead to issues:

Running self.reshape({self.size(0), 1}) * vec2 within no_dispatch would break, because self.reshape would get confused by self having a different device (cpu) than its dispatch key.

Relatedly, it would be great not to have to handle something as simple as aten::outer in a TorchDispatchMode, but right now there is no way to decompose it (I will write up a fuller issue). If a decomposition exists for it in Python, you can just run that, but there is no equivalent way to take a composite kernel in C++ and run each operator with the mode enabled. If you don't wrap the kernel with no_dispatch, it will just recurse endlessly. I had some luck running the decompositions defined in Python for this, but coverage isn't all of the way there, and it proved unnecessary, at least for now, to get the tests passing.
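The endless-recursion problem described above can be illustrated with a toy model (hypothetical, not real PyTorch dispatch): with a mode enabled, naively "just running" an op re-dispatches, which re-enters the mode, forever; a Python-side decomposition breaks the cycle because the mode never re-dispatches the composite op itself. All names here (`dispatch`, `mode_handler`, `DECOMPS`) are invented for the sketch.

```python
# Toy model of the re-entrancy problem with composite ops under a dispatch
# mode. Hypothetical; not real PyTorch dispatcher code.
import sys

calls = {}


def dispatch(op, *args):
    # With the mode enabled, every dispatch lands in the mode handler first.
    return mode_handler(op, *args)


def mode_handler(op, *args):
    calls[op] = calls.get(op, 0) + 1
    # Naive handling: run the op "normally" -> re-dispatch -> re-enter mode.
    return dispatch(op, *args)


old_limit = sys.getrecursionlimit()
sys.setrecursionlimit(200)
try:
    dispatch("outer", "a", "b")
except RecursionError:
    print("endless recursion without no_dispatch")
finally:
    sys.setrecursionlimit(old_limit)


# With a Python decomposition registered, the mode runs the decomposed ops
# directly and never re-dispatches `outer` itself, so there is no cycle.
DECOMPS = {"outer": lambda a, b: ("mul", ("reshape", a), b)}


def mode_handler_with_decomp(op, *args):
    if op in DECOMPS:
        return DECOMPS[op](*args)
    return ("prim", op, args)


print(mode_handler_with_decomp("outer", "a", "b"))
```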

@ezyang
Contributor

ezyang commented Jun 7, 2022

I'm not sure what the best way to get the tests passing is, but our plan of record is to get all of the composites rewritten in Python.

eellison added a commit that referenced this pull request Jun 8, 2022
… op tests

ghstack-source-id: 692a600
Pull Request resolved: #78972
eellison added a commit that referenced this pull request Jun 8, 2022
… op tests

ghstack-source-id: 2a2f344
Pull Request resolved: #78972
eellison added a commit that referenced this pull request Jun 8, 2022
… op tests

ghstack-source-id: c19a309
Pull Request resolved: #78972
eellison added a commit that referenced this pull request Jun 8, 2022
… op tests

ghstack-source-id: bc8977f
Pull Request resolved: #78972
eellison added a commit that referenced this pull request Jun 8, 2022
… op tests

ghstack-source-id: 623a132
Pull Request resolved: #78972
@eellison
Contributor Author

eellison commented Jun 9, 2022

@pytorchbot merge

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@github-actions
Contributor

github-actions bot commented Jun 9, 2022

Hey @eellison.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

facebook-github-bot pushed a commit that referenced this pull request Jun 10, 2022
… op tests (#78972)

Summary:
Pull Request resolved: #78972

Approved by: https://github.com/ezyang

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/3c5a3ca9e89183ff3b9274fbe589fa205dc86be4

Reviewed By: osalpekar

Differential Revision: D37030335

Pulled By: eellison

fbshipit-source-id: c246709d73bca62690fa86c20d44816907da2541
@facebook-github-bot facebook-github-bot deleted the gh/eellison/306/head branch June 12, 2022 14:20