Add CPU Fallback #78522

Closed
eellison wants to merge 23 commits into gh/eellison/300/base from gh/eellison/300/head

Conversation

@eellison
Contributor

@eellison eellison commented May 31, 2022

Stack from ghstack (oldest at bottom):

This adds the option to fall back to CPU by converting tensors when an operator is not supported on meta tensors. The output of the CPU fallback operator must be a new, unaliased TensorImpl, because the conversion to CPU and back loses shared metadata between inputs and outputs.
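A minimal, torch-free sketch of the round trip the description refers to. All names here (`Fake`, `run_with_cpu_fallback`) are illustrative stand-ins, not the PR's actual code:

```python
class Fake:
    """Stand-in for a meta/fake tensor: carries only a 1-D shape, no data."""
    def __init__(self, shape):
        self.shape = shape

def run_with_cpu_fallback(func, *fakes):
    # Materialize zero-filled CPU stand-ins, run the real kernel on them,
    # then re-wrap the result as a *fresh* Fake. Any aliasing between
    # inputs and outputs cannot survive this round trip, which is why the
    # output must be a new, unaliased TensorImpl.
    concrete = [[0.0] * f.shape[0] for f in fakes]
    out = func(*concrete)
    return Fake((len(out),))
```

For example, "concatenating" a (3,) and a (2,) input yields a fresh (5,) output that shares nothing with either input.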

[ghstack-poisoned]
@facebook-github-bot
Contributor

facebook-github-bot commented May 31, 2022

🔗 Helpful links

✅ No Failures (1 Pending)

As of commit 48e1e46 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

@eellison eellison mentioned this pull request May 31, 2022
Elias Ellison added 2 commits May 31, 2022 09:16
try:
    yield
finally:
    cpu_fallback_enabled = orig
Contributor

Why not have this on the fake tensor mode itself?

Contributor Author

I guess... is this something that we view as temporary? If so, it might be nice not to commit it to the FakeTensorMode API. But I guess it's not that different either way. I can move it into the Mode.

Contributor Author

Also, there's no clear way for that to interact with FakeTensors that are constructed from FakeTensor.from_tensor.

Contributor

It is simpler and organizationally better for it to live on the mode, so yes I'd prefer it!

Re how tensors can get to it, that would be done by storing the mode on the tensor!

Contributor Author

@eellison eellison May 31, 2022

One thing I don't love about storing the Mode on the Tensor is that you're creating a circular reference from
FakeTensor(a) -> FakeTensorMode -> FakeTensorConverter -> input Tensor -> FakeTensor(a). (cc @samdow )

Contributor

Converter should have a weak ref to the input tensor
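A sketch of the suggestion: if the converter memoizes inputs through a `weakref.WeakValueDictionary`, the chain FakeTensor -> mode -> converter -> input tensor never forms a strong reference cycle. The `Converter`/`Obj` classes are illustrative stand-ins, not the actual FakeTensorConverter:

```python
import gc
import weakref

class Converter:
    def __init__(self):
        # Values are held weakly: once the input object is garbage
        # collected, its memo entry disappears on its own.
        self._memo = weakref.WeakValueDictionary()

    def remember(self, key, obj):
        self._memo[key] = obj

class Obj:
    """Stand-in for an input tensor."""
    pass
```

Dropping the last strong reference to the input removes its entry from the memo, so the converter never keeps inputs alive.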

try:
    r = run_function(func, types, args, kwargs)
except NotImplementedError as not_implemented_error:
    if not cpu_fallback_enabled:
        raise not_implemented_error
Contributor

A bare raise here is better, as doing it this way will destroy the original traceback info.

Contributor Author

Sorry, how is this different from the one I'm doing now?

try:
    r = func(*args, **kwargs)
except Exception:
    # original error is more informative
    raise orig_not_implemented_exception
Contributor

I think this will impede debugging. I suggest raising a fresh exception here but from orig_exception

Contributor Author

What does raising a fresh exception here but from orig_exception entail?

Contributor

Hmm, I didn't think it through. How about raise orig_not_implemented_exception from e, where e is the exception you caught here?
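A sketch of what `raise ... from e` gives you: the fresh exception records the caught one as `__cause__`, so both tracebacks are printed. Function names here are assumptions for illustration:

```python
def run_cpu_kernel():
    # stand-in for the CPU fallback path failing
    raise RuntimeError("cpu kernel failed")

def fallback():
    orig_not_implemented_exception = NotImplementedError("no meta kernel")
    try:
        run_cpu_kernel()
    except Exception as e:
        # `from e` chains the two errors: the NotImplementedError carries
        # the CPU-side failure in __cause__ instead of discarding it.
        raise orig_not_implemented_exception from e
```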

def throw_on_reused_impls(e):
    if e in tensor_impls:
        raise orig_not_implemented_exception

tree_map(throw_on_reused_impls, r)
Contributor

This seems very insufficient and also the case being tested for here feels like it can be handled.

It's insufficient in that this fallback will not work for view operations, and you won't capture errors here because view operators will return a fresh tensor with shared storage. Testing for shared storage is better.

It's handleable in that if an output of the cpu operation is exactly the same as the input, we know we can just return the original meta in that case and it will be OK.
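The storage-based check the reviewer suggests can be illustrated with plain Python buffers, where `memoryview.obj` plays the role of `Tensor.storage()` (a crude analogue, not torch's API):

```python
def shares_storage(a: memoryview, b: memoryview) -> bool:
    # A view operation returns a *fresh* object over the *same* buffer,
    # so identity of the underlying buffer, not of the view object, is
    # what detects aliasing -- analogous to comparing tensor storages.
    return a.obj is b.obj

buf = bytearray(8)
whole = memoryview(buf)
view = whole[2:6]   # "view op": new object, shared storage
```

An identity check on the view objects would miss this aliasing; the storage check catches it.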

Contributor Author

If one were to have an operator which both
a) is unsupported on meta, and
b) changes the input metadata,

then just returning the input meta value would not be sufficient.

Contributor

Yes. But you can also have an operator that doesn't return the input but does mutate the metadata and you have no way of detecting this. Well, tags would help.

eellison added 2 commits June 1, 2022 06:30
eellison added a commit that referenced this pull request Jun 1, 2022
ghstack-source-id: 4a1062d
Pull Request resolved: #78522
eellison added a commit that referenced this pull request Jun 2, 2022
ghstack-source-id: 06479c9
Pull Request resolved: #78522
eellison added a commit that referenced this pull request Jun 2, 2022
ghstack-source-id: 8cb6b3a
Pull Request resolved: #78522
eellison added 9 commits June 7, 2022 09:16
@eellison
Contributor Author

eellison commented Jun 8, 2022

@pytorchbot merge

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@github-actions
Contributor

github-actions bot commented Jun 8, 2022

Hey @eellison.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

ezyang pushed a commit to ezyang/pytorch that referenced this pull request Jun 9, 2022
ghstack-source-id: 85603cf
Pull Request resolved: pytorch#78522
facebook-github-bot pushed a commit that referenced this pull request Jun 10, 2022
Summary:
Pull Request resolved: #78522

Approved by: https://github.com/ezyang

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/fe7a13496eb87c620ec404a0c37fb243bdbc7c01

Reviewed By: osalpekar

Differential Revision: D37025735

Pulled By: eellison

fbshipit-source-id: 806330725980fece8d43650d9fdeb8eb6003ffd7
@facebook-github-bot facebook-github-bot deleted the gh/eellison/300/head branch June 12, 2022 14:20