fix grad thrashing of shape analysis #40939

eellison wants to merge 2 commits into gh/eellison/85/base
Conversation
[ghstack-poisoned]
💊 CI failures summary (Dr. CI) — As of commit 9e2e6d2: 💚 Looks good so far! There are no failures yet. 💚
```diff
  // its most constrained form.
  if (stack[i].isTensor())
-   node->outputs()[i]->inferTypeFrom(stack[i].toTensor());
+   auto tensor_type = node->outputs()[i]->type()->cast<TensorType>();
```
Reviewer: If we get rid of inferTypeFrom, we might be losing some useful information about tensor properties, which is probably the reason we tried running the operation in the first place. Shouldn't we do inferTypeFrom()->withRequiresGrad()?
Author (eellison): Yes, that's what we're doing with TensorType::create(stack[i].toTensor()).
Author (eellison): inferTypeFrom calls TensorType::create internally.
Author (eellison): I don't think we can call withRequiresGrad, because that would discard grad analysis that has already set it to false. I think we just want to preserve whatever our previous grad inference was.
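For readers following along, here is a minimal sketch of the approach described above: re-infer shape/dtype/device from the representative tensor, but carry over the previous grad inference, which may be unknown. This is not the PR's exact diff; it assumes the JIT TensorType API (TensorType::create, requiresGrad() returning c10::optional<bool>, and withRequiresGrad), and it slots into the output loop shown in the hunk above:

```cpp
if (stack[i].isTensor()) {
  torch::jit::Value* output = node->outputs()[i];
  // Whatever the previous analysis (if any) concluded about grad state;
  // requiresGrad() may be true, false, or unknown (c10::nullopt).
  auto prev_type = output->type()->cast<TensorType>();
  // Infer shape/dtype/device from the representative tensor -- this is
  // what inferTypeFrom does under the hood via TensorType::create.
  auto inferred = TensorType::create(stack[i].toTensor());
  if (prev_type) {
    // Carry the old requires_grad over verbatim instead of letting the
    // representative input (which never requires grad) clobber it.
    inferred = inferred->withRequiresGrad(prev_type->requiresGrad());
  }
  output->setType(inferred);
}
```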
```diff
+ torch._C._jit_pass_complete_shape_analysis(foo.graph, (torch.tensor([0.39]),), False)
+
+ # requires_grad property shouldn't be accidentally set by shape analysis
+ self.assertTrue(foo.graph.findNode("aten::sub").output().requiresGrad() is None)
```
Reviewer: Couldn't we actually infer, in this particular case, that aten::sub doesn't need requires_grad?
Author (eellison): No. We haven't done grad analysis, so we don't know whether it requires grad or not.
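For context, the grad property on a TensorType is tri-state, which is why the test expects None rather than False. An illustrative sketch (not code from the PR), using c10::optional directly:

```cpp
#include <c10/util/Optional.h>

int main() {
  // TensorType stores requires_grad as a tri-state c10::optional<bool>.
  // Shape analysis may refine sizes and dtype, but until grad analysis
  // runs, the optional stays unset -- surfacing as None in Python from
  // Value's requiresGrad().
  c10::optional<bool> requires_grad = c10::nullopt; // "unknown"
  return requires_grad.has_value() ? 0 : 1;         // 1: still unresolved
}
```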
Summary:
Pull Request resolved: pytorch#40939

Previously, when we would do shape analysis by running the op with representative inputs, we would always set the grad property to false. This led to a wrong static analysis when we would create differentiable subgraphs, and propagate shapes without also propagating requires_grad, and then uninline them.

Test Plan: Imported from OSS
Differential Revision: D22394676
Pulled By: eellison
fbshipit-source-id: 254e6e9f964b40d160befe0e125abe1b7aa2bd5e
* [JIT] fix unfold shape analysis (#40749)
  Summary: unfold on a 0-dimensioned tensor returns a 1-dim tensor
  Pull Request resolved: #40749
  Differential Revision: D22361481
  Pulled By: eellison
  fbshipit-source-id: 621597e5f97f6e39953eb86f8b85bb4142527a9f

* shape analysis fix for default dtype
  ghstack-source-id: 723aa27
  Pull Request resolved: #40938

* fix grad thrashing of shape analysis
  ghstack-source-id: dd8742b
  Pull Request resolved: #40939

Co-authored-by: Elias Ellison <eellison@fb.com>
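As an aside, the unfold fact from #40749 is easy to check from C++. A hedged ATen sketch (assumes at::scalar_tensor and Tensor::unfold, both standard ATen calls):

```cpp
#include <ATen/ATen.h>
#include <iostream>

int main() {
  at::Tensor scalar = at::scalar_tensor(3.14); // 0-dim tensor
  // unfold on a 0-dim tensor yields a 1-dim tensor -- the behavior the
  // #40749 shape-analysis fix encodes.
  at::Tensor unfolded = scalar.unfold(/*dimension=*/0, /*size=*/1, /*step=*/1);
  std::cout << unfolded.dim() << "\n"; // prints 1
}
```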
This reverts commit 62da092.
Stack from ghstack:
Previously, when we would do shape analysis by running the op with representative inputs, we would always set the grad property to false. This led to a wrong static analysis when we would create differentiable subgraphs, and propagate shapes without also propagating requires_grad, and then uninline them.
Differential Revision: D22394676