
addr ref#78014

Closed
ezyang wants to merge 9 commits into gh/ezyang/1177/base from gh/ezyang/1177/head

Conversation

ezyang (Contributor) commented May 20, 2022

Stack from ghstack (oldest at bottom):

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
facebook-github-bot (Contributor) commented May 20, 2022


✅ No Failures (0 Pending)

As of commit bcec376 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
@ezyang ezyang mentioned this pull request May 20, 2022
dispatch:
CPU, CUDA: addr
CompositeImplicitAutograd: math_addr
CompositeExplicitAutograd: math_addr
Collaborator:

I thought it had to be CompositeImplicitAutograd for automatic backward?

Contributor Author:

For some reason, although we have an explicit addr derivative formula, XLA didn't want to use it. I'm forcing XLA to use our old formula now and I want to see what happens.

Contributor Author:

XLA tests passed so looks like we're good to go here

check(vec1.ndim == 1, lambda: f"addr: Expected 1-D argument vec1, but got {vec1.ndim}-D")
check(vec2.ndim == 1, lambda: f"addr: Expected 1-D argument vec2, but got {vec2.ndim}-D")
self = self.expand(vec1.shape[0], vec2.shape[0])
check(self.ndim == 2, lambda: f"2D tensor expected, got {self.ndim}D tensor for input")
Collaborator:

Wouldn't expand fail if self.ndim wasn't 2?

Contributor Author:

I guess we should update the original code lol

check(vec2.ndim == 1, lambda: f"addr: Expected 1-D argument vec2, but got {vec2.ndim}-D")
self = self.expand(vec1.shape[0], vec2.shape[0])
check(self.ndim == 2, lambda: f"2D tensor expected, got {self.ndim}D tensor for input")
check(self.shape[0] == vec1.shape[0] and self.shape[1] == vec2.shape[0],
Collaborator:

same here, isn't it guaranteed by expand?

check(self.ndim == 2, lambda: f"2D tensor expected, got {self.ndim}D tensor for input")
check(self.shape[0] == vec1.shape[0] and self.shape[1] == vec2.shape[0],
lambda: f"size mismatch, input: {self.shape}, v1: {vec1.shape}, v2: {vec2.shape}")
if utils.is_boolean_dtype(self.dtype):
Collaborator:

should you also check alpha dtype (it should be of the same type kind as other inputs)?
@mruberry do you want to make a case again for alpha checking being done in the wrapper?

Contributor Author:

Integers are OK though

>>> torch.addr(torch.tensor([[False]]), torch.tensor([False]), torch.tensor([True]), beta=1, alpha=2)
tensor([[False]])

Collaborator:

Extending the type promotion wrapper would probably make life easier

I think we should say that integer numbers can interact with bools and it's fine, however

Contributor Author:

I tried the obvious thing which is

@elementwise_type_promotion_wrapper(
    type_promoting_args=("self", "vec1", "vec2", "beta", "alpha"),
    type_promotion_kind=ELEMENTWISE_TYPE_PROMOTION_KIND.DEFAULT,
)       

but this doesn't work, as an integer alpha causes the entire tensor to promote to int64. I'm not sure what the recommended spelling of this change would be; it seems to have far-reaching effects.
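To illustrate the promotion behavior being described, here is a toy promotion lattice; this is a deliberate simplification for illustration (PyTorch's real rules also treat Python scalars specially), and `promote`/`dtype_of` are made-up helpers, not real torch APIs. It shows why listing an integer `alpha` among the type-promoting args drags bool tensor inputs up to int64 under DEFAULT promotion: the result dtype is the least upper bound of every participating argument's dtype.

```python
# Toy promotion lattice, ordered from weakest to strongest dtype kind.
PROMOTION_ORDER = ["bool", "int64", "float32", "complex64"]

def promote(*dtypes):
    # Least upper bound: the highest-ranked dtype among the arguments.
    return max(dtypes, key=PROMOTION_ORDER.index)

def dtype_of(scalar):
    # Hypothetical helper mapping a Python scalar to a dtype kind.
    if isinstance(scalar, bool):   # check bool first: bool subclasses int
        return "bool"
    if isinstance(scalar, int):
        return "int64"
    if isinstance(scalar, float):
        return "float32"
    return "complex64"

# self, vec1, vec2 are bool tensors, but alpha=2 is an integer scalar:
print(promote("bool", "bool", "bool", dtype_of(2)))  # int64
```

With only the three bool tensors participating, the result would stay bool; the integer scalar is what pushes the whole computation to int64.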

ezyang added 2 commits May 20, 2022 17:20
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
ezyang added a commit that referenced this pull request May 21, 2022
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: f570d56
Pull Request resolved: #78014
@ezyang ezyang requested a review from Chillee May 21, 2022 02:43
vec2.ndim == 1,
lambda: f"addr: Expected 1-D argument vec2, but got {vec2.ndim}-D",
)
self = self.expand(vec1.shape[0], vec2.shape[0])
Collaborator:

This unconditional expand seems odd -- maybe we can use a construct like _maybe_expand_to?

Contributor Author:

Here is the original code:

  check_1d(vec1, "vec1", "addr");
  check_1d(vec2, "vec2", "addr");
    
  const auto vec1_size0 = vec1.sizes()[0];
  const auto vec2_size0 = vec2.sizes()[0];
  auto self_ = &result == &self
    ? c10::MaybeOwned<Tensor>::borrowed(self)
    : expand_size(self, {vec1_size0, vec2_size0}, "addr");
  TORCH_CHECK(
    self_->dim() == 2,
    "2D tensor expected, got ", self_->dim(), "D tensor for input"
  );    
  TORCH_CHECK(
    self_->sizes()[0] == vec1_size0 && self_->sizes()[1] == vec2_size0,
    "size mismatch, input: ", self_->sizes(),
    ", v1: ", vec1.sizes(),
    ", v2: ", vec2.sizes()
  );

It looks like the expand is done unconditionally, so I faithfully represented this. Are we expected to simplify the logic when writing refs?

Collaborator:

It'd be nice if we improve it, sure -- if we don't do it now it's just more time and review later

Contributor Author:

_maybe_expand_to is... not a thing?

Collaborator:

How do you mean?

self.ndim == 2, lambda: f"2D tensor expected, got {self.ndim}D tensor for input"
)
check(
self.shape[0] == vec1.shape[0] and self.shape[1] == vec2.shape[0],
Collaborator:

Won't the expand have already asserted this?

Contributor Author:

Once again, faithful representation of the original code :) My rule for porting: first do a slavish port, and then once that passes all tests, then refactor.

Collaborator:

haha, OK I guess -- but some cleanups seem easy to snipe

torch.outer(vec1, vec2) if alpha else torch.full_like(self, False),
)
else:
return beta * self + alpha * torch.outer(vec1, vec2)
Collaborator:

I don't think these decompositions for addr are correct. When beta==0, for example, NaNs in self should not propagate
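A plain-Python illustration of this point (no torch needed): in IEEE-754 arithmetic, `0 * nan` is `nan`, so a naive `beta * self + alpha * outer(vec1, vec2)` propagates NaNs from `self` even when `beta == 0`, whereas per the comment above the eager semantics should drop the `self` term in that case. The scalar variables here stand in for single matching entries of the tensors.

```python
import math

self_entry = math.nan      # a NaN entry of `self`
beta, alpha = 0.0, 1.0
outer_entry = 2.0          # the matching entry of outer(vec1, vec2)

# Naive decomposition: 0 * nan -> nan, so the NaN leaks through.
naive = beta * self_entry + alpha * outer_entry

# Guarded version: skip the `self` term entirely when beta == 0.
guarded = (alpha * outer_entry if beta == 0
           else beta * self_entry + alpha * outer_entry)

print(math.isnan(naive))   # True: the NaN propagates in the naive form
print(guarded)             # 2.0: the guard avoids touching `self`
```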

Collaborator (mruberry) commented May 22, 2022:

(this will require adding reference inputs for addr that test the NaN propagation, too!)

Contributor Author:

btw, in the original code, even if alpha==0 the NaNs still propagate

# No meta support for torch.tensor
DecorateInfo(unittest.expectedFailure, 'TestCommon', 'test_python_reference_meta_functions',),
# RuntimeError: no _refs support for torch.outer
DecorateInfo(unittest.expectedFailure, 'TestCommon', 'test_python_reference_consistency',),
Collaborator:

Eeeks! We should probably get that mode implemented before we start writing refs with torch functions if they have to skip the reference consistency test

Contributor Author:

Well, this is what you asked for, but you are welcome to change your mind ;)

> We will have a context manager that will reroute torch.* API calls to torch._ref.* (failing if the ref doesn't exist),

and

> do you have a preference for two separate tests (but then I need to dupe all the xfails for the new test name I added) or one test (but then an xfail will prevent us checking that the torch.* version works)

> one test if it’s easier

My preferred resolution is to maintain only one test which does NOT test with the mode (so torch goes to torch, refs go to refs), and not test anything else. If we insist on testing the strict mode, we will need to add an extra set of tests, because any ref that calls into a torch function that doesn't have a ref will fail in strict mode. We can also test in non-strict mode, in which case you don't get any signal if all torch functions being called have refs or not.
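A minimal sketch of the rerouting idea under discussion, not PyTorch's actual implementation; `fake_torch`, `fake_refs`, and `reroute_to_refs` are all made-up names for illustration. A context manager swaps functions on a torch-like namespace for their refs counterparts, failing loudly in strict mode when no ref exists, and restores the originals on exit.

```python
import contextlib
from types import SimpleNamespace

# Stand-ins for the torch and torch._refs namespaces.
fake_torch = SimpleNamespace(add=lambda a, b: a + b)
fake_refs = SimpleNamespace(add=lambda a, b: ("ref", a + b))

@contextlib.contextmanager
def reroute_to_refs(torch_ns, refs_ns, strict=True):
    saved = {}
    for name in list(vars(torch_ns)):
        ref = getattr(refs_ns, name, None)
        if ref is None:
            if strict:
                raise RuntimeError(f"no ref for torch.{name}")
            continue  # non-strict: leave the original function in place
        saved[name] = getattr(torch_ns, name)
        setattr(torch_ns, name, ref)
    try:
        yield
    finally:  # always restore the originals on exit
        for name, fn in saved.items():
            setattr(torch_ns, name, fn)

with reroute_to_refs(fake_torch, fake_refs):
    print(fake_torch.add(1, 2))  # ('ref', 3): the ref was called
print(fake_torch.add(1, 2))      # 3: original restored on exit
```

This also shows the tension described above: in strict mode, any op in the namespace that lacks a ref makes entering the context fail, which is exactly why a ref calling into an un-ref'd torch function would trip the strict test.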

Collaborator:

Yes, I have a PR that will be available soon that separates the tests, so we can still test for consistency even without refs

Collaborator:

FYI #78026 has landed updating and separating the tests (may require a rebase)

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
ezyang added a commit that referenced this pull request May 23, 2022
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 4c3126b
Pull Request resolved: #78014
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
ezyang added a commit that referenced this pull request May 23, 2022
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 8d84855
Pull Request resolved: #78014
ezyang (Contributor Author) commented May 23, 2022

I think I hit most PR comments. If we're going to have to do this by hand for every op, this is going to take a very long time to do properly.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
ezyang added a commit that referenced this pull request May 24, 2022
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 004e626
Pull Request resolved: #78014
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
ezyang added a commit that referenced this pull request May 24, 2022
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 14bf401
Pull Request resolved: #78014
ezyang (Contributor Author) commented May 25, 2022

@pytorchbot merge this

github-actions (Contributor) commented:

Hey @ezyang.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

facebook-github-bot pushed a commit that referenced this pull request May 26, 2022
Summary:
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: #78014

Approved by: https://github.com/ngimel

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/a1765f0176250db2b8ce7b543884ec5600ece14c

Reviewed By: mehtanirav

Differential Revision: D36668872

Pulled By: ezyang

fbshipit-source-id: 35a266e666bf812aa7df9594168ae82cd7d63aa1
@facebook-github-bot facebook-github-bot deleted the gh/ezyang/1177/head branch May 28, 2022 14:17