
Move argument grouping into FunctionSchema#48195

Closed
ezyang wants to merge 5 commits into gh/ezyang/869/base from gh/ezyang/869/head

Conversation

@ezyang
Contributor

@ezyang ezyang commented Nov 18, 2020

Stack from ghstack:

The general approach is to change Arguments, splitting `positional`, `kwarg_only` and `out`, into `pre_self_positional`, `self_arg`, `post_self_positional`, and `pre_tensor_options_kwarg_only`, `tensor_options` and `post_tensor_options_kwarg_only`. The splits are as you'd expect: we extract out the self argument and the tensor options arguments, and record the other arguments that came before and after. To do this, we move the logic in `group_arguments` to the parsing process.

Some fuzz in the process:

  • I renamed `ThisArgument` to `SelfArgument`, since we don't actually use the terminology "this" outside of C++ (and the model is Python-biased)
  • I kept the `group_arguments` function, which now just reads out the arguments from the structured model in the correct order. In the long term, we should get rid of this function entirely, but for now I kept it as is to reduce churn.
  • I decided to arbitrarily say that when `self` is missing, everything goes in "post-self", but when tensor options is missing, everything goes in "pre-tensor-options". This was based on where you typically find the argument in question: `self` is usually at the front (so most args come after it), while tensor options are typically at the end (so most args come before it).
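The split described above can be sketched as a small dataclass. This is a simplified, illustrative stand-in for the real model in tools/codegen: plain strings substitute for the actual `Argument`/`SelfArgument`/`TensorOptionsArguments` types, and only the grouping logic is shown.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Plain strings stand in for the real model types (hypothetical simplification).
Argument = str

@dataclass(frozen=True)
class Arguments:
    # positional = pre_self_positional + self_arg + post_self_positional
    pre_self_positional: Tuple[Argument, ...]
    self_arg: Optional[Argument]
    post_self_positional: Tuple[Argument, ...]
    # kwarg_only = pre_tensor_options_kwarg_only + tensor_options
    #            + post_tensor_options_kwarg_only
    pre_tensor_options_kwarg_only: Tuple[Argument, ...]
    tensor_options: Optional[Tuple[Argument, ...]]
    post_tensor_options_kwarg_only: Tuple[Argument, ...]
    out: Tuple[Argument, ...]

    def positional(self) -> Tuple[Argument, ...]:
        # When self is missing, everything lives in post_self_positional.
        mid = (self.self_arg,) if self.self_arg is not None else ()
        return self.pre_self_positional + mid + self.post_self_positional

    def kwarg_only(self) -> Tuple[Argument, ...]:
        # When tensor options are missing, everything lives in "pre".
        to = self.tensor_options or ()
        return (self.pre_tensor_options_kwarg_only + to
                + self.post_tensor_options_kwarg_only)

# empty.names has no self, so all positional args land in post-self.
empty_names = Arguments(
    pre_self_positional=(), self_arg=None,
    post_self_positional=("int[] size",),
    pre_tensor_options_kwarg_only=("Dimname[]? names",),
    tensor_options=("ScalarType? dtype=None", "Layout? layout=None",
                    "Device? device=None", "bool? pin_memory=None"),
    post_tensor_options_kwarg_only=("MemoryFormat? memory_format=None",),
    out=(),
)
```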

Signed-off-by: Edward Z. Yang ezyang@fb.com

Differential Revision: D25231166

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
@ezyang
Contributor Author

ezyang commented Nov 18, 2020

sorry short on time, no description; coming soon

@dr-ci

dr-ci bot commented Nov 18, 2020

💊 CI failures summary and remediations

As of commit 00c8118 (more details on the Dr. CI page):



🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build pytorch_linux_bionic_py3_8_gcc9_coverage_test2 (1/1)

Step: "Run tests" (full log | diagnosis details | 🔁 rerun)

Dec 02 04:56:20 [E request_callback_no_python.cpp:636] Received error while processing request type 258: RuntimeError: Can not pickle torch.futures.Future
Dec 02 04:56:20 At: 
Dec 02 04:56:20   /opt/conda/lib/python3.8/site-packages/torch/distributed/rpc/internal.py(120): serialize 
Dec 02 04:56:20   /opt/conda/lib/python3.8/site-packages/torch/distributed/rpc/internal.py(172): serialize 
Dec 02 04:56:20  
Dec 02 04:56:21 ok (2.453s) 
Dec 02 04:56:22   test_return_future_remote (__main__.ProcessGroupRpcTestWithSpawn) ... RPC was initialized with the PROCESS_GROUP backend which is deprecated and slated to be removed and superseded by the TENSORPIPE backend. It is recommended to migrate to the TENSORPIPE backend. 


Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
ezyang added a commit that referenced this pull request Nov 18, 2020
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 85a385f
Pull Request resolved: #48195
@ezyang ezyang mentioned this pull request Nov 18, 2020

@bhosmer bhosmer left a comment


Just a quick drive-by based on the PR description, front-running a full review later: the choices you're describing in your last bullet point kind of point to the brittleness of modeling the sequence in a maximally partitioned way like this. An alternative (I've pitched this before) is to model self and tensor options as optional pairs of (thing, insertion index) - it's true that the indexes wouldn't be correct by construction, but it avoids the pre-post splintering, and the arbitrary choices (and dynamic checks) for degenerate cases. Still worth modeling args and kwargs as separate lists ofc.

@ezyang
Contributor Author

ezyang commented Nov 18, 2020

@bhosmer Engh, I still don't like it. I gave one reason (tricky bookkeeping if you ever need multiple indices--though tbf we don't have this right now) in our original conversation. But another reason why I don't really like splicing arguments into the lists is that the list prior to splicing isn't really usable for... anything, really.

Imagine:

class Arguments:
    kwarg_only: Tuple[Argument, ...]
    tensor_options: Optional[Tuple[TensorOptionsArguments, int]]

Does `kwarg_only` contain tensor options arguments or not? You seem to be favoring "make illegal states unrepresentable", so let's say that it doesn't (because if it did, we would need an extra consistency check that the duplicate copies in `kwarg_only` and `tensor_options` agree). So then, given the schema `empty.names(int[] size, *, Dimname[]? names, ScalarType? dtype=None, Layout? layout=None, Device? device=None, bool? pin_memory=None, MemoryFormat? memory_format=None) -> Tensor`, we have something like this:

Arguments(
  kwarg_only=("Dimname[]? names", "MemoryFormat? memory_format=None"),
  tensor_options=(("ScalarType? dtype=None", "Layout? layout=None", "Device? device=None", "bool? pin_memory=None"), 1),
)

Using the `kwarg_only` field directly is useless. There is never a time I want `names` and `memory_format` to be adjacent in an arguments list. I have to do the splice first, or split this into the pre and post parts, before I can do anything useful with it. In the pre/post variant, I have pre-done the splitting, because that's what I'm ~always going to want to have happened.

If it's just getting rid of illegal states that we want, the pre/post version can be made to play ball like this:

class Arguments:
    pre_kwarg_only: Tuple[Argument, ...]
    tensor_options_and_post_kwarg_only: Optional[Tuple[TensorOptionsArguments, Tuple[Argument, ...]]]

but the extra typing here didn't seem worth it to me, at least.
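Concretely, with the pre/post representation the flat kwarg-only list falls out of plain concatenation, with no splice index to maintain. A toy sketch using the empty.names arguments, with strings standing in for the model types:

```python
# Hypothetical pre/post split of empty.names' kwarg-only arguments.
pre_tensor_options_kwarg_only = ("Dimname[]? names",)
tensor_options = ("ScalarType? dtype=None", "Layout? layout=None",
                  "Device? device=None", "bool? pin_memory=None")
post_tensor_options_kwarg_only = ("MemoryFormat? memory_format=None",)

# Reading out the flat list is pure concatenation: the "splitting"
# has already been done at parse time.
kwarg_only = (pre_tensor_options_kwarg_only
              + tensor_options
              + post_tensor_options_kwarg_only)
```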

@bhosmer

bhosmer commented Nov 18, 2020

@ezyang thanks for writing out the example, it hits the nail on the head. You're exactly right, the taste/distaste for this approach comes down to whether you feel like the ground representation should be "useful" on its own, or if it's just raw material for derived views.

Will a full review introduce me to client code that uses the pre/post fragments directly (like in a semantically meaningful way)? If so then I'd agree that what I'm pitching is probably a net loss because it'd have to add derived views for both the full list and the fragments. I'd assumed the full list was the only view that mattered, which I think would put the two representations roughly at parity in terms of view complexity.

You might also be reacting to (per your example) kwargs_only being misreadable as a finished list, but I'd think naming and/or commenting would take care of that. But is that part of it?

if self.tensor_options is not None:
    ret.extend(self.tensor_options.all())
ret.extend(self.post_tensor_options_kwarg_only)
return ret
Contributor Author


This and positional above are a use of the fields.



Ah ok, this is what I meant by derived views. By parity I meant that the above and something like below (mod typos) are basically the same complexity, and you wouldn't have to tiebreak the degenerate representations.

non_self_positional: Tuple[Argument, ...]
self_arg: Optional[Tuple[SelfArgument, int]]

non_tensor_options_kwarg_only: Tuple[Argument, ...]
tensor_options: Optional[Tuple[TensorOptionsArguments, int]]

def positional(self) -> Sequence[Argument]:
  ret: List[Argument] = []
  ret.extend(self.non_self_positional)
  if self.self_arg is not None:
    arg, i = self.self_arg
    ret.insert(i, arg)
  return ret

def kwarg_only(self) -> Sequence[Argument]:
  ret: List[Argument] = []
  ret.extend(self.non_tensor_options_kwarg_only)
  if self.tensor_options is not None:
    args, i = self.tensor_options
    ret[i:i] = args.all()
  return ret

But TBH it's not worth litigating, the way it is currently is fine by me if you just like it better. My main motivation here is to sketch the alternative out concretely, not to push super hard for its adoption.
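For comparison, the splice-index alternative sketched above can be made runnable like so (strings again stand in for the model types; names follow the sketch, not the actual codegen):

```python
from typing import List, Optional, Sequence, Tuple

non_tensor_options_kwarg_only: Tuple[str, ...] = (
    "Dimname[]? names", "MemoryFormat? memory_format=None")
# (packed tensor-options arguments, index to splice them back in at)
tensor_options: Optional[Tuple[Tuple[str, ...], int]] = (
    ("ScalarType? dtype=None", "Layout? layout=None",
     "Device? device=None", "bool? pin_memory=None"), 1)

def kwarg_only() -> Sequence[str]:
    ret: List[str] = list(non_tensor_options_kwarg_only)
    if tensor_options is not None:
        args, i = tensor_options
        ret[i:i] = args  # splice the packed group at its recorded index
    return ret
```

Both representations flatten to the same list for empty.names; the trade-off is whether the unspliced `non_tensor_options_kwarg_only` (where `names` and `memory_format` sit adjacent) is a view anyone ever wants on its own.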

args.extend(func.arguments.pre_tensor_options_kwarg_only)
if func.arguments.tensor_options is not None:
    args.append(func.arguments.tensor_options)
args.extend(func.arguments.post_tensor_options_kwarg_only)
Contributor Author


This is another use of the fields, although the intention is to kill this function and make use of the fields directly.

@ezyang
Contributor Author

ezyang commented Nov 18, 2020

Will a full review introduce me to client code that uses the pre/post fragments directly (like in a semantically meaningful way)?

I went ahead and marked the relevant spots. But I didn't really try to refactor anything else after this, so most of the usage is just "squashing" the individual parts back into the list-of-union representation, which is what the rest of the code uses right now.

Projecting ahead to further refactors, the other things most likely to happen:

  1. For methods, you need to drop the `self` parameter when it doesn't show up in the C++ signature (so actually your rep is good for this)
  2. When translating to cpp/dispatcher, depending on `use_c10_full` you will either expand or keep packed the tensor options argument, but it will always be in the same position compared to the other arguments. So splicing will be necessary here
  3. Whatever happens to Faithful out arguments #47712; but most of the heat here is on out arguments, so this is essentially the same as (2) (where you are going to do the translation to cpp/dispatcher)

You might also be reacting to (per your example) kwargs_only being misreadable as a finished list, but I'd think naming and/or commenting would take care of that.

You got me, I was intentionally being a little hyperbolic in the example naming ;)


@bhosmer bhosmer left a comment


LGTM!

# If there is enough space...
if i <= len(kwarg_only) - len(predicates):
    # And the next len(predicates) arguments look like TensorOptions arguments
    if all(p(a) for p, a in zip(predicates, kwarg_only[i : i + len(predicates)])):


It still feels like a small devex pothole to be failing silently on near misses here, but probably out of scope for this work.

Contributor Author


Yeah, this is preexisting code, so out of scope for this PR. Note that we do have plenty of "near-ish misses" (e.g., one or two arguments from TensorOptions) in various functions, including

- func: to.device(Tensor self, Device device, ScalarType dtype, bool non_blocking=False, bool copy=False, MemoryFormat? memory_format=None) -> Tensor
  use_c10_dispatcher: full
  variants: method
  device_guard: False

so it's not altogether clear to me what a "you mistyped this" error message would look like.
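A minimal sketch of the kind of pattern match under discussion, where the predicates and argument spellings are illustrative stand-ins rather than the actual codegen: a full TensorOptions run is recognized, and anything else, including near misses like to.device's lone dtype argument, silently falls through.

```python
from typing import Callable, List, Sequence

# One predicate per expected TensorOptions field, in declaration order
# (illustrative: the real codegen matches on parsed Argument objects).
predicates: List[Callable[[str], bool]] = [
    lambda a: a.startswith("ScalarType"),
    lambda a: a.startswith("Layout"),
    lambda a: a.startswith("Device"),
    lambda a: a.startswith("bool? pin_memory"),
]

def find_tensor_options(kwarg_only: Sequence[str]) -> int:
    """Return the start index of a full TensorOptions run, or -1.

    A partial run (one or two tensor-options-like arguments) fails the
    all(...) check and falls through silently: the "near miss" behavior
    noted above.
    """
    for i in range(len(kwarg_only)):
        # Only attempt a match if there is enough space left...
        if i <= len(kwarg_only) - len(predicates):
            # ...and the next len(predicates) arguments all look the part.
            if all(p(a) for p, a in
                   zip(predicates, kwarg_only[i:i + len(predicates)])):
                return i
    return -1
```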


@bhosmer bhosmer Nov 30, 2020


Yeah, the line isn't trivially easy to draw. I guess if the goal state is "nobody uses TensorOptions except for factory functions in the user C++ API", then the most robust setup might be to drive the gather logic from an explicit annotation in native_functions rather than a pattern match - could even just use `category_override: factory`.

ezyang added a commit to ezyang/pytorch that referenced this pull request Dec 1, 2020
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 249fbfa
Pull Request resolved: pytorch#48195
ezyang added a commit to ezyang/pytorch that referenced this pull request Dec 2, 2020
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 249fbfa
Pull Request resolved: pytorch#48195
ezyang added a commit to ezyang/pytorch that referenced this pull request Dec 2, 2020
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: cd0aa6e
Pull Request resolved: pytorch#48195
@facebook-github-bot
Contributor

@ezyang merged this pull request in 742903c.

shaibagon pushed a commit to shaibagon/pytorch that referenced this pull request Dec 3, 2020
Summary:
Pull Request resolved: pytorch#48195

Test Plan: Imported from OSS

Reviewed By: zhangguanheng66

Differential Revision: D25231166

Pulled By: ezyang

fbshipit-source-id: 25d77ad8319c4ce0bba4ad82e451bf536ef823ad
@facebook-github-bot deleted the gh/ezyang/869/head branch December 6, 2020 15:17
ezyang added a commit that referenced this pull request Dec 8, 2020
Previously, this function had nontrivial algorithmic content,
but after #48195, this was just a swiss army knife for pasting
together arguments while maintaining structure.  I added some
more properties for Arguments for convenient access in this way,
and then inlined the implementation of group_arguments into all of its call
sites, simplifying whenever contextual.  This might be controversial, but I
think the resulting code is easier to understand.

You may notice that there is some modest code duplication between
dispatcher.cpparguments_exprs and CppSignature.argument_packs.
This is a known problem and I will be attempting to fix it in
a follow up PR.

Confirmed to be byte-for-byte compatible.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

[ghstack-poisoned]
ezyang added a commit that referenced this pull request Dec 8, 2020

ghstack-source-id: 7ff71ff
Pull Request resolved: #49043
ezyang added a commit that referenced this pull request Dec 9, 2020
ezyang added a commit that referenced this pull request Dec 10, 2020
facebook-github-bot pushed a commit that referenced this pull request Dec 11, 2020
Summary:
Pull Request resolved: #49043

Test Plan: Imported from OSS

Reviewed By: H-Huang

Differential Revision: D25455885

Pulled By: ezyang

fbshipit-source-id: 8fbe066e8c3cb7ee8adb5b87296ec5bd7b49e01f