
[JIT] Normalize op aliases #38735

Closed
eellison wants to merge 4 commits into gh/eellison/80/base from gh/eellison/80/head

Conversation

@eellison (Contributor) commented May 19, 2020

Stack from ghstack:

Follow up to my comment in #36597.

This adds a pass to convert op aliases into a normalized form. Having two ops in our IR that do the same thing makes the IR harder to work with for downstream consumers, such as TorchScript passes but also ONNX, Glow, etc.

Another solution would have been to fix our code generation to only emit `aten::abs` from the start. That seems trickier, and doesn't really buy us much if we still have to expose `aten::absolute` in C++, as @glaringlee of the C++ API thinks we should.

Bike shedding: maybe this should be `CanonicalizeOps` instead
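To make the idea concrete, here is a toy Python sketch of what the normalization amounts to: a lookup table rewriting each aliased op to its canonical symbol. This is not the actual pass (which rewrites `Graph` nodes in C++); the names below only mirror the `aten::absolute` → `aten::abs` mapping from the PR.

```python
# Toy sketch of alias normalization: a table maps each aliased op name
# to its canonical form; ops without an alias pass through unchanged.
ALIAS_MAP = {
    "aten::absolute": "aten::abs",
    "aten::absolute_": "aten::abs_",
}

def normalize_op_aliases(ops):
    """Return the op list with every alias replaced by its canonical op."""
    return [ALIAS_MAP.get(op, op) for op in ops]

print(normalize_op_aliases(["aten::absolute", "aten::relu"]))
# → ['aten::abs', 'aten::relu']
```

A downstream pass that pattern-matches on `aten::abs` then only has to handle the one canonical spelling.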

Differential Revision: D21673108

[ghstack-poisoned]
@eellison eellison requested a review from apaszke as a code owner May 19, 2020 19:44
eellison pushed a commit that referenced this pull request May 19, 2020
ghstack-source-id: c8df76f
Pull Request resolved: #38735
@facebook-github-bot facebook-github-bot added the oncall: jit Add this issue/PR to JIT oncall triage queue label May 19, 2020
@dr-ci dr-ci Bot commented May 19, 2020

💊 CI failures summary and remediations

As of commit 1d1b87f (more details on the Dr. CI page):


  • 1/1 failures possibly* introduced in this PR
    • 1/1 non-CircleCI failure(s)

ci.pytorch.org: 1 failed



@glaringlee (Contributor) commented May 19, 2020

I previously thought this numpy-compatibility project was only about logical compatibility, so instead of adding 'absolute' we could reuse 'abs' and fix any non-numpy-compatible behavior inside it. Now I see that we also need naming compatibility, so adding a normalization layer and supporting aliases is reasonable to me now. 👍 Thanks.

Comment thread test/test_jit.py

graph = torch.jit.script(test_multiple).graph
FileCheck().check_count("prim::If", 3, exactly=True).run(graph)
print(torch.jit.script(test_multiple).code)
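For readers unfamiliar with FileCheck, the `check_count` assertion used above can be approximated in plain Python. This is a hypothetical stand-in for intuition only, not the real FileCheck API:

```python
# Minimal stand-in for FileCheck's check_count: count non-overlapping
# occurrences of a pattern in an IR dump and assert the expected count.
def check_count(text, pattern, count, exactly=True):
    found = text.count(pattern)
    if exactly:
        assert found == count, f"expected {count}x {pattern!r}, found {found}"
    else:
        assert found >= count

# A graph dump with three prim::If nodes passes the exact-count check.
graph_str = "prim::If ... prim::If ... prim::If"
check_count(graph_str, "prim::If", 3)
```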
A Collaborator commented:

Thank you for cleaning this up!

Comment thread test/test_jit.py
"""
FileCheck().run(graph_str, parse_ir(graph_str))

def test_norm_aliases(self):
A Collaborator commented:

This test is cool but either in this PR or a follow-up we should make it extensible just like the method of registering aliases in the JIT is. That way when people add a new alias they can also easily extend this test.

@eellison (Contributor, Author) replied:

Done here: #38746
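The extensibility idea discussed in this thread can be sketched as a single registry driving both the rewrite and the generated test cases, so registering a new alias automatically adds test coverage. Hypothetical Python, not the actual JIT registration mechanism:

```python
# Hypothetical sketch of the single-registry idea: every alias is
# declared once, and the same table drives both normalization and the
# generated test cases, so a new alias is automatically covered.
ALIASES = {
    "aten::absolute": "aten::abs",
    "aten::absolute_": "aten::abs_",
}

def canonical(op):
    """Map an op to its canonical symbol (identity if it has no alias)."""
    return ALIASES.get(op, op)

def alias_test_cases():
    # One (alias, expected canonical op) pair per registered alias.
    return [(alias, canonical(alias)) for alias in ALIASES]
```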

@mruberry (Collaborator) commented:

This is super cool, thanks @eellison! I left a few comments requesting clarification on how to add and test new aliases.

namespace torch {
namespace jit {

static const std::unordered_map<Symbol, Symbol> alias_map = {
A Collaborator commented:

Sorry, my comment here didn't get saved, I guess. What about a comment here explaining how people should add (and test) new aliases?

@eellison (Contributor, Author) replied:

I added a comment. I would prefer not to put testing details in source code because it makes our code more verbose and the details would likely grow stale. If someone wants to look at how something is tested the typical thing is to look at git blame or ask around.

A Collaborator replied:

Sounds good.

@jamesr66a (Collaborator) left a comment:

Cool!

Do we have a plan for extending this to other use cases? The main one is e.g. `aten::_convolution` vs. `aten::conv*d`.

Comment thread torch/csrc/jit/passes/normalize_ops.h Outdated
namespace torch {
namespace jit {

// This passes converts aten ops to a normalized form. It is
A Collaborator commented:

nit: This pass :p

Comment thread torch/csrc/jit/python/script_init.cpp Outdated

auto cu = get_python_cu();
auto name = c10::QualifiedName(qualname);
NormalizeOps(graph);
A Collaborator commented:

Is this also needed in _create_method_from_trace? Can we just embed this call in trace() (called by createGraphByTracing)? Maybe here:
https://github.com/pytorch/pytorch/blob/master/torch/csrc/jit/frontend/tracer.cpp#L454

{aten::absolute_, aten::abs_},
};

void replaceNodeWithNewSymbol(Node* node, Symbol new_symbol) {
A Collaborator commented:

Would this be a useful method to expose on Node or Graph? I can definitely think of places I've wanted a function like this elsewhere

@eellison (Contributor, Author) replied:

Do you have any examples of other call sites?
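For intuition, the helper under discussion can be modeled in toy form: build a node with the new symbol, transfer the old node's inputs, and redirect the old node's consumers. These are hypothetical Python classes; the real helper operates on `torch::jit::Node` in C++.

```python
# Toy model of replacing a node with one carrying a new symbol: the new
# node takes over the old node's inputs and its consumers.
class ToyNode:
    def __init__(self, kind, inputs=()):
        self.kind = kind
        self.inputs = list(inputs)
        self.users = []          # nodes consuming this node's output

def replace_node_with_new_symbol(node, new_kind):
    new_node = ToyNode(new_kind, node.inputs)
    new_node.users = node.users  # redirect all uses to the new node
    node.inputs, node.users = [], []
    return new_node

old = ToyNode("aten::absolute", inputs=["%x"])
new = replace_node_with_new_symbol(old, "aten::abs")
```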

// Having multiple ops in our IR that do the same thing makes the IR more
// difficult to consume for downstream users of the IR, such as our own
// optimization passes. Here, we convert op aliases into a standard form.
bool normalizeOpAliases(graph_node_list_iterator& iter) {
A Collaborator commented:

static, or just wrap all the implementation code in an anonymous namespace

@eellison (Contributor, Author) commented:

@jamesr66a no concrete plan to move over `_convolution`, but it is my intention that this pass will eventually encompass that and other use cases

@facebook-github-bot (Contributor) commented:

@eellison merged this pull request in f90dc74.

@facebook-github-bot facebook-github-bot deleted the gh/eellison/80/head branch May 25, 2020 14:16
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026
Summary:
Pull Request resolved: pytorch#38735

Follow up to my comment pytorch#36597

This adds a pass to convert op aliases into a normalized form. Having two ops generated in our IR that do the same thing makes the IR harder for downstream consumers of the IR, such as TorchScript passes but also ONNX, glow, etc.

Another solution would have been to fix our code generation to only emit `aten::abs` from the start. This seems trickier, and doesn't really buy us much if we still have to expose `aten::absolute` in C++, as glaringlee of the C++ API thinks we should.

Bike shedding: maybe this should be `CanonicalizeOps` instead

Test Plan: Imported from OSS

Differential Revision: D21673108

Pulled By: eellison

fbshipit-source-id: c328618907de1af22e07f57fd27fa619978c2817

Labels

Merged; oncall: jit (Add this issue/PR to JIT oncall triage queue)

5 participants