[quant][graphmode] Different rule for handling aten::cat #38570

jerryzh168 wants to merge 5 commits into gh/jerryzh168/320/base

Conversation
Summary: We changed the rule for quantizing `aten::cat`. Previously `aten::cat` was considered an op that should always be quantized, like `aten::conv2d`, but this is not ideal. A better rule is to quantize the output of `aten::cat` depending on whether its inputs are quantized: if they are, we quantize the output; if they are not, we do not, since `aten::cat` works on both quantized and non-quantized tensors.

Test Plan:
Reviewers:
Subscribers:
Tasks:
Tags:

[ghstack-poisoned]
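As a concrete illustration of the new rule, here is a minimal sketch (the module names are made up for this example, and the comments describe the intended behavior rather than verified output of the quantization passes):

```python
import torch
import torch.nn as nn

class CatAfterConv(nn.Module):
    """Both inputs to cat come from conv2d, which is always quantized."""
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 3, 1)
        self.conv2 = nn.Conv2d(3, 3, 1)

    def forward(self, x):
        # All inputs to cat are quantized, so under the new rule the
        # output of cat is quantized as well.
        return torch.cat([self.conv1(x), self.conv2(x)], dim=1)

class CatOnInputs(nn.Module):
    """Both inputs to cat are plain fp32 model inputs."""
    def forward(self, x, y):
        # No input to cat is quantized, so the output of cat is left
        # unquantized; aten::cat works fine on fp32 tensors.
        return torch.cat([x, y], dim=1)

# Scripted and put in eval mode, as the graph-mode passes expect.
m1 = torch.jit.script(CatAfterConv()).eval()
m2 = torch.jit.script(CatOnInputs()).eval()
```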
💊 CI failures summary (Dr. CI, as of commit 7236764): 1 failed extra GitHub check on ci.pytorch.org.
```python
    .run(m.graph)

# non quantized cat
m = torch.jit.script(NonQuantizedCat()).eval()
```
In this case, what is the expected behavior? Would you quantize all the inputs to the non-quantized cat and quantize the output too? In general, don't you quantize the input to a model?
No, we don't quantize the inputs to cat in this case; the output of cat is only quantized when all the inputs are quantized.
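To make "all the inputs" concrete, here is a hypothetical mixed case (this module is invented for illustration and does not appear in the PR):

```python
import torch
import torch.nn as nn

class MixedCat(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 3, 1)

    def forward(self, x, y):
        # self.conv(x) is quantized, but y is a plain fp32 input, so not
        # all inputs to cat are quantized; per the rule stated above, the
        # output of cat is therefore not quantized.
        return torch.cat([self.conv(x), y], dim=1)
```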
```diff
     !isObserved(v, block_observed_values)) {
-  if (auto observer_opt = getObserverFor(v)) {
+  auto observer_opt = getObserverFor(v);
+  // If the node is one of the propagate quant node, e.g.
```
Isn't this logic also true for add or conv? Why special-case concat?
This is only true for cat; for conv we'll always quantize its input and output.
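A rough Python paraphrase of the observer-insertion rule implied by the C++ snippet above (the set names and helper are illustrative only; the real logic lives in the JIT quantization passes):

```python
# Ops that are always quantized: observe their inputs and outputs
# unconditionally (conv is the example given in this thread).
ALWAYS_QUANT_OPS = {"aten::conv2d"}

# Ops that merely propagate quantization: observe the output only when
# every input is already observed/quantized.
PROPAGATE_QUANT_OPS = {"aten::cat"}

def should_observe_output(op_kind, input_observed_flags):
    if op_kind in ALWAYS_QUANT_OPS:
        return True
    if op_kind in PROPAGATE_QUANT_OPS:
        return all(input_observed_flags)
    return False
```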
We need a consistent set of rules. Consider the following two models:

```python
import torch
import torch.nn as nn

class testM(nn.Module):
    def __init__(self):
        super().__init__()
        self.c = nn.Conv2d(3, 5, 1)
        self.d = nn.Conv2d(3, 5, 1)

    def forward(self, x):
        # If there is a nn.Identity or shape, will the inputs be quantized?
        y = self.c(x)
        z = self.d(x)
        w = torch.cat((y, z))
        return w

# Second one:
class testM2(nn.Module):
    def __init__(self):
        super().__init__()
        self.c = nn.Conv2d(3, 5, 1)
        self.d = nn.Conv2d(3, 5, 1)

    def forward(self, x):
        w = torch.cat((x, x))
        y = self.c(w)
        z = self.d(w)
        return w
```

In the first case, the input will be quantized. In the second case the input will not be.

What if in the first case I have an nn.Identity prior to the conv, or a reshape? In that case do we quantize the inputs?
We don't quantize identity or reshape; we also don't accept input that's already quantized outside of the model.
In terms of user interface, the user will always provide a floating point Tensor, regardless of how the model is quantized.
Got it: the input is always in fp, and in certain cases the input is quantized (conv) and in certain cases it is not (cat).
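An annotated sketch of the behavior agreed on in this thread, applied to the two models above (illustrative dataflow, not actual pass output):

```python
# testM:  the convs always quantize their input, and their outputs y and z
# are quantized, so the output of cat is quantized too:
#
#   x (fp32) -> quant -> conv -> y (quantized) -\
#   x (fp32) -> quant -> conv -> z (quantized) --> cat -> w (quantized)
#
# testM2: x is a plain fp32 model input, so cat's output w stays fp32;
# each conv then quantizes its own input w downstream:
#
#   x (fp32) -> cat -> w (fp32) -> quant -> conv -> y (quantized)
#                      w (fp32) -> quant -> conv -> z (quantized)
```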
This pull request has been merged in 1ef77f9.
[quant][graphmode] Different rule for handling aten::cat (pytorch#38570)

Summary: Pull Request resolved: pytorch#38570. We changed the rule for quantizing `aten::cat`: the output of `aten::cat` is quantized only when its inputs are quantized, since `aten::cat` works on both quantized and non-quantized tensors.

Test Plan: Imported from OSS

Differential Revision: D21600160

fbshipit-source-id: efa957e0eaa608fffefcdfefa7f442fab45605eb
Stack from ghstack:

- #38570 [quant][graphmode] Different rule for handling `aten::cat`

Differential Revision: D21600160