[quant] add quantized::batch_norm #39910
jerryzh168 wants to merge 8 commits into gh/jerryzh168/345/base
Conversation
Summary: We need this for graph mode quantization: since we only have `aten::batch_norm`, the dimension is only known at runtime, so we'll need to quantize it to `quantized::batch_norm`. Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
💊 CI failures summary and remediations: As of commit cefc65d (more details on the Dr. CI page): 💚 Looks good so far! There are no failures yet. 💚
Summary: We need this for graph mode quantization: since we only have `aten::batch_norm`, the dimension is only known at runtime, so we'll need to quantize it to `quantized::batch_norm`. Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D22012281](https://our.internmc.facebook.com/intern/diff/D22012281) [ghstack-poisoned]
For dimension, are we talking about BatchNorm{1|2|3}d? If so, can we get it from the module, since the majority of users will probably be using the default PT modules?
```cpp
TORCH_LIBRARY_IMPL(quantized, QuantizedCPU, m) {
  m.impl("batch_norm", q_batch_norm_impl<false>);
```
Is this function pointing to the 2d/3d implementations, but not supporting the 1d implementation? Not sure if that's super intuitive. Can we add a comment on why this is needed?
Yeah, ideally this should support 1d as well, but there is no support for 1d atm. Sure, I'll add a comment.
Yeah, dim refers to batch_norm 1d/2d/3d. This information is not available in the aten ops; it's only available in the module. And we can't get it from the module, since we are dealing with aten ops in the IR.
Makes sense for the current implementation. I'm asking more for my own understanding: this info is in the graph before inlining, and we technically could spend eng time to preserve it through the passes (by adding a new attribute, etc.) if we really needed to (not saying we should do this, just trying to understand). Just curious if I understood that correctly.
The dimension info is encoded in the check function right now: https://github.com/pytorch/pytorch/blob/master/torch/nn/modules/batchnorm.py#L190. The check function is different for each module, so we'd need to analyze the code to extract this info. Another thing is that if people use F.batch_norm (https://github.com/pytorch/pytorch/blob/master/torch/nn/functional.py#L1998), there is no way we can get the dimension info; the dimension is only known at runtime.
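To make that concrete, here is a minimal sketch (not code from this PR): once scripted, calls with 3d and 4d inputs both lower to the same `aten::batch_norm` node, so the 1d/2d distinction is visible only in the input's shape at runtime.

```python
import torch
import torch.nn.functional as F

# Both a 3d (N, C, L) and a 4d (N, C, H, W) input go through the same
# aten::batch_norm node once this function is scripted; the IR carries
# no 1d/2d/3d annotation.
def bn(x, weight, bias, mean, var):
    return F.batch_norm(x, mean, var, weight, bias, training=False)

scripted = torch.jit.script(bn)
print(scripted.graph)  # a single aten::batch_norm call, no dimension info

c = 4
weight, bias = torch.ones(c), torch.zeros(c)
mean, var = torch.zeros(c), torch.ones(c)
scripted(torch.randn(2, c, 8), weight, bias, mean, var)     # "1d" input
scripted(torch.randn(2, c, 8, 8), weight, bias, mean, var)  # "2d" input
```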
In general this comes from the discrepancy of the APIs between the floating point op and the quantized op: we have …, although in the future I think we should move towards breaking …
Oh, I was asking about parsing it from the module type instead, not from the graph of the function calls, i.e. if someone is running a vanilla nn.BatchNorm{1|2|3}d. Is it challenging / not possible to capture that from the module graph before any kind of inlining, and then assign it to the corresponding aten node in the inlining pass? To clarify, I'm not suggesting it, just looking to understand whether this would be possible in the future if it's needed.
This is certainly possible, but I don't think we'll do this in the future; I think you'll get a better idea as you work more on graph mode. We would like to keep the analysis and transformations local instead of spreading them out. For example, you mentioned we could get this information from the module and then "pass" it to the aten node; we would only do that when it is absolutely necessary. Since this can be achieved with a much simpler alternative, as shown in the PR, I don't think it makes sense to do it.
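A hedged Python paraphrase of that simpler alternative (the actual kernel in this PR is C++, and the helper name below is hypothetical): a single entry point can branch on the input's rank at runtime instead of requiring the 1d/2d/3d distinction to be threaded through graph passes.

```python
import torch

def quantized_batch_norm_any_dim(qx, weight, bias, mean, var, eps,
                                 output_scale, output_zero_point):
    # Hypothetical helper: dispatch on the quantized input's rank at runtime,
    # mirroring how a dimension-agnostic quantized batch_norm can avoid needing
    # module-level 1d/2d/3d info in the IR.
    if qx.dim() == 4:    # (N, C, H, W): the 2d case
        return torch.ops.quantized.batch_norm2d(
            qx, weight, bias, mean, var, eps, output_scale, output_zero_point)
    if qx.dim() == 5:    # (N, C, D, H, W): the 3d case
        return torch.ops.quantized.batch_norm3d(
            qx, weight, bias, mean, var, eps, output_scale, output_zero_point)
    raise NotImplementedError("1d quantized batch_norm is not supported yet")
```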
Thanks a ton. This is really helpful for understanding the past tradeoffs and design principles in this codebase.
This pull request has been merged in 1a388da.
Summary: Pull Request resolved: pytorch#39910 We need this for graph mode quantization: since we only have `aten::batch_norm`, the dimension is only known at runtime, so we'll need to quantize it to `quantized::batch_norm`. Test Plan: Imported from OSS Differential Revision: D22012281 fbshipit-source-id: 2973d86a17a02b7bdc36bd1e703e91584d9139d0
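For reference, a hedged usage sketch of the merged op from Python; the argument order is assumed to mirror `quantized::batch_norm2d` (weight, bias, mean, var, eps, output_scale, output_zero_point) and should be checked against the actual schema.

```python
import torch

c = 3
x = torch.randn(2, c, 4, 4)
qx = torch.quantize_per_tensor(x, scale=0.1, zero_point=0, dtype=torch.quint8)

weight, bias = torch.ones(c), torch.zeros(c)
mean, var = torch.zeros(c), torch.ones(c)

# A 4d quantized input; the op picks the 2d path based on qx.dim() at runtime.
qy = torch.ops.quantized.batch_norm(qx, weight, bias, mean, var, 1e-5, 0.1, 0)
print(qy.shape, qy.dtype)
```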
Stack from ghstack:
#39925 [quant][graphmode] Support quantizing `repeat`

Summary:
We need this for graph mode quantization: since we only have `aten::batch_norm`, the dimension is only known at runtime, so we'll need to quantize it to `quantized::batch_norm`.

Test Plan:
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D22012281