Quantization aware training: Freeze batch norm support #26624
raghuramank100 wants to merge 11 commits into gh/raghuramank100/35/base
Conversation
For QAT we need to be able to control batch norm for all modules from the top. Adding helper functions to enable/disable batch norm freezing during training. Differential Revision: [D17512199](https://our.internmc.facebook.com/intern/diff/D17512199/)
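The pattern the description refers to can be sketched as follows. This is a minimal, torch-free illustration: `Module`, `ConvBn2d`, and `ConvBnReLU2d` here are toy stand-ins, not the real fused QAT modules (which live under `torch.nn.intrinsic.qat` and track a `freeze_bn`-style flag); the `apply` method mimics `nn.Module.apply`.

```python
# Minimal sketch of top-level batch norm freezing control for QAT.
# Toy stand-ins only -- not the actual PyTorch implementation.

class Module:
    def __init__(self):
        self._children = []

    def add(self, child):
        self._children.append(child)
        return child

    def apply(self, fn):
        # Mirrors nn.Module.apply: call fn on every descendant, then on self.
        for child in self._children:
            child.apply(fn)
        fn(self)
        return self

class ConvBn2d(Module):
    def __init__(self):
        super().__init__()
        self.freeze_bn = False  # when True, running stats are not updated

class ConvBnReLU2d(ConvBn2d):
    pass

def update_bn_stats(mod):
    # Re-enable batch norm statistics updates, but only on fused QAT modules.
    if type(mod) in (ConvBnReLU2d, ConvBn2d):
        mod.freeze_bn = False

def freeze_bn_stats(mod):
    # Freeze batch norm statistics, but only on fused QAT modules.
    if type(mod) in (ConvBnReLU2d, ConvBn2d):
        mod.freeze_bn = True

# Usage: flip the flag for every fused module from the top of the model.
model = Module()
fused = model.add(ConvBn2d())
model.apply(freeze_bn_stats)
```

The key point is that the caller never reaches into individual submodules; a single `apply` call from the root toggles every fused module at once.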
```python
        return super(ConvReLU2d, cls).from_float(mod, qconfig)


def update_bn_stats(mod):
    if type(mod) in set([ConvBnReLU2d, ConvBn2d]):
```
dzhulgakov left a comment:
General question, probably not for this diff - do we need freeze_bn support also in non-fused modules?
Setting bn to eval() would do that and allow frozen statistics to be used during training, but this is ugly: we want the rest of the modules to stay in train(), with bn alone in eval().
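For reference, the "ugly" alternative mentioned here would look roughly like this in plain PyTorch (a sketch, assuming an unfused model whose batch norm layers are ordinary `nn.BatchNorm2d`):

```python
import torch.nn as nn

model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.BatchNorm2d(8), nn.ReLU())
model.train()  # whole model in training mode

# Put only the batch norm layers in eval mode, so their running
# statistics are frozen while every other module keeps training behavior.
for m in model.modules():
    if isinstance(m, nn.BatchNorm2d):
        m.eval()
```

This works, but it leaves the model in a mixed train/eval state that any later `model.train()` call silently undoes, which is exactly why a dedicated `freeze_bn` flag on the fused modules is cleaner.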
```python
        return super(ConvReLU2d, cls).from_float(mod, qconfig)


def update_bn_stats(mod):
    if type(mod) in set([ConvBnReLU2d, ConvBn2d]):
```
You can also just do `hasattr` (in case more modules appear in the future).
Is this safer? We want to modify this field only for specific modules. Matching on an attribute would be riskier, since we would set freeze_bn for any module that happens to have this field. Leaving this as is so that we have explicit control over which modules the modification applies to.
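The trade-off being discussed can be shown with a tiny self-contained example (toy classes, not the real modules): an attribute check also matches unrelated modules that happen to define `freeze_bn`, while an explicit type check only touches the intended ones.

```python
# Toy stand-ins only -- not the real torch.nn.intrinsic.qat classes.
class ConvBn2d:
    def __init__(self):
        self.freeze_bn = False

class UserModule:
    # Unrelated module that coincidentally defines the same attribute.
    def __init__(self):
        self.freeze_bn = False

def freeze_by_type(mod):
    # Explicit allow-list: only the intended fused modules are touched.
    if type(mod) in (ConvBn2d,):
        mod.freeze_bn = True

def freeze_by_attr(mod):
    # Attribute check: matches anything that has a freeze_bn field.
    if hasattr(mod, 'freeze_bn'):
        mod.freeze_bn = True

fused, user = ConvBn2d(), UserModule()
for m in (fused, user):
    freeze_by_type(m)
type_result = (fused.freeze_bn, user.freeze_bn)  # (True, False)

for m in (fused, user):
    freeze_by_attr(m)
attr_result = (fused.freeze_bn, user.freeze_bn)  # (True, True)
```

The attribute-based version silently flips the flag on `UserModule` too, which is the risk the reply above is pointing at.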
This pull request has been merged in 84ee8ac.
Summary:
Pull Request resolved: #26624
For QAT we need to be able to control batch norm for all modules from the top. Adding helper functions to enable/disable batch norm freezing during training.
ghstack-source-id: 91008297
Test Plan: buck test caffe2/test:quantization -- --print-passing-details
Differential Revision: D17512199
fbshipit-source-id: f7b981e2b1966ab01c4dbb161030177274a998b6