Fold weight permutation inside quantized conv operator #26241
dzhulgakov wants to merge 8 commits into gh/dzhulgakov/2/base
Conversation
According to #19092 we always keep NCHW order and do handling inside the kernels. This PR fixes it for the weights of qconv by using the MemoryLayout mechanism. [ghstack-poisoned]
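A minimal sketch of the intent (not the PR's actual kernel code): the caller keeps the conv weight in NCHW order and the operator handles the layout internally. The shapes and scales below are made up, `torch.quantize_linear` is the PR-era name (today `torch.quantize_per_tensor`), and using `torch.channels_last` here is only an assumption about how the memory-format mechanism applies.

```python
import torch

oC, iC, kH, kW = 8, 4, 3, 3
w = torch.randn(oC, iC, kH, kW, dtype=torch.float32)  # weight stays in NCHW (KCRS)
# PR-era quantization API; current PyTorch spells this torch.quantize_per_tensor
qw = torch.quantize_linear(w, scale=1.0 / 128, zero_point=0, dtype=torch.qint8)

# Before this PR the caller permuted to KRSC before prepacking, e.g.
#   torch.ops.quantized.conv_prepack(qw.permute([0, 2, 3, 1]), ...)
# After it, prepack takes the NCHW weight and reorders internally; conceptually
# that is the same as asking for a channels-last copy of the data:
w_cl = w.contiguous(memory_format=torch.channels_last)
print(w_cl.shape)     # logical shape unchanged: (oC, iC, kH, kW)
print(w_cl.stride())  # strides now put the channel dimension innermost
```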
What about quantize linear ops?
What do you mean about linear? Linear ops don't have memory order. Also cc @VitalyFedyunin to verify that the memory format stuff looks good.
b = torch.randn(oC, dtype=torch.float32) if use_bias else None
q_bias = torch.quantize_linear(b, scale=1.0 / 1024, zero_point=0, dtype=torch.qint32) if use_bias else None
q_filters_ref = torch.ops.quantized.conv_prepack(qw.permute([0, 2, 3, 1]),
Don't you also need similar changes to the tests in test_quantized.py?
oh, I think I put it accidentally in #26242 - let me just squash them together
Please don't land, I need to read it carefully.
VitalyFedyunin left a comment:
Memory Format manipulations look good!
According to #19092 we always keep NCHW order and do handling inside the kernels. This PR fixes it for weights of the qconv by using MemoryLayout mechanism. Differential Revision: [D17443219](https://our.internmc.facebook.com/intern/diff/D17443219) [ghstack-poisoned]
  W_KRSC = W.permute([0, 2, 3, 1]).contiguous()
  if channelwise:
-     W_q = torch.quantize_linear_per_channel(W_KRSC,
+     W_q = torch.quantize_linear_per_channel(W,
Thanks for changing this too. Did you test it locally by setting PYTORCH_TEST_WITH_QNNPACK?
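For reference, a small sketch of the per-channel path once the weight stays in NCHW. It uses the current `torch.quantize_per_channel` name (the diff's `quantize_linear_per_channel` is the PR-era spelling), and the shapes and scales are placeholders.

```python
import torch

oC, iC, kH, kW = 8, 4, 3, 3
W = torch.randn(oC, iC, kH, kW)                    # weight kept in NCHW (KCRS)
scales = torch.rand(oC, dtype=torch.double) + 0.1  # one scale per output channel
zero_points = torch.zeros(oC, dtype=torch.long)

# Quantize along the output-channel axis directly; no permute to KRSC needed.
W_q = torch.quantize_per_channel(W, scales, zero_points, axis=0, dtype=torch.qint8)
print(W_q.q_per_channel_scales().shape)  # torch.Size([8])
```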
  scales.toType(kDouble),
  zero_points.toType(kLong),
- {output_channels, kernel_h, kernel_w, C_per_G},
+ {output_channels, C_per_G, kernel_h, kernel_w},
Do we need to change it similarly for QNNPACK as well below since we store it as MemoryFormat::ChannelsLast in the packed struct?
No, you store the original Tensor and it's fine to return the Tensor in ChannelsLast format (it'd just happen to be non-contiguous but still semantically correct).
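A quick illustration of that point, assuming the standard memory-format API: a channels-last tensor keeps its NCHW shape and values and is merely non-contiguous with respect to the default stride order.

```python
import torch

x = torch.randn(2, 4, 5, 5)
x_cl = x.contiguous(memory_format=torch.channels_last)

print(x_cl.shape == x.shape)  # True  -- same logical NCHW shape
print(torch.equal(x_cl, x))   # True  -- same values, semantically identical
print(x_cl.is_contiguous())   # False -- not contiguous in the default (NCHW) order
print(x_cl.is_contiguous(memory_format=torch.channels_last))  # True
```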
Summary:
Pull Request resolved: pytorch/pytorch#26241
According to pytorch/pytorch#19092 we always keep NCHW order and do handling inside the kernels. This PR fixes it for weights of the qconv by using MemoryLayout mechanism.
Test Plan: Imported from OSS
Differential Revision: D17443219
Pulled By: dzhulgakov
fbshipit-source-id: ce0eb92034a9977b3303dafab8b0414575171062
@dzhulgakov merged this pull request in d5daac7.