Fix the QuantizedAVX2 build issue #26854
Conversation
Any perf checks?
facebook-github-bot
left a comment
@llyfacebook has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Yes, I found it is even faster than the original one (for the bilinear2d interpolate case).

I got some undeclared identifier errors when building locally.
@llyfacebook yeah, QuantizeAVX2 operates on vectors of 32, but you were feeding it vectors of 8, so it was just running scalar code :p
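For readers following along, here is a minimal sketch of why an 8-element input defeats the vector path. The names and details below are illustrative, not the actual FBGEMM/ATen kernel: an AVX2 quantization loop consumes 32 values per iteration and handles leftovers with a scalar tail, so a call with only 8 values never enters the vector loop.

```cpp
#include <immintrin.h>
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <cstdint>

// Illustrative sketch (not the real kernel): quantize `count` floats to
// uint8 with q = clamp(round(x / scale) + zero_point, 0, 255).
void QuantizeAVX2Sketch(const float* src, uint8_t* dst, size_t count,
                        float scale, int32_t zero_point) {
  const float inv_scale = 1.0f / scale;
  size_t i = 0;
  // Vector loop: 32 outputs per iteration (4 groups of 8 floats).
  // With count == 8 this body never executes.
  for (; i + 32 <= count; i += 32) {
    __m256i q[4];
    for (int j = 0; j < 4; ++j) {
      __m256 v = _mm256_mul_ps(_mm256_loadu_ps(src + i + 8 * j),
                               _mm256_set1_ps(inv_scale));
      q[j] = _mm256_add_epi32(_mm256_cvtps_epi32(v),  // round-to-nearest-even
                              _mm256_set1_epi32(zero_point));
    }
    // Narrow 32 x int32 -> 32 x uint8; the packs saturate, which also
    // clamps to [0, 255]. The final permute undoes the lane interleaving
    // introduced by the in-lane pack instructions.
    __m256i lo = _mm256_packs_epi32(q[0], q[1]);
    __m256i hi = _mm256_packs_epi32(q[2], q[3]);
    __m256i bytes = _mm256_packus_epi16(lo, hi);
    bytes = _mm256_permutevar8x32_epi32(
        bytes, _mm256_setr_epi32(0, 4, 1, 5, 2, 6, 3, 7));
    _mm256_storeu_si256(reinterpret_cast<__m256i*>(dst + i), bytes);
  }
  // Scalar tail: the only path taken for small (e.g. 8-element) calls.
  for (; i < count; ++i) {
    int32_t v = static_cast<int32_t>(std::nearbyint(src[i] * inv_scale)) +
                zero_point;
    dst[i] = static_cast<uint8_t>(std::min(255, std::max(0, v)));
  }
}
```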
@llyfacebook I'd expect another 2x speedup if you switch to doing the float operations 4-wide and using QuantizeAVX2 again
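A rough illustration of that suggestion, under the assumption of an 8-channel bilinear2d kernel (the helper names here are hypothetical, not the real upsample code): compute the float results for four 8-channel output pixels into a 32-element staging buffer, then quantize all 32 values with one AVX2-width call so the vector loop in the sketch above actually runs.

```cpp
#include <cstddef>
#include <cstdint>

// From the sketch above.
void QuantizeAVX2Sketch(const float* src, uint8_t* dst, size_t count,
                        float scale, int32_t zero_point);

// Hypothetical stand-in for the per-element bilinear interpolation; the
// real kernel blends four neighboring input pixels.
static float bilinear_value(int pixel, int channel) {
  return 0.25f * pixel + 0.125f * channel;  // placeholder arithmetic
}

// Stage 4 pixels x 8 channels = 32 floats, then quantize in one call
// instead of making four 8-element calls that only hit the scalar tail.
void quantize_four_pixels(uint8_t* dst, float scale, int32_t zero_point) {
  float staged[32];
  for (int p = 0; p < 4; ++p) {
    for (int c = 0; c < 8; ++c) {
      staged[p * 8 + c] = bilinear_value(p, c);
    }
  }
  QuantizeAVX2Sketch(staged, dst, 32, scale, zero_point);
}
```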
hx89
left a comment
LGTM! My local build passes after rebasing.
facebook-github-bot
left a comment
@llyfacebook is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@llyfacebook merged this pull request in 428204d.
Summary: QuantizedAVX2 does not support the int32 type, so we switch to using the at::quantize_vec function instead.
Pull Request resolved: pytorch/pytorch#26854
Differential Revision: D17609872
Pulled By: llyfacebook
fbshipit-source-id: b4a77d93ce0ebfef696506b5cdbe3e91fe44bb36
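For reference, here is a scalar sketch of what the at::quantize_vec call computes for quint8 outputs. The function itself is real ATen API, but the signature below is an assumption written to mirror it, not a copy of the actual declaration.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <cstdint>

// Sketch of the scalar affine quantization that at::quantize_vec performs
// for quint8 (signature here is an assumption, not the ATen declaration):
// q = clamp(round(x / scale) + zero_point, 0, 255).
void quantize_vec_sketch(double scale, int64_t zero_point,
                         const float* src, uint8_t* dst, size_t count) {
  for (size_t i = 0; i < count; ++i) {
    int64_t q = static_cast<int64_t>(
                    std::nearbyint(static_cast<double>(src[i]) / scale)) +
                zero_point;
    dst[i] = static_cast<uint8_t>(
        std::min<int64_t>(255, std::max<int64_t>(0, q)));
  }
}
```

Since this path is plain scalar code, it avoids the int32 limitation of the AVX2 routine at the cost of vectorization, which matches the trade-off described in the summary.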