fake_quant: make qparams shape consistent by vkuzo · Pull Request #38587 · pytorch/pytorch

vkuzo · 2020-05-15T23:50:05Z

Stack from ghstack:

fake_quant: make qparams shape consistent #38587 fake_quant: make qparams shape consistent

Summary:

Before this diff, scale+zp were initialized to tensors
with a single dimension and 1 element, and then switched
to scalar tensors after the first forward.

This diff makes the shape stay consistent. This should fix
an issue reported when saving/loading models, which crashes
on this inconsistent shape.

Test Plan:

python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D21605532

Summary: Before this diff, scale+zp were initialized to tensors with a single dimension and 1 element, and then switched to scalar tensors after the first forward. This diff makes the shape stay consistent. This should fix an issue reported when saving/loading models, which crashes on this inconsistent shape. Test Plan: ``` python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations ``` Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

Summary: Before this diff, scale+zp were initialized to tensors with a single dimension and 1 element, and then switched to scalar tensors after the first forward. This diff makes the shape stay consistent. This should fix an issue reported when saving/loading models, which crashes on this inconsistent shape. Test Plan: ``` python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: c090edd Pull Request resolved: #38587

dr-ci · 2020-05-16T00:40:56Z

💊 CI failures summary and remediations

As of commit c087aad (more details on the Dr. CI page):

4/4 failures possibly* introduced in this PR
- 1/4 non-CircleCI failure(s)

🕵️ 2 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

pytorch_windows_vs2019_py36_cpu_build (1/2)

Step: "Build" (full log | diagnosis details | 🔁 rerun)

..\third_party\fbgemm\include\fbgemm/FbgemmFP16.h(100): error C3861: 'runtime_error': identifier not found


de -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\FbgemmI8Spmdm.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\FbgemmI8Spmdm.cc 
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

\include -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\FbgemmFP16.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\FbgemmFP16.cc 
FAILED: third_party/fbgemm/CMakeFiles/fbgemm_generic.dir/src/FbgemmFP16.cc.obj  
\include -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\FbgemmFP16.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\FbgemmFP16.cc 
..\third_party\fbgemm\include\fbgemm/FbgemmFP16.h(100): error C2039: 'runtime_error': is not a member of 'std'
C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\VC\Tools\MSVC\14.26.28801\include\vector(24): note: see declaration of 'std'
..\third_party\fbgemm\include\fbgemm/FbgemmFP16.h(100): error C3861: 'runtime_error': identifier not found
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

nclude -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\PackAMatrix.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\PackAMatrix.cc 
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

 -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\EmbeddingSpMDM.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\EmbeddingSpMDM.cc 
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

pytorch_windows_vs2019_py36_cuda10.1_build (2/2)

Step: "Build" (full log | diagnosis details | 🔁 rerun)

..\third_party\fbgemm\include\fbgemm/FbgemmFP16.h(100): error C3861: 'runtime_error': identifier not found


..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\FbgemmFloat16Convert.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\FbgemmFloat16Convert.cc 
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

\include -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\FbgemmFP16.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\FbgemmFP16.cc 
FAILED: third_party/fbgemm/CMakeFiles/fbgemm_generic.dir/src/FbgemmFP16.cc.obj  
\include -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\FbgemmFP16.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\FbgemmFP16.cc 
..\third_party\fbgemm\include\fbgemm/FbgemmFP16.h(100): error C2039: 'runtime_error': is not a member of 'std'
C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\VC\Tools\MSVC\14.26.28801\include\vector(24): note: see declaration of 'std'
..\third_party\fbgemm\include\fbgemm/FbgemmFP16.h(100): error C3861: 'runtime_error': identifier not found
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

nclude -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\PackAMatrix.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\PackAMatrix.cc 
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

rd_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -openmp:experimental /wd4244 /wd4267 /wd4305 /wd4309 /MD /O2 /Ob2 /DNDEBUG /w /bigobj   -std:c++14 /showIncludes /Fothird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\src\PackAWithQuantRowOffset.cc.obj /Fdthird_party\fbgemm\CMakeFiles\fbgemm_generic.dir\ /FS -c ..\third_party\fbgemm\src\PackAWithQuantRowOffset.cc 
Microsoft (R) C/C++ Optimizing Compiler Version 19.26.28805 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

1 failure not recognized by patterns:

Job	Step	Action
^{binary_windows_libtorch_3_7_cpu_release_build}	^Build	🔁 rerun

ci.pytorch.org: 1 failed

Failed: pr/py3.6-clang7-rocmdeb-ubuntu16.04

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions on the GitHub issue tracker or post in the (internal) Dr. CI Users group.

See how this bot performed.

This comment has been revised 16 times.

Summary: Before this diff, scale+zp were initialized to tensors with a single dimension and 1 element, and then switched to scalar tensors after the first forward. This diff makes the shape stay consistent. This should fix an issue reported when saving/loading models, which crashes on this inconsistent shape. Test Plan: ``` python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations ``` Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D21605532](https://our.internmc.facebook.com/intern/diff/D21605532) [ghstack-poisoned]

Summary: Before this diff, scale+zp were initialized to tensors with a single dimension and 1 element, and then switched to scalar tensors after the first forward. This diff makes the shape stay consistent. This should fix an issue reported when saving/loading models, which crashes on this inconsistent shape. Test Plan: ``` python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: a8244db Pull Request resolved: #38587

raghuramank100 · 2020-05-21T21:08:06Z

+        x = torch.rand(4, 4, 4, 4)
+        m(x)
+        scale_shape_after = m.linear.activation_post_process.scale.shape
+        zero_point_shape_after = m.linear.activation_post_process.zero_point.shape


Looks good, should we also check for per-channel quant (i.e the weights?)

just double checked, we expect the per-channel params to change (for C > 1), so it did not have the issue solved by this diff

Summary: Before this diff, scale+zp were initialized to tensors with a single dimension and 1 element, and then switched to scalar tensors after the first forward. This diff makes the shape stay consistent. This should fix an issue reported when saving/loading models, which crashes on this inconsistent shape. Test Plan: ``` python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations ``` Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D21605532](https://our.internmc.facebook.com/intern/diff/D21605532) [ghstack-poisoned]

Summary: Before this diff, scale+zp were initialized to tensors with a single dimension and 1 element, and then switched to scalar tensors after the first forward. This diff makes the shape stay consistent. This should fix an issue reported when saving/loading models, which crashes on this inconsistent shape. Test Plan: ``` python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: eab594c Pull Request resolved: #38587

facebook-github-bot · 2020-05-22T02:14:09Z

This pull request has been merged in 8d8b586.

Summary: Pull Request resolved: pytorch#38587 Before this diff, scale+zp were initialized to tensors with a single dimension and 1 element, and then switched to scalar tensors after the first forward. This diff makes the shape stay consistent. This should fix an issue reported when saving/loading models, which crashes on this inconsistent shape. Test Plan: ``` python test/test_quantization.py TestFakeQuantizePerTensor.test_fake_quant_preserves_qparam_shapes_for_activations ``` Imported from OSS Differential Revision: D21605532 fbshipit-source-id: e00cd268d6d3ded1006d18d6c6759c911b3a74ea

vkuzo requested review from jerryzh168, raghuramank100 and supriyar May 15, 2020 23:51

raghuramank100 reviewed May 21, 2020

View reviewed changes

raghuramank100 approved these changes May 21, 2020

View reviewed changes

facebook-github-bot closed this in 8d8b586 May 22, 2020

facebook-github-bot added the merged label May 22, 2020

facebook-github-bot deleted the gh/vkuzo/66/head branch May 25, 2020 14:16

vkuzo mentioned this pull request Jul 8, 2020

Broadcasting does not work for Quantization aware training with multiple GPUs #37270

Closed

mruberry added the Merged label Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fake_quant: make qparams shape consistent#38587

fake_quant: make qparams shape consistent#38587
vkuzo wants to merge 4 commits intogh/vkuzo/66/basefrom
gh/vkuzo/66/head

vkuzo commented May 15, 2020 •

edited

Loading

Uh oh!

dr-ci Bot commented May 16, 2020 •

edited

Loading

Uh oh!

raghuramank100 May 21, 2020

Uh oh!

vkuzo May 21, 2020

Uh oh!

facebook-github-bot commented May 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

vkuzo commented May 15, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dr-ci Bot commented May 16, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CI failures summary and remediations

🕵️ 2 new failures recognized by patterns

pytorch_windows_vs2019_py36_cpu_build (1/2)

pytorch_windows_vs2019_py36_cuda10.1_build (2/2)

1 failure not recognized by patterns:

ci.pytorch.org: 1 failed

Uh oh!

raghuramank100 May 21, 2020

Choose a reason for hiding this comment

Uh oh!

vkuzo May 21, 2020

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented May 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vkuzo commented May 15, 2020 •

edited

Loading

dr-ci Bot commented May 16, 2020 •

edited

Loading