[quant][core][gpu][improvement] Removed conv_output and set output tensors as virtual in quantized cudnn conv2d op by dzdang · Pull Request #76787 · pytorch/pytorch

dzdang · 2022-05-04T03:40:45Z

Stack from ghstack:

[quant][core][gpu][improvement] Made plan and run for quantized cudnn conv op conform with Conv_v8.cpp #76788
-> [quant][core][gpu][improvement] Removed conv_output and set output tensors as virtual in quantized cudnn conv2d op #76787
[quant][core][gpu][improvement] Enabled broadcasting multiplication support for requantize_multiplier_tensor in quantized cudnn add, linear, and conv2d ops #76518

Summary:
With support for virtual tensors in cudnn, we no longer have to allocate
conv_output.

Test plan:

python test/test_quantization.py -k test_qconv2d_cudnn

Differential Revision: D36121583

…nsors as virtual in quantized cudnn conv2d op Summary: With support for virtual tensors in cudnn, we no longer have to allocate conv_output. Test plan: ``` python test/test_quantization.py -k test_qconv2d_cudnn ``` [ghstack-poisoned]

facebook-github-bot · 2022-05-04T03:40:52Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/76787
📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓Need help or want to give feedback on the CI? Visit our office hours

✅ No Failures (0 Pending)

As of commit 2d2ffc3 (more details on the Dr. CI page):

Expand to see more

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

dzdang · 2022-05-04T03:43:37Z

@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

jerryzh168 · 2022-05-10T23:51:26Z

aten/src/ATen/native/quantized/cudnn/Conv.cpp

      .setxDesc(cudnn_utils::getTensorDescriptor(input.sizes(), input.strides(), CUDNN_DATA_INT8, 'x', key.input_alignment))
-      .setyDesc(cudnn_utils::getTensorDescriptor(conv_output, 'y', key.output_alignment))
+      // for virtual tensors, the alignment is not used, so we can just put an arbitrary value here, e.g., key.output_alignment
+      .setyDesc(cudnn_utils::getTensorDescriptor(quantized_output.sizes(), quantized_output.strides(), CUDNN_DATA_FLOAT, 'y', key.output_alignment, true))


should we use a constant here so that it is the same for all calls?

this setyDesc is not really used since the output is a virtual tensor, is that corrrect?

@jerryzh168 no. setyDesc is still used. we still need to provide the tensor size, stride, and dtype for cudNN (I don't think they have the support that allows them to determine the output shape and dtype based on the input and weight). the alignment and uid are not used.

aten/src/ATen/native/quantized/cudnn/Conv.cpp

…t output tensors as virtual in quantized cudnn conv2d op" Summary: With support for virtual tensors in cudnn, we no longer have to allocate conv_output. Test plan: ``` python test/test_quantization.py -k test_qconv2d_cudnn ``` Differential Revision: [D36121583](https://our.internmc.facebook.com/intern/diff/D36121583) [ghstack-poisoned]

dzdang · 2022-05-11T20:04:15Z

@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

…t output tensors as virtual in quantized cudnn conv2d op" Summary: With support for virtual tensors in cudnn, we no longer have to allocate conv_output. Test plan: ``` python test/test_quantization.py -k test_qconv2d_cudnn ``` Differential Revision: [D36121583](https://our.internmc.facebook.com/intern/diff/D36121583) [ghstack-poisoned]

dzdang · 2022-05-16T02:22:33Z

@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

…t output tensors as virtual in quantized cudnn conv2d op" Summary: With support for virtual tensors in cudnn, we no longer have to allocate conv_output. Test plan: ``` python test/test_quantization.py -k test_qconv2d_cudnn ``` Differential Revision: [D36121583](https://our.internmc.facebook.com/intern/diff/D36121583) [ghstack-poisoned]

dzdang · 2022-05-24T21:29:02Z

@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2022-05-25T03:07:01Z

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

…nsors as virtual in quantized cudnn conv2d op (#76787) Summary: Pull Request resolved: #76787 With support for virtual tensors in cudnn, we no longer have to allocate conv_output. Test Plan: ``` python test/test_quantization.py -k test_qconv2d_cudnn ``` ``` python test/test_quantization.py -k test_qconv2d_cudnn ``` Differential Revision: D36121583 D36121583 Reviewed By: jerryzh168 Pulled By: dzdang fbshipit-source-id: 7269fd4eaaad5ae6faf711add99b58731efb717a

…tensors as virtual in quantized cudnn linear op Summary: See #76787. Same idea applied here but for `linear_output`. Test plan: ``` python test/test_quantization.py --k test_qlinear_cudnn ``` Pull Request resolved: #77518 Approved by: https://github.com/jerryzh168

…tensors as virtual in quantized cudnn linear op (#77518) Summary: Pull Request resolved: #77518 See #76787. Same idea applied here but for `linear_output`. Test Plan: ``` python test/test_quantization.py --k test_qlinear_cudnn ``` Reviewed By: jerryzh168 Differential Revision: D36403832 Pulled By: dzdang fbshipit-source-id: 4ec3bab6a9d82a58fecd38bc943b2137a4f4157e

facebook-github-bot added the cla signed label May 4, 2022

dzdang mentioned this pull request May 4, 2022

[quant][core][gpu][improvement] Enabled broadcasting multiplication support for requantize_multiplier_tensor in quantized cudnn add, linear, and conv2d ops #76518

Closed

dzdang mentioned this pull request May 4, 2022

[quant][core][gpu][improvement] Made plan and run for quantized cudnn conv op conform with Conv_v8.cpp #76788

Closed

dzdang added release notes: quantization release notes category topic: improvements topic category labels May 4, 2022

dzdang requested a review from jerryzh168 May 4, 2022 12:41

jerryzh168 approved these changes May 10, 2022

View reviewed changes

jerryzh168 reviewed May 10, 2022

View reviewed changes

aten/src/ATen/native/quantized/cudnn/Conv.cpp Show resolved Hide resolved

dzdang mentioned this pull request May 16, 2022

[quant][core][gpu][improvement] Removed linear_output and set output tensors as virtual in quantized cudnn linear op #77518

Closed

dzdang added 2 commits May 24, 2022 16:32

pytorchmergebot added the Merged label May 25, 2022

pytorchmergebot closed this in bd5ec6c May 25, 2022

facebook-github-bot deleted the gh/dzdang/104/head branch May 28, 2022 14:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[quant][core][gpu][improvement] Removed conv_output and set output tensors as virtual in quantized cudnn conv2d op#76787

[quant][core][gpu][improvement] Removed conv_output and set output tensors as virtual in quantized cudnn conv2d op#76787
dzdang wants to merge 5 commits intogh/dzdang/104/basefrom
gh/dzdang/104/head

dzdang commented May 4, 2022 •

edited

Loading

Uh oh!

facebook-github-bot commented May 4, 2022 •

edited

Loading

Uh oh!

dzdang commented May 4, 2022

Uh oh!

jerryzh168 May 10, 2022 •

edited

Loading

Uh oh!

jerryzh168 May 10, 2022

Uh oh!

dzdang May 11, 2022

Uh oh!

Uh oh!

dzdang commented May 11, 2022

Uh oh!

dzdang commented May 16, 2022

Uh oh!

dzdang commented May 24, 2022

Uh oh!

facebook-github-bot commented May 25, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

dzdang commented May 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented May 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful links

✅ No Failures (0 Pending)

Uh oh!

dzdang commented May 4, 2022

Uh oh!

jerryzh168 May 10, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jerryzh168 May 10, 2022

Choose a reason for hiding this comment

Uh oh!

dzdang May 11, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dzdang commented May 11, 2022

Uh oh!

dzdang commented May 16, 2022

Uh oh!

dzdang commented May 24, 2022

Uh oh!

facebook-github-bot commented May 25, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

dzdang commented May 4, 2022 •

edited

Loading

facebook-github-bot commented May 4, 2022 •

edited

Loading

jerryzh168 May 10, 2022 •

edited

Loading