Replace empty_affine_quantizer with direct dispatch to at::native::empty_affine… #36814

kimishpatel wants to merge 9 commits into gh/kimishpatel/3/base
Conversation
Differential Revision: [D21093840](https://our.internmc.facebook.com/intern/diff/D21093840/) [ghstack-poisoned]
💊 Build failures summary and remediations — as of commit 729b708 (Dr. CI): 💚 Looks good so far! There are no failures yet.
Can you update the commit message to explain the motivation for this change?

Updated.
```diff
   const auto b_scale = qb_contig.q_scale();
-  Tensor qy = at::_empty_affine_quantized(
+  Tensor qy = at::new_qtensor_cpu(
```
Could you use `at::native::empty_affine_quantized`?

Why do you think that would be better? Does it get around the dispatch overhead?

It is better because it has the same API as `at::_empty_affine_quantized`. Yes, it will get around the dispatch overhead.

Sure. Let me try.

This is still slower, but not by a whole lot. So I think it can make for a decent compromise.
From the flamegraph, it seems we are spending 40% of the time going through the dispatch stack. I think in a quantized model, where compute can take less time, such overheads become noticeable. Differential Revision: [D21093840](https://our.internmc.facebook.com/intern/diff/D21093840/) [ghstack-poisoned]
```cpp
#include <ATen/native/quantized/cpu/quantized_ops.h>
#include <ATen/native/quantized/cpu/init_qnnpack.h>
#include <ATen/native/quantized/cpu/qnnpack_utils.h>
#include <c10/core/TensorOptions.h>
```

can these includes be removed?
```cpp
#include <ATen/quantized/Quantizer.h>
#include <ATen/native/quantized/cpu/fbgemm_utils.h>
#include <ATen/native/quantized/cpu/qnnpack_utils.h>
#include <c10/core/TensorOptions.h>
```

can these includes be removed?

Yes, I was gonna do that.
|
This pull request has been merged in 1510bdd.
Summary: Pull Request resolved: pytorch#36814. ghstack-source-id: 103218412. From the flamegraph, it seems we are spending 40% of the time going through the dispatch stack. I think in a quantized model, where compute can take less time, such overheads become noticeable. Test Plan: Quantized op tests. Reviewed By: jerryzh168. Differential Revision: D21093840. fbshipit-source-id: 1b98b57eae403353596fc31171069d2f43b13385