[Quant][X86] add an op to compute uint8 pointwise mul by Xia-Weiwen · Pull Request #151112 · pytorch/pytorch

Xia-Weiwen · 2025-04-11T14:51:58Z

Stack from ghstack (oldest at bottom):

-> [Quant][X86] add an op to compute uint8 pointwise mul #151112

Summary
Add a new op, onednn.qmul.tensor, for 8-bit quantized (uint8) elementwise multiplication. It accepts inputs on the CPU device (instead of QuantizedCPU).
The new op is implemented with AVX512 instructions. Depending on shape, it performs on par with or better than quantized.mul, its counterpart for the QuantizedCPU device.
Besides uint8, the new op supports fp32, fp16 and bf16 as output dtypes.
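For readers unfamiliar with quantized multiplication, the arithmetic such an op is expected to match can be sketched in plain Python: dequantize both uint8 inputs, multiply, then either requantize to uint8 or return the floating-point product. This is only a reference for the math under per-tensor affine quantization, not the AVX512 implementation in this PR; the names qmul_reference, dequantize and quantize_uint8 are hypothetical and do not reflect the signature of onednn.qmul.tensor.

```python
from typing import List, Sequence


def dequantize(q: Sequence[int], scale: float, zero_point: int) -> List[float]:
    """Map uint8 values back to real numbers: x = (q - zero_point) * scale."""
    return [(v - zero_point) * scale for v in q]


def quantize_uint8(x: Sequence[float], scale: float, zero_point: int) -> List[int]:
    """Map real numbers to uint8: q = clamp(round(x / scale) + zero_point, 0, 255).

    Python's round() rounds halves to even, the same rule torch.round uses.
    """
    return [min(255, max(0, round(v / scale) + zero_point)) for v in x]


def qmul_reference(
    qa: Sequence[int], s_a: float, z_a: int,
    qb: Sequence[int], s_b: float, z_b: int,
    s_c: float = 1.0, z_c: int = 0,
    to_uint8: bool = True,
) -> list:
    """Hypothetical reference: elementwise product of two quantized inputs.

    With to_uint8=True the product is requantized with (s_c, z_c);
    otherwise the floating-point product is returned unchanged.
    """
    product = [
        a * b
        for a, b in zip(dequantize(qa, s_a, z_a), dequantize(qb, s_b, z_b))
    ]
    return quantize_uint8(product, s_c, z_c) if to_uint8 else product
```

A test for the real op would compare its output against a reference of this kind for each supported output dtype, which is the approach the test snippet quoted in the review thread takes with PyTorch's own quantize and dequantize functions.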

Test plan

pytest test/quantization/core/test_quantized_op.py -k test_int8_mul_onednn

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168


pytorch-bot · 2025-04-11T14:52:02Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/151112


✅ No Failures

As of commit 8afe2e8 with merge base 7f28c03:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.


Xia-Weiwen · 2025-04-18T08:58:29Z

Hi @jerryzh168 Could you please review this PR? Thanks.

Xia-Weiwen · 2025-04-22T01:41:20Z

Hi @jerryzh168 Could you please review this PR? Thanks.


jerryzh168 · 2025-04-24T19:56:27Z

+            qa = torch.quantize_per_tensor(a, s_a, z_a, torch.quint8)
+            qb = torch.quantize_per_tensor(b, s_b, z_b, torch.quint8)
+            dqa = qa.dequantize()
+            dqb = qb.dequantize()
+            c_ref = dqa * dqb
+            if output_dtype == torch.uint8:
+                c_ref = torch.quantize_per_tensor(c_ref, s_c, z_c, torch.quint8).int_repr()


We have the quantized_decomposed ops, which are more recent, btw.

Xia-Weiwen · 2025-04-25

Thanks for the suggestion. I have changed it to the new version.


Xia-Weiwen · 2025-04-25T10:12:21Z

@pytorchbot merge

pytorchmergebot · 2025-04-25T10:15:12Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).


Update

8eb4d3e

[ghstack-poisoned]

Xia-Weiwen requested review from digantdesai, jerryzh168, jianyuh, kimishpatel and salilsdesai as code owners April 11, 2025 14:51

Xia-Weiwen mentioned this pull request Apr 11, 2025

[Quant][PT2E][X86] Enable annotation of aten.mul.tensor with X86InductorQuantizer #150831

Closed

pytorch-bot Bot added the module: cpu and release notes: quantization labels Apr 11, 2025

Xia-Weiwen added a commit that referenced this pull request Apr 11, 2025

[Quant][X86] add an op to compute uint8 pointwise mul

c4fa9b1

ghstack-source-id: badb5cb Pull Request resolved: #151112

Xia-Weiwen marked this pull request as draft April 11, 2025 15:01

Xia-Weiwen removed request for digantdesai, jerryzh168, jianyuh, kimishpatel and salilsdesai April 11, 2025 15:02

pytorchbot added the open source label Apr 11, 2025

Xia-Weiwen added 2 commits April 14, 2025 01:05

Update

04a41cf

[ghstack-poisoned]

Update

b21dbc0

[ghstack-poisoned]

Xia-Weiwen requested a review from leslie-fang-intel April 14, 2025 12:41

leslie-fang-intel approved these changes Apr 15, 2025


Xia-Weiwen marked this pull request as ready for review April 15, 2025 05:04

Xia-Weiwen added the intel label Apr 17, 2025

Xia-Weiwen requested a review from jerryzh168 April 17, 2025 00:33

Update

2b9c7f1

[ghstack-poisoned]

Xia-Weiwen added a commit that referenced this pull request Apr 18, 2025

[Quant][X86] add an op to compute uint8 pointwise mul

924b64d

ghstack-source-id: 76b8871 Pull Request resolved: #151112

Divigroup-RAP pushed a commit to Divigroup-RAP/PYTORCH that referenced this pull request Apr 22, 2025

[Quant][X86] add an op to compute uint8 pointwise mul

8172fa5

ghstack-source-id: b785a3d Pull Request resolved: pytorch/pytorch#151112

jerryzh168 reviewed Apr 24, 2025


jerryzh168 approved these changes Apr 24, 2025


Update

a09f4bd

[ghstack-poisoned]

Update

8afe2e8

[ghstack-poisoned]

Xia-Weiwen added a commit that referenced this pull request Apr 25, 2025

[Quant][X86] add an op to compute uint8 pointwise mul

40c34e0

ghstack-source-id: 0c4a635 Pull Request resolved: #151112

pytorch-bot Bot added the ciflow/trunk label Apr 25, 2025

pytorchmergebot added the merging label Apr 25, 2025

pytorchmergebot added the Merged label Apr 25, 2025

pytorchmergebot closed this in c1c8c1f Apr 25, 2025

pytorchmergebot removed the merging label Apr 25, 2025

github-actions Bot deleted the gh/Xia-Weiwen/37/head branch June 14, 2025 02:15


Xia-Weiwen wants to merge 6 commits into gh/Xia-Weiwen/37/base from gh/Xia-Weiwen/37/head


5 participants
