[PT2E][X86] Add Inductor fusion passes of float8 qconv for X86Inductor backend#3261
Merged
Xia-Weiwen merged 4 commits into pytorch:main on Nov 26, 2025
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3261
Force-pushed from 00600ee to d1163ba
Force-pushed from d1163ba to fa88f2f
Xia-Weiwen approved these changes on Nov 19, 2025
Force-pushed from 4e834a6 to c92f1ee
jerryzh168 approved these changes on Nov 26, 2025
jerryzh168 (Contributor) left a comment:
Stamping, since this doesn't touch the main torchao API or user API.
Can you also share performance metrics from before and after the fusion?
Collaborator reply:
Thanks for reviewing. These fusion passes are actually for lowering, i.e., to fuse the pattern
andrewor14 added a commit that referenced this pull request on Nov 26, 2025
jcaip added a commit that referenced this pull request on Dec 2, 2025
namgyu-youn pushed a commit to namgyu-youn/ao that referenced this pull request on Dec 19, 2025:
…r backend (pytorch#3261)
* [Inductor][float8] Register qconv weight prepack pass for float8
* [Inductor][float8] Register qconv-unary fusion pass for float8
* [Inductor][float8] Register qconv-binary fusion pass for float8
* add comments
namgyu-youn pushed a commit to namgyu-youn/ao that referenced this pull request on Dec 19, 2025:
Revert "Fix style after pytorch#3261 (pytorch#3397)"
This reverts commit 316ef03.
Summary:
This PR adds support for QConv weight prepacking, QConv unary fusion, and QConv binary fusion for the fp8 data type in Inductor, in order to lower the reference quantized model to the X86Inductor backend.
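The idea behind such a lowering fusion can be illustrated with a toy pass: a reference quantized model contains chains like dequantize → conv → activation → quantize, and the lowering pass rewrites each matched chain into a single fused quantized-conv op. The sketch below is purely illustrative; the function name `fuse_qconv_unary`, the op strings, and the list representation are assumptions for demonstration, not the actual torchao/Inductor pattern-matching API (real passes match patterns on an FX graph).

```python
# Toy sketch of a qconv-unary fusion pass (illustrative only, not the
# real Inductor API). The "graph" is modeled as a flat list of op names.

def fuse_qconv_unary(ops):
    """Collapse each [dequantize, conv, relu, quantize] chain into one fused op."""
    pattern = ["dequantize", "conv", "relu", "quantize"]
    fused = []
    i = 0
    while i < len(ops):
        if ops[i:i + len(pattern)] == pattern:
            # Replace the whole matched chain with a single fused kernel call.
            fused.append("qconv_relu")
            i += len(pattern)
        else:
            fused.append(ops[i])
            i += 1
    return fused


graph = ["dequantize", "conv", "relu", "quantize", "add"]
print(fuse_qconv_unary(graph))  # -> ['qconv_relu', 'add']
```

The benefit of the real fusion is analogous: the fused op avoids materializing the intermediate dequantized tensor, which is why such passes are applied during lowering rather than exposed through the user-facing API.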