Add torchao_convert to PARQ's QuantOptimizer by lisjin · Pull Request #2947 · pytorch/ao

lisjin · 2025-09-05T23:25:36Z

The original plan was to manually replace each weight tensor with an instance of IntxUnpackedToInt8Tensor. However, I found that this already happens in IntxWeightOnlyConfig and Int8DynamicActivationIntxWeightConfig when they are initialized with version=2.

I leverage this code and call quantize_ once per regularized param_group in QuantOptimizer.torchao_convert
For params quantized with StretchedUnifTorchaoQuantizer, I fetch the qparams, scale, zero_point to initialize IntxUnpackedToInt8Tensor (as suggested by Scott)

The PR also adds the num_steps property to QuantOptimizer to mirror D81526700.

pytorch-bot · 2025-09-05T23:26:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2947

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 4cfa628 with merge base b99904b ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

metascroy · 2025-09-10T05:50:33Z

Thanks! Overall looks great! Left some comments.

andrewor14

Looks good from my side, will let @metascroy do a final pass/stamp

metascroy · 2025-09-10T22:41:51Z

Overall, I think it looks great! Approving PR, but address the comments above before landing.

lisjin · 2025-09-11T00:24:07Z

Thanks for the comments @andrewor14 @metascroy :) I think I've addressed them all now, but let me know if I missed anything!

jerryzh168 · 2025-09-11T01:24:11Z

+    block_size = (1, group_size)
+    target_dtype = torch.int8
+    q_args = (weight, mapping_type, block_size, target_dtype, config.b)
+    if config.version == 2:


for this it's probably fine to break BC soon since it's a prototype feature?

Got it, thanks. I kept the version=1 convention for this config to preserve the old functionality (StretchedAffineQuantizedTensor). The new version=2 will convert tensors to IntxUnpackedToInt8Tensor, which is consistent with other config classes like IntxWeightOnlyConfig

@metascroy Should we remove StretchedAffineQuantizedTensor entirely?

I have no objections to removing it. I assumed you wanted it for Int4CPULayout, but if you don't need that, I think it's fine to remove.

On ExecuTorch side, IntxUnpackedToInt8 tensor will be sufficient

I think you're right about CPU support. I'll leave it for now but consider removing it in the future

Int4CPULayout is also moved to v2 already: #2845 you can set int4_packing_format to plain_int32 to get it

lisjin requested a review from metascroy September 5, 2025 23:25

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 5, 2025

lisjin added the topic: new feature Use this tag if this PR adds a new feature label Sep 5, 2025

metascroy reviewed Sep 5, 2025

View reviewed changes

Comment thread torchao/prototype/parq/optim/quantopt.py Outdated

lisjin force-pushed the lvj/parq-convert branch 4 times, most recently from d593e74 to 8e39be5 Compare September 8, 2025 14:58

lisjin changed the title ~~First attempt at QuantOptimizer.torchao_convert~~ Add torchao_convert to PARQ's QuantOptimizer Sep 8, 2025

lisjin commented Sep 8, 2025

View reviewed changes

Comment thread torchao/prototype/parq/quant/config_torchao.py

lisjin force-pushed the lvj/parq-convert branch from 3bdfd2a to 01e14bd Compare September 8, 2025 15:42

lisjin commented Sep 8, 2025

View reviewed changes

Comment thread torchao/prototype/parq/optim/quantopt.py

lisjin marked this pull request as ready for review September 8, 2025 16:41

lisjin requested a review from metascroy September 8, 2025 17:19

lisjin force-pushed the lvj/parq-convert branch 2 times, most recently from a3c4e94 to e608880 Compare September 8, 2025 17:37

lisjin requested a review from andrewor14 September 9, 2025 12:44

metascroy reviewed Sep 10, 2025

View reviewed changes

Comment thread torchao/prototype/parq/quant/config_torchao.py

metascroy reviewed Sep 10, 2025

View reviewed changes

Comment thread torchao/prototype/parq/quant/config_torchao.py Outdated

metascroy reviewed Sep 10, 2025

View reviewed changes

Comment thread torchao/prototype/parq/quant/config_torchao.py Outdated

lisjin force-pushed the lvj/parq-convert branch 3 times, most recently from 1beacfa to f352a85 Compare September 10, 2025 16:52

lisjin added 5 commits September 10, 2025 13:01

First attempt at QuantOptimizer.torchao_convert

deda8f0

Use Scott's IntxUnpackedToInt8Tensor conversion

ae01552

Refactor torchao.prototype.parq.quant.quant_api

a49c672

Add check_torchao_tensor_subclass

e4fc74c

PackingFormat -> IntxPackingFormat

a8ea3dd

lisjin added 3 commits September 10, 2025 13:01

Address Scott's comments

20b9261

Fix test_dynamic_activation_lut.py

542f6a9

Add HF quantization config in torchao_convert

38cf423

lisjin force-pushed the lvj/parq-convert branch from b0f15c8 to 38cf423 Compare September 10, 2025 20:02

andrewor14 reviewed Sep 10, 2025

View reviewed changes

Comment thread torchao/prototype/parq/quant/config_torchao.py Outdated

Comment thread torchao/prototype/parq/quant/config_torchao.py Outdated

Comment thread torchao/prototype/parq/quant/config_torchao.py Outdated

Comment thread torchao/prototype/parq/optim/quantopt.py Outdated

Fix fbgemm-gpu-genai import error

b75a0f1

metascroy reviewed Sep 10, 2025

View reviewed changes

Comment thread torchao/prototype/parq/optim/quantopt.py Outdated

metascroy reviewed Sep 10, 2025

View reviewed changes

Comment thread test/prototype/test_parq.py

metascroy reviewed Sep 10, 2025

View reviewed changes

Comment thread torchao/prototype/parq/quant/config_torchao.py Outdated

metascroy approved these changes Sep 10, 2025

View reviewed changes

lisjin force-pushed the lvj/parq-convert branch from ec5e1f7 to 01582ae Compare September 11, 2025 00:13

Address some comments

1ccb298

lisjin force-pushed the lvj/parq-convert branch 3 times, most recently from e33264a to a854d79 Compare September 11, 2025 01:01

Rename to StretchedIntxWeightConfig

1ed5b32

lisjin force-pushed the lvj/parq-convert branch from a854d79 to 1ed5b32 Compare September 11, 2025 01:02

jerryzh168 reviewed Sep 11, 2025

View reviewed changes

Fix fbgemm-gpu-genai error for test_int4_weight_only

4cfa628

lisjin merged commit 481be64 into main Sep 11, 2025
18 checks passed

lisjin deleted the lvj/parq-convert branch September 11, 2025 17:10

metascroy mentioned this pull request Sep 11, 2025

Updates LUT tensor and new convert API #2984

Merged

lisjin mentioned this pull request Sep 16, 2025

Fix torchao_convert, remove StretchedAffineQuantizedTensor #3015

Merged

Conversation

lisjin commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2947

✅ No Failures

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

metascroy commented Sep 10, 2025

Uh oh!

andrewor14 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

metascroy commented Sep 10, 2025

Uh oh!

lisjin commented Sep 11, 2025

Uh oh!

jerryzh168 Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

lisjin Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

metascroy Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

lisjin Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

jerryzh168 Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

lisjin commented Sep 5, 2025 •

edited

Loading

pytorch-bot Bot commented Sep 5, 2025 •

edited

Loading