Update QAT READMEs using new APIs by andrewor14 · Pull Request #1541 · pytorch/ao

andrewor14 · 2025-01-10T19:48:33Z

Stack from ghstack (oldest at bottom):

Add references to new QAT APIs including quantize_,
FakeQuantizedX, and the new embedding Quantizers and
ComposableQATQuantizer. Also link to new QAT + LoRA recipe
in torchtune.

To review on github: Files changed -> README.md -> View file

Summary: #1415 added a quantize_ QAT API for the prepare path. This commit adds the remaining convert path for users to actually perform end-to-end QAT using the quantize_ API. The new flow will look like: ``` from torchao.quantization import ( quantize_, int8_dynamic_activation_int4_weight, ) from torchao.quantization.qat import ( FakeQuantizeConfig, from_intx_quantization_aware_training, intx_quantization_aware_training, ) activation_config = FakeQuantizeConfig(torch.int8, "per_token", is_symmetric=False) weight_config = FakeQuantizeConfig(torch.int4, group_size=32) quantize_( my_model, intx_quantization_aware_training(activation_config, weight_config), ) quantize_(my_model, from_intx_quantization_aware_training()) quantize_(my_model, int8_dynamic_activation_int4_weight(group_size=32)) ``` Test Plan: python test/quantization/test_qat.py -k test_quantize_api_convert_path [ghstack-poisoned]

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

pytorch-bot · 2025-01-10T19:48:37Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1541

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3dce6a3 with merge base b5b739b ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. ghstack-source-id: 2f7b80e Pull Request resolved: #1541

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. ghstack-source-id: 2882f45 Pull Request resolved: #1541

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. ghstack-source-id: 0755ab8 Pull Request resolved: #1541

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. ghstack-source-id: 0755ab8 Pull Request resolved: #1541

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. ghstack-source-id: 00c7c5d Pull Request resolved: #1541

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. ghstack-source-id: 10bbe97 Pull Request resolved: #1541

jerryzh168 · 2025-01-10T21:14:56Z

+```
+
+
+### Quantizer API (legacy)


is there a deprecation plan for this API?

Not yet, but will come up with one

andrewor14 added 2 commits January 10, 2025 11:48

Update QAT READMEs using new APIs

50a8355

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

andrewor14 mentioned this pull request Jan 10, 2025

Add convert path for quantize_ QAT API #1540

Merged

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 10, 2025

andrewor14 added 2 commits January 10, 2025 11:52

Update base for Update on "Update QAT READMEs using new APIs"

ec94797

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Update on "Update QAT READMEs using new APIs"

fd55517

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

andrewor14 added the topic: documentation Use this tag if this PR adds or improves documentation label Jan 10, 2025

andrewor14 added 2 commits January 10, 2025 11:53

Update base for Update on "Update QAT READMEs using new APIs"

507a9e6

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Update on "Update QAT READMEs using new APIs"

eb21de1

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

andrewor14 added 2 commits January 10, 2025 11:55

Update base for Update on "Update QAT READMEs using new APIs"

72000a2

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Update on "Update QAT READMEs using new APIs"

102ec02

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

andrewor14 added 2 commits January 10, 2025 12:41

Update base for Update on "Update QAT READMEs using new APIs"

87bb7db

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Update on "Update QAT READMEs using new APIs"

be538d6

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

andrewor14 added 2 commits January 10, 2025 12:53

Update base for Update on "Update QAT READMEs using new APIs"

2dea276

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

Update on "Update QAT READMEs using new APIs"

3dce6a3

Add references to new QAT APIs including `quantize_`, `FakeQuantizedX`, and the new embedding Quantizers and ComposableQATQuantizer. Also link to new QAT + LoRA recipe in torchtune. [ghstack-poisoned]

andrewor14 requested review from jerryzh168 and msaroufim January 10, 2025 20:59

jerryzh168 reviewed Jan 10, 2025

View reviewed changes

jerryzh168 approved these changes Jan 10, 2025

View reviewed changes

andrewor14 changed the base branch from gh/andrewor14/10/base to main January 13, 2025 15:50

andrewor14 merged commit d57704c into main Jan 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update QAT READMEs using new APIs#1541

Update QAT READMEs using new APIs#1541
andrewor14 merged 12 commits into
mainfrom
gh/andrewor14/10/head

andrewor14 commented Jan 10, 2025 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Jan 10, 2025 •

edited

Loading

Uh oh!

jerryzh168 Jan 10, 2025

Uh oh!

andrewor14 Jan 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

andrewor14 commented Jan 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jan 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1541

✅ No Failures

Uh oh!

jerryzh168 Jan 10, 2025

Choose a reason for hiding this comment

Uh oh!

andrewor14 Jan 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

andrewor14 commented Jan 10, 2025 •

edited

Loading

pytorch-bot Bot commented Jan 10, 2025 •

edited

Loading