Delete deprecated PlainLayout, PlainAQTTensorImpl and related v1 code paths #4151

Merged

jerryzh168 merged 63 commits into main on Apr 2, 2026

Conversation
[ghstack-poisoned]
…nd related code [ghstack-poisoned]
… paths

- Remove PlainLayout class from dtypes/utils.py
- Delete torchao/dtypes/uintx/plain_layout.py
- Remove int8 weight and int8 dynamic activation dispatch from the AQT dispatch table
- Remove AQT embedding dispatch (used PlainAQTTensorImpl)
- Change Int8WeightOnlyConfig default to version=2 (removes the v1 AQT path)
- Change Int8DynamicActivationInt8WeightConfig default to version=2 (removes the v1 AQT path)
- Remove PlainLayout from public exports
- Update tests to use v2 tensor types

[ghstack-poisoned]
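The version-gated default described above can be sketched as follows. This is a hedged, stdlib-only illustration: `Int8WeightOnlyConfigSketch` and `select_quant_path` are hypothetical stand-ins for torchao's `Int8WeightOnlyConfig` and its dispatch, not the real implementation.

```python
from dataclasses import dataclass


# Hypothetical stand-in for torchao's Int8WeightOnlyConfig, illustrating
# the default flip described above; all names here are illustrative only.
@dataclass
class Int8WeightOnlyConfigSketch:
    version: int = 2  # this PR changed the default from 1 to 2


def select_quant_path(config: Int8WeightOnlyConfigSketch) -> str:
    # v1 built AffineQuantizedTensor with PlainLayout (deleted by this PR);
    # v2 builds the newer Int8Tensor type instead.
    if config.version == 1:
        raise NotImplementedError("the v1 AQT path was removed")
    return "v2-Int8Tensor-path"


print(select_quant_path(Int8WeightOnlyConfigSketch()))  # v2-Int8Tensor-path
```

With the new default, callers who never set `version` silently move to the v2 path; requesting the deleted v1 path now fails loudly rather than producing a PlainLayout tensor.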
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/4151
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 662c670 with merge base 0c29e81.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This was referenced Mar 23, 2026
jerryzh168 added a commit that referenced this pull request on Apr 2, 2026
… paths

- Remove PlainLayout class from dtypes/utils.py
- Delete torchao/dtypes/uintx/plain_layout.py
- Remove int8 weight and int8 dynamic activation dispatch from the AQT dispatch table
- Remove AQT embedding dispatch (used PlainAQTTensorImpl)
- Change Int8WeightOnlyConfig default to version=2 (removes the v1 AQT path)
- Change Int8DynamicActivationInt8WeightConfig default to version=2 (removes the v1 AQT path)
- Remove PlainLayout from public exports
- Update tests to use v2 tensor types

ghstack-source-id: 630ea48
Pull Request resolved: #4151
jerryzh168 added a commit that referenced this pull request on Apr 2, 2026
… paths

- Remove PlainLayout class from dtypes/utils.py
- Delete torchao/dtypes/uintx/plain_layout.py
- Remove int8 weight and int8 dynamic activation dispatch from the AQT dispatch table
- Remove AQT embedding dispatch (used PlainAQTTensorImpl)
- Change Int8WeightOnlyConfig default to version=2 (removes the v1 AQT path)
- Change Int8DynamicActivationInt8WeightConfig default to version=2 (removes the v1 AQT path)
- Remove PlainLayout from public exports
- Update tests to use v2 tensor types

ghstack-source-id: fcdc7a0
Pull Request resolved: #4151
andrewor14 added a commit that referenced this pull request on Apr 7, 2026
AffineQuantizedTensor was the v1 quantized tensor system, now fully superseded by v2 tensor types (Int8Tensor, Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that inherit from TorchAOBaseTensor.

Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed the AQTTensorImpl class
- torchao/dtypes/__init__.py: removed all AQT exports
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py

Core updates:
- quant_api.py: removed AQT from the _is_linear check, removed 5 dead activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references

Prototype migrations in this commit:
- torchao/prototype/autoround/: migrated off AQT; uses IntxUnpackedToInt8Tensor and TorchAOBaseTensor
- torchao/prototype/quantization/mixed_precision/: added an assertion error since the feature was already broken by the PlainLayout deletion (#4151)

Still broken (predates this commit, tracked with TODOs):
- torchao/prototype/dtypes/uintx/uintx_utils.py (AQTTensorImpl deleted)
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)

Docs/comments only (not broken, just stale references):
- torchao/prototype/quantization/module_swap/ (README)
- torchao/prototype/parq/ (README)
- torchao/prototype/quantized_training/ (comments)
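The migration pattern described above (replacing checks for AffineQuantizedTensor with checks against the shared base class) can be sketched like this. The classes here are hypothetical stand-ins for `torchao.utils.TorchAOBaseTensor` and its v2 subclasses, used only to illustrate the isinstance shape:

```python
# Hypothetical stand-ins: in torchao these would be TorchAOBaseTensor and
# v2 tensor types such as Int8Tensor / Int4Tensor.
class TorchAOBaseTensorSketch:
    pass


class Int8TensorSketch(TorchAOBaseTensorSketch):
    pass


def is_torchao_quantized(weight) -> bool:
    # One base-class check covers every v2 tensor type, so new tensor
    # subclasses need no per-type updates in callers.
    return isinstance(weight, TorchAOBaseTensorSketch)


print(is_torchao_quantized(Int8TensorSketch()))  # True
print(is_torchao_quantized(3.14))                # False
```

The design benefit is that call sites like autoround no longer name individual quantized tensor classes, so deleting AQT (or adding a new v2 type) does not ripple through them.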
andrewor14 added a commit that referenced this pull request on Apr 8, 2026
AffineQuantizedTensor was the v1 quantized tensor system, now fully superseded by v2 tensor types (Int8Tensor, Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that inherit from TorchAOBaseTensor.

Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed the Layout and AQTTensorImpl classes
- torchao/dtypes/__init__.py: removed all AQT and Layout exports
- torchao/utils.py: removed _register_layout, _get_tensor_impl_constructor, and their classmethod registrations on TorchAOBaseTensor
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py

Core updates:
- quant_api.py: removed AQT from the _is_linear check, removed 5 dead activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references

Prototype status:
- prototype/autoround/: everything works except `apply_auto_round()`, which was already broken before
- prototype/dtypes/uintx/uintx_utils.py: removed UintxLayout, UintxAQTTensorImpl, and AQT imports (fixes codebook import breakage)
- prototype/quantization/mixed_precision/: added an assertion error since the feature was already broken by the PlainLayout deletion (#4151)
- prototype/parq: removed an unused layout field

Still broken (tracked with TODOs):
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)
- tutorials/developer_api_guide/ (uses Layout)

Docs/comments only (not broken, just stale references):
- prototype/quantization/module_swap/ (README)
- prototype/parq/ (README)
- prototype/quantized_training/ (comments)
andrewor14 added a commit that referenced this pull request on Apr 8, 2026
**Summary:** AffineQuantizedTensor was the v1 quantized tensor
system, now fully superseded by v2 tensor types (Int8Tensor,
Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that
inherit from TorchAOBaseTensor.
**BC-Breaking notes:**
Before (AQT):
```python
from torchao.dtypes import to_affine_quantized_intx
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# Low-level AQT API
weight = to_affine_quantized_intx(
    weight, mapping_type, block_size, target_dtype,
    quant_min, quant_max, eps, _layout=Layout(),
)
# High-level API (unchanged)
quantize_(model, Int4WeightOnlyConfig())
```
After (v2 tensors):
```python
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# High-level API (unchanged, recommended)
quantize_(model, Int4WeightOnlyConfig())
# Low-level v2 API (if needed)
from torchao.quantization import Int4Tensor, IntxUnpackedToInt8Tensor
weight = Int4Tensor.from_hp(weight, block_size)
# or, alternatively:
weight = IntxUnpackedToInt8Tensor.from_hp(weight, block_size, torch.int4)
```
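For code that still imports the deleted v1 entry point, a feature-detect guard avoids a hard ImportError at module load. This is a minimal sketch; `to_affine_quantized_intx` is the import shown in the Before example, and whether it resolves depends on the installed torchao version:

```python
# Hedged sketch: detect at runtime whether the deprecated v1 AQT entry
# point still exists, so callers can branch instead of crashing on import.
try:
    from torchao.dtypes import to_affine_quantized_intx  # removed with AQT
    HAS_V1_AQT = True
except ImportError:
    # torchao not installed, or a release where the v1 API was deleted
    HAS_V1_AQT = False

print(HAS_V1_AQT)
```

Code gated on `HAS_V1_AQT` can then fall back to the v2 `from_hp` constructors shown above.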
**Detailed changes:**
Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed Layout class and AQTTensorImpl class
- torchao/dtypes/__init__.py: removed all AQT and Layout exports
- torchao/utils.py: removed _register_layout, _get_tensor_impl_constructor,
and their classmethod registrations on TorchAOBaseTensor
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py
Core updates:
- quant_api.py: removed AQT from _is_linear check, removed 5 dead
activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references
Prototype updates:
- prototype/autoround/: removed broken AQT imports, updated isinstance
checks to TorchAOBaseTensor. Everything works except apply_auto_round()
which was already broken before this PR (issue #1690).
- prototype/dtypes/uintx/uintx_utils.py: removed UintxLayout,
UintxAQTTensorImpl, and AQT imports (fixes codebook import breakage)
- prototype/quantization/mixed_precision/: added assertion error since
feature was already broken by PlainLayout deletion (#4151)
Still broken (tracked with TODOs):
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)
- tutorials/developer_api_guide/ (uses Layout)
Docs/comments only (not broken, just stale references):
- prototype/quantization/module_swap/ (README)
- prototype/parq/ (README)
- prototype/quantized_training/ (comments)
andrewor14
added a commit
that referenced
this pull request
Apr 8, 2026
**Summary:** AffineQuantizedTensor was the v1 quantized tensor
system, now fully superseded by v2 tensor types (Int8Tensor,
Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that
inherit from TorchAOBaseTensor.
**BC-Breaking notes:**
Before (AQT):
```python
from torchao.dtypes import to_affine_quantized_intx
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# Low-level AQT API
weight = to_affine_quantized_intx(
weight, mapping_type, block_size, target_dtype,
quant_min, quant_max, eps, _layout=Layout(),
)
# High-level API (unchanged)
quantize_(model, Int4WeightOnlyConfig())
```
After (v2 tensors):
```python
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# High-level API (unchanged, recommended)
quantize_(model, Int4WeightOnlyConfig())
# Low-level v2 API (if needed)
from torchao.quantization import Int4Tensor, IntxUnpackedToInt8Tensor
weight = Int4Tensor.from_hp(weight, block_size)
weight = IntxUnpackedToInt8Tensor.from_hp(weight, block_size, torch.int4)
```
**Detailed changes:**
Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed Layout class and AQTTensorImpl class
- torchao/dtypes/__init__.py: removed all AQT and Layout exports
- torchao/utils.py: removed _register_layout, _get_tensor_impl_constructor,
and their classmethod registrations on TorchAOBaseTensor
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py
Core updates:
- quant_api.py: removed AQT from _is_linear check, removed 5 dead
activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references
Prototype updates:
- prototype/autoround/: removed broken AQT imports, updated isinstance
checks to TorchAOBaseTensor. Everything works except apply_auto_round()
which was already broken before this PR (issue #1690).
- prototype/dtypes/uintx/uintx_utils.py: removed UintxLayout,
UintxAQTTensorImpl, and AQT imports (fixes codebook import breakage)
- prototype/quantization/mixed_precision/: added assertion error since
feature was already broken by PlainLayout deletion (#4151)
Still broken (tracked with TODOs):
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)
- tutorials/developer_api_guide/ (uses Layout)
Docs/comments only (not broken, just stale references):
- prototype/quantization/module_swap/ (README)
- prototype/parq/ (README)
- prototype/quantized_training/ (comments)
andrewor14
added a commit
that referenced
this pull request
Apr 8, 2026
**Summary:** AffineQuantizedTensor was the v1 quantized tensor
system, now fully superseded by v2 tensor types (Int8Tensor,
Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that
inherit from TorchAOBaseTensor.
**BC-Breaking notes:**
Before (AQT):
```python
from torchao.dtypes import to_affine_quantized_intx
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# Low-level AQT API
weight = to_affine_quantized_intx(
weight, mapping_type, block_size, target_dtype,
quant_min, quant_max, eps, _layout=Layout(),
)
# High-level API (unchanged)
quantize_(model, Int4WeightOnlyConfig())
```
After (v2 tensors):
```python
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# High-level API (unchanged, recommended)
quantize_(model, Int4WeightOnlyConfig())
# Low-level v2 API (if needed)
from torchao.quantization import Int4Tensor, IntxUnpackedToInt8Tensor
weight = Int4Tensor.from_hp(weight, block_size)
weight = IntxUnpackedToInt8Tensor.from_hp(weight, block_size, torch.int4)
```
**Detailed changes:**
Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed Layout class and AQTTensorImpl class
- torchao/dtypes/__init__.py: removed all AQT and Layout exports
- torchao/utils.py: removed _register_layout, _get_tensor_impl_constructor,
and their classmethod registrations on TorchAOBaseTensor
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py
Core updates:
- quant_api.py: removed AQT from _is_linear check, removed 5 dead
activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references
Prototype updates:
- prototype/autoround/: removed broken AQT imports, updated isinstance
checks to TorchAOBaseTensor. Everything works except apply_auto_round()
which was already broken before this PR (issue #1690).
- prototype/dtypes/uintx/uintx_utils.py: removed UintxLayout,
UintxAQTTensorImpl, and AQT imports (fixes codebook import breakage)
- prototype/quantization/mixed_precision/: added assertion error since
feature was already broken by PlainLayout deletion (#4151)
Still broken (tracked with TODOs):
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)
- tutorials/developer_api_guide/ (uses Layout)
Docs/comments only (not broken, just stale references):
- prototype/quantization/module_swap/ (README)
- prototype/parq/ (README)
- prototype/quantized_training/ (comments)
andrewor14
added a commit
that referenced
this pull request
Apr 8, 2026
**Summary:** AffineQuantizedTensor was the v1 quantized tensor
system, now fully superseded by v2 tensor types (Int8Tensor,
Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that
inherit from TorchAOBaseTensor.
**BC-Breaking notes:**
Before (AQT):
```python
from torchao.dtypes import to_affine_quantized_intx
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# Low-level AQT API
weight = to_affine_quantized_intx(
weight, mapping_type, block_size, target_dtype,
quant_min, quant_max, eps, _layout=Layout(),
)
# High-level API (unchanged)
quantize_(model, Int4WeightOnlyConfig())
```
After (v2 tensors):
```python
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# High-level API (unchanged, recommended)
quantize_(model, Int4WeightOnlyConfig())
# Low-level v2 API (if needed)
from torchao.quantization import Int4Tensor, IntxUnpackedToInt8Tensor
weight = Int4Tensor.from_hp(weight, block_size)
weight = IntxUnpackedToInt8Tensor.from_hp(weight, block_size, torch.int4)
```
**Detailed changes:**
Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed Layout class and AQTTensorImpl class
- torchao/dtypes/__init__.py: removed all AQT and Layout exports
- torchao/utils.py: removed _register_layout, _get_tensor_impl_constructor,
and their classmethod registrations on TorchAOBaseTensor
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py
Core updates:
- quant_api.py: removed AQT from _is_linear check, removed 5 dead
activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references
Prototype updates:
- prototype/autoround/: removed broken AQT imports, updated isinstance
checks to TorchAOBaseTensor. Everything works except apply_auto_round()
which was already broken before this PR (issue #1690).
- prototype/dtypes/uintx/uintx_utils.py: removed UintxLayout,
UintxAQTTensorImpl, and AQT imports (fixes codebook import breakage)
- prototype/quantization/mixed_precision/: added assertion error since
feature was already broken by PlainLayout deletion (#4151)
Still broken (tracked with TODOs):
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)
- tutorials/developer_api_guide/ (uses Layout)
Docs/comments only (not broken, just stale references):
- prototype/quantization/module_swap/ (README)
- prototype/parq/ (README)
- prototype/quantized_training/ (comments)
andrewor14
added a commit
that referenced
this pull request
Apr 8, 2026
**Summary:** AffineQuantizedTensor was the v1 quantized tensor
system, now fully superseded by v2 tensor types (Int8Tensor,
Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that
inherit from TorchAOBaseTensor.
**BC-Breaking notes:**
Before (AQT):
```python
from torchao.dtypes import to_affine_quantized_intx
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# Low-level AQT API
weight = to_affine_quantized_intx(
weight, mapping_type, block_size, target_dtype,
quant_min, quant_max, eps, _layout=Layout(),
)
# High-level API (unchanged)
quantize_(model, Int4WeightOnlyConfig())
```
After (v2 tensors):
```python
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# High-level API (unchanged, recommended)
quantize_(model, Int4WeightOnlyConfig())
# Low-level v2 API (if needed)
from torchao.quantization import Int4Tensor, IntxUnpackedToInt8Tensor
weight = Int4Tensor.from_hp(weight, block_size)
weight = IntxUnpackedToInt8Tensor.from_hp(weight, block_size, torch.int4)
```
**Detailed changes:**
Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed Layout class and AQTTensorImpl class
- torchao/dtypes/__init__.py: removed all AQT and Layout exports
- torchao/utils.py: removed _register_layout, _get_tensor_impl_constructor,
and their classmethod registrations on TorchAOBaseTensor
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py
Core updates:
- quant_api.py: removed AQT from _is_linear check, removed 5 dead
activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references
Prototype updates:
- prototype/autoround/: removed broken AQT imports, updated isinstance
checks to TorchAOBaseTensor. Everything works except apply_auto_round()
which was already broken before this PR (issue #1690).
- prototype/dtypes/uintx/uintx_utils.py: removed UintxLayout,
UintxAQTTensorImpl, and AQT imports (fixes codebook import breakage)
- prototype/quantization/mixed_precision/: added assertion error since
feature was already broken by PlainLayout deletion (#4151)
Still broken (tracked with TODOs):
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)
- tutorials/developer_api_guide/ (uses Layout)
Docs/comments only (not broken, just stale references):
- prototype/quantization/module_swap/ (README)
- prototype/parq/ (README)
- prototype/quantized_training/ (comments)
andrewor14
added a commit
that referenced
this pull request
Apr 8, 2026
**Summary:** AffineQuantizedTensor was the v1 quantized tensor
system, now fully superseded by v2 tensor types (Int8Tensor,
Int4Tensor, Float8Tensor, IntxUnpackedToInt8Tensor, etc.) that
inherit from TorchAOBaseTensor.
**BC-Breaking notes:**
Before (AQT):
```python
from torchao.dtypes import to_affine_quantized_intx
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# Low-level AQT API
weight = to_affine_quantized_intx(
weight, mapping_type, block_size, target_dtype,
quant_min, quant_max, eps, _layout=Layout(),
)
# High-level API (unchanged)
quantize_(model, Int4WeightOnlyConfig())
```
After (v2 tensors):
```python
from torchao.quantization import quantize_, Int4WeightOnlyConfig
# High-level API (unchanged, recommended)
quantize_(model, Int4WeightOnlyConfig())
# Low-level v2 API (if needed)
from torchao.quantization import Int4Tensor, IntxUnpackedToInt8Tensor
weight = Int4Tensor.from_hp(weight, block_size)
weight = IntxUnpackedToInt8Tensor.from_hp(weight, block_size, torch.int4)
```
**Detailed changes:**
Core deletions:
- torchao/dtypes/affine_quantized_tensor.py (class definition)
- torchao/dtypes/affine_quantized_tensor_ops.py (aten dispatch)
- torchao/dtypes/floatx/, torchao/dtypes/uintx/ (empty subpackages)
- torchao/dtypes/README.md (stale AQT-centric docs)
- torchao/dtypes/utils.py: removed Layout class and AQTTensorImpl class
- torchao/dtypes/__init__.py: removed all AQT and Layout exports
- torchao/utils.py: removed _register_layout, _get_tensor_impl_constructor,
and their classmethod registrations on TorchAOBaseTensor
- test/dtypes/test_affine_quantized.py
- test/dtypes/test_affine_quantized_tensor_parallel.py
Core updates:
- quant_api.py: removed AQT from _is_linear check, removed 5 dead
activation quant helpers
- testing/utils.py: switched defaults from AQT to Int8Tensor
- Updated test assertions, docstrings, and docs to remove AQT references
Prototype updates:
- prototype/autoround/: removed broken AQT imports, updated isinstance
checks to TorchAOBaseTensor. Everything works except apply_auto_round()
which was already broken before this PR (issue #1690).
- prototype/dtypes/uintx/uintx_utils.py: removed UintxLayout,
UintxAQTTensorImpl, and AQT imports (fixes codebook import breakage)
- prototype/quantization/mixed_precision/: added assertion error since
feature was already broken by PlainLayout deletion (#4151)
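The isinstance migration above collapses many per-subclass checks into a single base-class check: since every v2 tensor type inherits from TorchAOBaseTensor, one test covers them all. A framework-free analogy of the pattern (class names here are stand-ins, not torchao's):

```python
class TorchAOBaseTensorLike:
    """Stand-in for torchao.utils.TorchAOBaseTensor."""


class Int8TensorLike(TorchAOBaseTensorLike):
    """Stand-in for a v2 quantized tensor subclass."""


class Int4TensorLike(TorchAOBaseTensorLike):
    """Another stand-in subclass."""


def is_quantized(t):
    # One base-class check covers all current and future v2 subclasses,
    # replacing per-type checks like isinstance(t, (AQT, Int8Tensor, ...)).
    return isinstance(t, TorchAOBaseTensorLike)
```

New tensor types added later pass the check automatically, which is why the base-class test is more maintainable than enumerating concrete classes.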
Still broken (tracked with TODOs):
- tutorials/calibration_flow/ (uses to_affine_quantized_intx_static)
- tutorials/developer_api_guide/ (uses Layout)
Docs/comments only (not broken, just stale references):
- prototype/quantization/module_swap/ (README)
- prototype/parq/ (README)
- prototype/quantized_training/ (comments)
andrewor14 added a commit that referenced this pull request on Apr 9, 2026
brucechanglongxu pushed a commit to brucechanglongxu/ao that referenced this pull request on Apr 9, 2026
brucechanglongxu pushed a commit to brucechanglongxu/ao that referenced this pull request on Apr 9, 2026
This was referenced Apr 14, 2026
haotongzou added a commit to haotongzou/ao that referenced this pull request on Apr 24, 2026
Restore scale_dtype=torch.float32 and quant_min=-127 in the v2 int8 quantization path to match the old behavior that was lost in PR pytorch#4151.
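The restored `quant_min=-127` matters because it keeps the int8 grid symmetric: with the full range down to -128, the same scale cannot map +max and -max to the grid endpoints. A minimal pure-Python sketch of the convention (illustrative only, not torchao's actual code):

```python
def symmetric_int8_quantize(values, quant_min=-127, quant_max=127):
    """Symmetric int8 quantization over the restricted range [-127, 127].

    Clamping at -127 instead of -128 makes the grid symmetric around zero:
    one scale maps +max_abs to +127 and -max_abs to -127, and 0.0 maps
    exactly to 0.  (torchao keeps this scale in float32; plain Python
    floats are float64, so this is only a sketch of the arithmetic.)
    """
    max_abs = max(abs(v) for v in values)
    scale = max_abs / quant_max if max_abs else 1.0
    q = [min(quant_max, max(quant_min, round(v / scale))) for v in values]
    return q, scale
```

Under the full-range alternative (quant_min=-128), the negative endpoint would need a slightly different scale than the positive one, introducing a small asymmetric bias that symmetric schemes avoid.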
Stack from ghstack (oldest at bottom):