Calculate vmem limit dynamically in the quantized matmul kernel. by vanbasten23 · Pull Request #9470 · pytorch/xla

vanbasten23 · 2025-07-10T20:08:14Z

Test plan:

python pytorch/xla/test/test_pallas.py -k test_quantized_matmul_int8
pytest pytorch/xla/test/test_quantized_matmul_pallas_kernel.py -s

yaochengji

LGTM, thanks!

vanbasten23 · 2025-07-10T22:51:10Z

Thanks for the review!

vanbasten23 added 2 commits July 10, 2025 20:06

Set vmem_limit according to block sizes.

f30367a

linter

66d6100

vanbasten23 requested a review from yaochengji July 10, 2025 20:09

vanbasten23 marked this pull request as ready for review July 10, 2025 20:09

yaochengji approved these changes Jul 10, 2025

yaochengji reviewed Jul 10, 2025

Comment thread torch_xla/experimental/pallas_kernels/quantized_matmul_kernel.py Outdated

make vmem_limit account for vreg spill etc.

2abb856
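
For readers who want a feel for what "set vmem_limit according to block sizes" with headroom for vreg spills might look like, here is a minimal sketch. It is not the code from this PR: the function name `estimate_vmem_limit_bytes`, the default dtype sizes, the double-buffering count, and the `headroom` factor are all assumptions chosen for illustration.

```python
def estimate_vmem_limit_bytes(batch_block, out_block, in_block,
                              x_itemsize=2, w_itemsize=1, out_itemsize=2,
                              acc_itemsize=4, num_buffers=2, headroom=2.0):
    """Hypothetical estimate of a VMEM budget from matmul block sizes.

    Assumptions (not taken from the PR): bf16 activations and output,
    int8 weights, one float32 scale per output channel, an int32
    accumulator, and double-buffered inputs and outputs.
    """
    x_bytes = batch_block * in_block * x_itemsize      # activation tile
    w_bytes = out_block * in_block * w_itemsize        # quantized weight tile
    scale_bytes = out_block * 4                        # float32 per-channel scales
    out_bytes = batch_block * out_block * out_itemsize # output tile

    # Pipelined operands are assumed to be held num_buffers times.
    pipelined = num_buffers * (x_bytes + w_bytes + scale_bytes + out_bytes)

    # Accumulator scratch is assumed to be single-buffered.
    acc_bytes = batch_block * out_block * acc_itemsize

    # Extra headroom for vreg spills and other compiler temporaries.
    return int((pipelined + acc_bytes) * headroom)


limit = estimate_vmem_limit_bytes(batch_block=128, out_block=256, in_block=512)
```

A value computed this way would typically be handed to the kernel through the Pallas TPU compiler parameter `vmem_limit_bytes`; the exact accounting the merged kernel uses may differ.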

vanbasten23 mentioned this pull request Jul 10, 2025

Use w8a8 quantized matmul Pallas kernel vllm-project/vllm#19170

Merged

3 tasks

vanbasten23 merged commit 83dc9da into master Jul 10, 2025
23 of 24 checks passed

vanbasten23 merged 3 commits into master from xiowei/calculate_vmem_limit_dynamically

2 participants
