Unify the return type of w8a8 matmul between fallback and the actual impl. #9452

Merged
vanbasten23 merged 5 commits into master from xiowei/fix_nonxla_return_type on Jul 10, 2025

Conversation

vanbasten23 (Collaborator) commented Jul 8, 2025

We need to unify the return type between the fallback and the actual implementation. Specifically, in the fallback impl quantized_matmul_int8_non_xla, if we don't specify a dtype, the result defaults to torch.float32. This can cause issues in vLLM.
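For illustration only, here is a minimal sketch of the unification idea (the function name, signature, and transpose convention below are assumptions, not the actual PR diff): the fallback computes in torch.float32, and without an explicit cast that dtype leaks to callers such as vLLM, so the sketch casts the result back to the activation dtype to match the actual kernel's return type.

```python
import torch

def quantized_matmul_int8_fallback(x, w_int8, scale):
    # x: activation (e.g. torch.bfloat16), shape [batch, in_features]
    # w_int8: quantized int8 weight, shape [out_features, in_features]
    # scale: per-output-channel dequantization scale, shape [out_features]
    out = torch.matmul(x.to(torch.float32), w_int8.to(torch.float32).t())
    out = out * scale
    # The fix: return the activation dtype instead of the default
    # torch.float32, so the fallback matches the actual kernel's return type.
    return out.to(x.dtype)

x = torch.randn(4, 16, dtype=torch.bfloat16)
w = torch.randint(-128, 128, (8, 16), dtype=torch.int8)
scale = torch.rand(8)
assert quantized_matmul_int8_fallback(x, w, scale).dtype == torch.bfloat16
```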

Test plan:

  • python pytorch/xla/test/test_pallas.py -k test_quantized_matmul_int8
  • pytest pytorch/xla/test/test_quantized_matmul_pallas_kernel.py -s

vanbasten23 requested a review from yaochengji on July 8, 2025 22:10
vanbasten23 marked this pull request as ready for review on July 8, 2025 22:10
yaochengji (Collaborator) left a comment

LGTM, thanks for fixing this!

vanbasten23 force-pushed the xiowei/fix_nonxla_return_type branch from e527723 to 9c2c0b2 on July 9, 2025 20:07
vanbasten23 force-pushed the xiowei/fix_nonxla_return_type branch from 8f36349 to aac2352 on July 10, 2025 00:01
vanbasten23 (Collaborator, Author)

Thanks for the review!

vanbasten23 merged commit 52569ec into master on Jul 10, 2025
23 of 24 checks passed