Fix FP16 overflow for vision tensors (Fixes #1678) by dodams258 · Pull Request #1682 · jundot/omlx

dodams258 · 2026-06-05T14:09:48Z

Summary
oQ: vision and audio tensors are kept in float32 instead of float16

Test
pytest tests/test_oq.py -- 206 passed

jundot · 2026-06-05T16:54:15Z

Thanks for the patch. The root cause looks right: protected vision/audio tensors should not be downcast to FP16 in the float16 oQ path.

One gap is that this patch only changes the _should_quantize_tensor() == false fallback. The main 2D vision/audio weight tensors still go through _should_quantize_tensor() == true, then _get_predicate_bits() returns None, so they hit the existing bits is None fallback and are still cast to target_dtype.

I will handle the remaining part in a follow-up commit: apply the same protected pass-through dtype policy to both fallback paths and add a regression test for a 2D vision/audio tensor with dtype="float16".

Fix FP16 overflow for vision tensors (Fixes jundot#1678)

ee620a9

jundot merged commit 1ae8919 into jundot:main Jun 5, 2026

dodams258 deleted the feature/fix-oq-vision-fp16-overflow branch June 5, 2026 16:58

alexgranford mentioned this pull request Jun 6, 2026

oQ: _TrackedTensor missing 'swapaxes' on VLM QAT unquantized checkpoint #1706

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix FP16 overflow for vision tensors (Fixes #1678)#1682

Fix FP16 overflow for vision tensors (Fixes #1678)#1682
jundot merged 1 commit into
jundot:mainfrom
dodams258:feature/fix-oq-vision-fp16-overflow

dodams258 commented Jun 5, 2026 •

edited

Loading

Uh oh!

jundot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dodams258 commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jundot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dodams258 commented Jun 5, 2026 •

edited

Loading