hunyuan_vl / gemma3n: drop dead assignments in cache-offset extraction by Blaizzy · Pull Request #1056 · Blaizzy/mlx-vlm

Blaizzy · 2026-04-24T00:45:45Z

Summary

Small follow-up to #1055 — two post-merge clean-ups spotted while re-reading the changes:

hunyuan_vl/language.py: both branches of the _idx check assigned offset = cache.offset, but only offset_scalar is read downstream (for the xdrope_section prefill-vs-decode branch). Dropped the dead assignment, and hoisted the offset_scalar = 0 default above the if cache is not None so the no-cache else disappears.
gemma3n/language.py: raw_offset was computed via a generator expression before the _idx fast path ran, even though the fast path doesn't use it. Moved the lookup into the fallback branch so the fast path does zero extra work.

No functional change; same outputs on the same inputs.

Test plan

from mlx_vlm.models.hunyuan_vl import language / from mlx_vlm.models.gemma3n import language — imports clean.
Re-run the decode smoke-test on mlx-community/gemma-3n-E2B-it-8bit + tencent/HunyuanOCR (CLI + server single + server concurrent).

🤖 Generated with Claude Code

hunyuan_vl was setting ``offset = cache.offset`` in both branches even though only ``offset_scalar`` is read downstream — that variable is never used again. Hoist ``offset_scalar = 0`` above the ``if cache`` branch and drop the unused ``offset`` assignment. gemma3n was eagerly computing ``raw_offset`` via a generator expression before the ``_idx`` fast path even ran. Move that lookup into the fallback branch so the fast path does zero extra work. No functional change. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Merge changes from upstream: - Blaizzy#1056: hunyuan_vl/gemma3n cache-offset optimization - Blaizzy#1053: Fix DFlash speculative decoding (GPU hang, performance) - Blaizzy#1050: Thread-local generation stream (port mlx-lm#1090) - Blaizzy#1055: Close batch_generate/server decode gap + VLM fixes Conflict resolution: - requirements.txt: Mixed approach - mlx>=0.31.2 with transformers<5.4.0 to maintain omlx compatibility while accepting mlx update Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Blaizzy and others added 3 commits April 24, 2026 02:44

format

638dfc5

format

9943ed5

Blaizzy merged commit e41cd25 into main Apr 24, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

hunyuan_vl / gemma3n: drop dead assignments in cache-offset extraction#1056

hunyuan_vl / gemma3n: drop dead assignments in cache-offset extraction#1056
Blaizzy merged 3 commits into
mainfrom
pc/tidy-cache-offset

Blaizzy commented Apr 24, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

Blaizzy commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Blaizzy commented Apr 24, 2026 •

edited

Loading