[Core][MM] Cleanup `MultiModalCache` by lgeiger · Pull Request #25006 · vllm-project/vllm

lgeiger · 2025-09-17T00:09:05Z

Purpose

Remove unused debug argument from MultiModalCache.get_leaf_size
Simplify isinstance checks in MultiModalCache.get_leaf_size
Simplify MultiModalCache.get_item_size

Test Plan

CI

- Remove unused debug argument from `get_leaf_size` - simplify isinstance checks - simplify `get_item_size` Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

gemini-code-assist

Code Review

This pull request introduces several cleanups to the MultiModalCache class in vllm/multimodal/cache.py. The changes include removing an unused debug argument from MultiModalCache.get_leaf_size, simplifying multiple isinstance checks into a single check with a tuple of types, and replacing lambda functions in MultiModalCache.get_item_size with operator.add and a direct method reference. These refactorings improve code readability and maintainability. The changes are correct and well-implemented.

…litPR into model_register * 'model_register' of https://github.com/dsxsteven/vllm_splitPR: (138 commits) Retrieve `sliding_window` from text config in Gemma3 MM (vllm-project#25085) [Docs] Fix API Reference (vllm-project#25140) [Kernel] Better inf handling for grouped topk cu (vllm-project#24886) [CLI] Use streaming in CLI chat and completion commands (vllm-project#23769) [benchmark] add peak throughput metrics and plot (vllm-project#23867) [Spec Decode] Efficient padded speculation (vllm-project#24539) [V0 Deprecation] Remove more V0 tests (vllm-project#25117) [EPLB] Add EPLB support for hunyuan_v1 (vllm-project#23078) [XPU] Whisper model support on XPU Platform (vllm-project#25123) Mark prompt logprobs as incompatible with prompt embeds at API level (vllm-project#25077) [Model] enable data parallel for InternVL vision encoder (vllm-project#23909) [Kernels] Overlap shared experts with combine instead of dispatch (vllm-project#24254) [Bugfix][Qwen3-Next] add prefixes to shared_expert in qwen3-next and mlp in qwen2moe to successfully load ignored params in quantized models (vllm-project#24960) [Core][MM] Cleanup `MultiModalCache` (vllm-project#25006) [Docs] Clean up the contributing README (vllm-project#25099) [MM Encoder] Apply DP ViT for Qwen3-VL model series (vllm-project#24955) [Kernels] Enable DeepGEMM by default (vllm-project#24462) [V0 Deprecation] Skip PP test (vllm-project#25128) [V0 Deprecation] Remove misc V0 tests (vllm-project#25118) [V0 Deprecation] Remove V0 Tracing & Metrics tests (vllm-project#25115) ...

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com> Signed-off-by: charlifu <charlifu@amd.com>

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

[Core][MM] Cleanup MultiModalCache

a831ef0

- Remove unused debug argument from `get_leaf_size` - simplify isinstance checks - simplify `get_item_size` Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

lgeiger requested review from DarkLight1337, NickLucche and ywang96 as code owners September 17, 2025 00:09

mergify bot added the multi-modality Related to multi-modality (#4194) label Sep 17, 2025

gemini-code-assist bot reviewed Sep 17, 2025

View reviewed changes

DarkLight1337 approved these changes Sep 17, 2025

View reviewed changes

DarkLight1337 enabled auto-merge (squash) September 17, 2025 03:27

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 17, 2025

Merge branch 'main' into mm-cache

3ae71c9

vllm-bot merged commit b982196 into vllm-project:main Sep 18, 2025
37 of 40 checks passed

lgeiger deleted the mm-cache branch September 18, 2025 06:39

debroy-rh pushed a commit to debroy-rh/vllm that referenced this pull request Sep 19, 2025

[Core][MM] Cleanup MultiModalCache (vllm-project#25006)

5f2ad2a

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[Core][MM] Cleanup MultiModalCache (vllm-project#25006)

1f363fa

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025

[Core][MM] Cleanup MultiModalCache (vllm-project#25006)

91780fa

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com> Signed-off-by: charlifu <charlifu@amd.com>

choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025

[Core][MM] Cleanup MultiModalCache (vllm-project#25006)

c59ade5

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Core][MM] Cleanup `MultiModalCache`#25006

[Core][MM] Cleanup `MultiModalCache`#25006
vllm-bot merged 2 commits intovllm-project:mainfrom
lgeiger:mm-cache

lgeiger commented Sep 17, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

lgeiger commented Sep 17, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lgeiger commented Sep 17, 2025 •

edited by github-actions bot

Loading