[KVEvent] User request.block_hash for parent block_hash by heheda12345 · Pull Request #30544 · vllm-project/vllm

heheda12345 · 2025-12-12T08:50:10Z

Purpose

The parent block can be a null block in at least two cases:

mamba: the block_table looks like [null_block, null_block, ..., null_block, normal_block], because only one block is needed per decode step.
sliding window + kv cache connector: assume block_size 1 and window size 2, with 3 local tokens hit and 3 external tokens hit. The block table will be [NULL, NULL, NULL, NULL, 4, 5].
In the sliding window case we will do:

# first allocation
allocate_slots(delay_cache_blocks=True):
    save_new_computed_blocks() caches the first 3 blocks
# after kv cache transfer
allocate_slots(delay_cache_blocks=False):
    cache_blocks() caches the first 6 blocks, parent block is block 2 (null_block)

So we extract the parent block hash from request.block_hashes instead of reading it from the parent block, which may be a null_block with no hash.
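To make the sliding window case concrete, here is a minimal sketch of the two lookup strategies. It is a toy model, not vLLM code: `Block`, `NULL_BLOCK`, `parent_hash_from_block`, and `parent_hash_from_request` are hypothetical names, and the string hashes stand in for real block hashes. The only thing taken from the description above is the block table layout and the fact that the first 3 blocks are cached before the second call.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Block:
    """Toy stand-in for a physical KV cache block (hypothetical, not vLLM's class)."""
    block_id: int
    block_hash: Optional[str] = None
    is_null: bool = False


# A single shared placeholder block that never carries a hash.
NULL_BLOCK = Block(block_id=0, is_null=True)


def parent_hash_from_block(blocks, first_new_idx):
    """Read the parent hash off the physical parent block (fails on a null block)."""
    if first_new_idx == 0:
        return None
    parent = blocks[first_new_idx - 1]
    if parent.block_hash is None:
        raise AssertionError("parent block has no block hash")
    return parent.block_hash


def parent_hash_from_request(request_block_hashes, first_new_idx):
    """Read the parent hash from the request's logical hash list instead."""
    if first_new_idx == 0:
        return None
    return request_block_hashes[first_new_idx - 1]


# block_size 1, window size 2, 6 tokens: one logical hash per block.
request_block_hashes = [f"h{i}" for i in range(6)]
# Physical block table from the description: [NULL, NULL, NULL, NULL, 4, 5].
block_table = [NULL_BLOCK] * 4 + [Block(4, "h4"), Block(5, "h5")]
# The first allocation already cached blocks 0..2, so caching resumes at index 3.
first_new_idx = 3

try:
    parent_hash_from_block(block_table, first_new_idx)
    old_lookup_failed = False
except AssertionError:
    old_lookup_failed = True

new_parent_hash = parent_hash_from_request(request_block_hashes, first_new_idx)
```

In this toy setup the block-based lookup hits the null block at index 2 and raises, while the request-based lookup returns the logical hash for index 2 regardless of what the physical block table holds.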

Test Plan

Test Result

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

gemini-code-assist

Code Review

This pull request addresses a potential AssertionError that could occur when determining the parent block hash for KV cache events. The issue arises in scenarios where the parent block is a null_block, which does not have an associated block hash, causing the assertion to fail. The proposed change correctly resolves this by retrieving the parent block hash directly from request.block_hashes. This is the correct approach as request.block_hashes represents the logical sequence of hashes and is the reliable source of truth, independent of the physical block implementation. The fix is clean, direct, and I find no issues with the implementation.

mgoin

Seems reasonable to me. Should we add a unit test if this failed in some case?

ivanium · 2025-12-18T09:11:22Z

> Seems reasonable to me. Should we add a unit test if this failed in some case?

Good idea. I added a test case for null parent block hash. PTAL

…#30544) Signed-off-by: Chen Zhang <zhangch99@outlook.com> Signed-off-by: Yifan Qiao <yifanqiao@berkeley.edu> Co-authored-by: Yifan Qiao <yifanqiao@berkeley.edu>

heheda12345 added 2 commits December 12, 2025 00:34

init

93935aa

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

fix

5073e6d

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

heheda12345 requested review from ApostaC, WoosukKwon, alexm-redhat, njhill, robertgshaw2-redhat and ywang96 as code owners December 12, 2025 08:50

mergify Bot added the v1 label Dec 12, 2025

gemini-code-assist Bot reviewed Dec 12, 2025

heheda12345 mentioned this pull request Dec 12, 2025

[V1] [Hybrid] Lighter Mamba Prefix Caching with standard memory layout #29272

Closed

5 tasks

mgoin approved these changes Dec 14, 2025

heheda12345 mentioned this pull request Dec 18, 2025

[Core][Hybrid allocator + connector] Support hybrid allocator + kv cache connector #30166

Merged

5 tasks

test: add unit test case for null parent block hash

77f6531

Signed-off-by: Yifan Qiao <yifanqiao@berkeley.edu>

ivanium force-pushed the fix_kv_event branch from 5741e19 to 77f6531 Compare December 18, 2025 09:10

heheda12345 enabled auto-merge (squash) December 21, 2025 23:30

github-actions Bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Dec 21, 2025

vllm-bot merged commit 538e830 into vllm-project:main Dec 24, 2025
46 of 47 checks passed

vllm-bot merged 3 commits into vllm-project:main from heheda12345:fix_kv_event

4 participants
