Conversation
Signed-off-by: YaoJiayi <120040070@link.cuhk.edu.cn>
Code Review
This pull request implements a lazy eviction mechanism for the `BlendTokenRangeMatcher` by adding a `remove_chunks` method and updating the lookup logic to identify and clear stale entries. Review feedback highlighted a critical scalability issue: because `compact_id`s are never reused, the internal list grows without bound (a memory leak) and eventually exceeds the fixed table size, causing an `IndexError`. Additionally, the implementation currently fails to handle duplicate `token_hash` registrations, which could leave orphaned entries in the fingerprint table and corrupt data during matching.
slot = int(chunk_hashes[i]) & int(self._mask)
self._chunk_token_hash.append(th)
self._token_hash_to_start[th] = i * self.chunk_size
self._compact_id_to_slot[cid] = slot
There is a critical issue here that will lead to an `IndexError` and a crash.
The `compact_id`s are generated based on the length of `self._chunk_token_hash`. The `remove_chunks` method evicts entries by setting them to `None` in this list, but it never reclaims the space or allows `compact_id`s to be reused.
As a result, `len(self._chunk_token_hash)` grows without bound. Since `self._compact_id_to_slot` has a fixed size of `_TABLE_SIZE`, an `IndexError` is inevitable here once a `cid` is generated that is `>= _TABLE_SIZE`. This also constitutes a memory leak, as the list fills with `None` values that are never removed.
To fix this, you should implement a mechanism to reuse `compact_id`s. A common pattern is to maintain a free list of evicted IDs: when registering new chunks, pull from the free list first, and only grow `_chunk_token_hash` to mint a new ID when the free list is empty.
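For illustration, here is a minimal sketch of that free-list pattern. It reuses the field names from this diff, but `_free_compact_ids` and `_alloc_compact_id` are hypothetical additions, not part of the current implementation:

```python
from typing import Dict, List, Optional


class _FreeListSketch:
    """Sketch only: recycle evicted compact_ids so that
    len(_chunk_token_hash) stays bounded by the number of live chunks."""

    def __init__(self) -> None:
        self._chunk_token_hash: List[Optional[bytes]] = []
        self._token_hash_to_start: Dict[bytes, int] = {}
        self._token_hash_to_compact_id: Dict[bytes, int] = {}
        self._free_compact_ids: List[int] = []  # evicted ids awaiting reuse

    def _alloc_compact_id(self, th: bytes) -> int:
        """Prefer a recycled id; only grow the list when none is free."""
        if self._free_compact_ids:
            cid = self._free_compact_ids.pop()
            self._chunk_token_hash[cid] = th  # overwrite the stale None slot
        else:
            cid = len(self._chunk_token_hash)
            self._chunk_token_hash.append(th)
        return cid

    def remove_chunks(self, token_hashes: List[bytes]) -> None:
        """Lazy eviction that returns compact_ids to the free list."""
        for th in token_hashes:
            cid = self._token_hash_to_compact_id.pop(th, None)
            if cid is None:
                continue  # never registered, or already evicted
            self._chunk_token_hash[cid] = None  # mark the slot stale
            self._free_compact_ids.append(cid)  # id becomes reusable
            self._token_hash_to_start.pop(th, None)
```

Under this scheme a `cid` never exceeds the peak number of simultaneously live chunks, so the fixed-size `_compact_id_to_slot` table remains safe as long as that peak stays below `_TABLE_SIZE`.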
for i in range(n):
    th = token_hashes[i]
    cid = int(compact_ids[i])
    slot = int(chunk_hashes[i]) & int(self._mask)
    self._chunk_token_hash.append(th)
    self._token_hash_to_start[th] = i * self.chunk_size
    self._compact_id_to_slot[cid] = slot
    self._token_hash_to_compact_id[th] = cid
The current implementation does not handle the case where the same `token_hash` is registered for multiple different chunks. This can lead to incorrect behavior and orphaned entries in the matcher.
If a `token_hash` is reused, the reverse-lookup maps (`_token_hash_to_start` and `_token_hash_to_compact_id`) are overwritten with the details of the latest registration. This can cause several problems:

- `remove_chunks` will only be able to evict the most recently registered chunk for a given hash, leaving earlier chunks with the same hash as orphans in the table.
- `match_sub_sequence` could return incorrect `old_st` positions for these orphaned chunks if they are matched, leading to data corruption downstream.

To ensure correctness, you should enforce that all registered `token_hash` values are unique. A straightforward way to enforce this is to check for existing hashes before registration and raise an error. This makes the requirement on the caller explicit.
# Check for duplicates before modifying any state, so a failed call
# leaves the matcher unchanged. Also guard against repeats within the
# incoming batch itself, which the state check alone would miss.
seen = set()
for th in token_hashes:
    if th in self._token_hash_to_compact_id or th in seen:
        raise ValueError(f"Attempted to register duplicate token hash: {th.hex()}")
    seen.add(th)

# Persist compact_id → token_hash, token_hash → start, and reverse maps
for i in range(n):
    th = token_hashes[i]
    cid = int(compact_ids[i])
    slot = int(chunk_hashes[i]) & int(self._mask)
    self._chunk_token_hash.append(th)
    self._token_hash_to_start[th] = i * self.chunk_size
    self._compact_id_to_slot[cid] = slot
    self._token_hash_to_compact_id[th] = cid
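Note that performing the duplicate check before any state is mutated keeps registration all-or-nothing: a raised `ValueError` leaves the matcher in a consistent state. Callers that intentionally re-register a hash would then need to call `remove_chunks` for it first, which also composes cleanly with the free-list fix suggested above.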
add eviction for cb
Signed-off-by: YaoJiayi <120040070@link.cuhk.edu.cn>