vLLM block event #2930
Conversation
Code Review
This pull request implements a new observability protocol for reporting vLLM GPU block allocation events to the LMCache server. It includes the necessary protocol definitions, event bus integration, and a logging subscriber to handle these reports. Feedback suggests changing the logging level from info to debug for consistency with the subscriber's design and refining the type hints for the allocation records to improve code clarity.
```python
logger.info(
    "vLLM block allocation: req_id=%s "
    "new_blocks=%d new_tokens=%d "
    "block_ids=%s",
    rec.get("req_id"),
    len(block_ids),
    len(token_ids),
    block_ids[:10],
)
```
The use of logger.info is inconsistent with the rest of the MPServerLoggingSubscriber class, which is explicitly documented to log events at the debug level (see class docstring on line 23 and module description on line 3). For high-frequency events like block allocations in vLLM, debug level is more appropriate to avoid log pollution in production environments.
Suggested change:

```diff
-logger.info(
+logger.debug(
     "vLLM block allocation: req_id=%s "
     "new_blocks=%d new_tokens=%d "
     "block_ids=%s",
     rec.get("req_id"),
     len(block_ids),
     len(token_ids),
     block_ids[:10],
 )
```
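Beyond switching the level, high-frequency paths often guard the call with `Logger.isEnabledFor` so the log arguments are never even built when debug logging is off. A minimal sketch (the helper name and logger name are hypothetical, not the PR's code):

```python
import logging

logger = logging.getLogger("lmcache.mp_server")  # hypothetical logger name


def log_block_allocation(rec: dict) -> None:
    """Debug-log one allocation record (illustrative helper only)."""
    # The guard skips building the log arguments entirely when DEBUG is
    # disabled, which matters on hot paths like per-step block allocations.
    if logger.isEnabledFor(logging.DEBUG):
        block_ids = rec.get("new_block_ids", [])
        token_ids = rec.get("new_token_ids", [])
        logger.debug(
            "vLLM block allocation: req_id=%s new_blocks=%d new_tokens=%d block_ids=%s",
            rec.get("req_id"),
            len(block_ids),
            len(token_ids),
            block_ids[:10],  # truncate so one log line stays bounded
        )
```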
References
- The style guide emphasizes code cleanliness and consistency with existing patterns. (link)
| "storage_manager": sm, | ||
| } | ||
|
|
||
| def report_block_allocations(self, records: list) -> None: |
The type hint for records is imprecise. To maintain consistency with the LMCacheMPSchedulerAdapter implementation and provide better clarity for this public method, the type hint should be more specific.
Suggested change:

```diff
-def report_block_allocations(self, records: list) -> None:
+def report_block_allocations(self, records: list[dict]) -> None:
```
References
- All new functions must have type hints for arguments and return values. Using more specific type hints (e.g., list[dict] instead of list) improves maintainability. (link)
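Going one step further than `list[dict]`, a `TypedDict` can document exactly which keys each record carries. A sketch under the assumption that the record shape matches the keys the subscriber reads (`req_id`, `new_block_ids`, `new_token_ids` — hypothetical class name):

```python
from typing import TypedDict


class AllocationRecord(TypedDict, total=False):
    # Hypothetical shape, inferred from the keys read in the handler.
    req_id: str
    new_block_ids: list[int]
    new_token_ids: list[int]


def total_new_blocks(records: list[AllocationRecord]) -> int:
    """Sum newly allocated blocks across records; the precise hint tells
    callers and type checkers what each element must look like."""
    return sum(len(r.get("new_block_ids", [])) for r in records)
```

Unlike a bare `list`, this lets a type checker flag a misspelled key at the call site instead of at runtime.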
Cursor Bugbot has reviewed your changes and found 2 potential issues.
ApostaC left a comment:
The structure looks good. Please see the implementation-related comments.
Let's also add some unit tests for the new protocol.
| "REPORT_BLOCK_ALLOCATION": ProtocolDefinition( | ||
| payload_classes=[list], | ||
| response_class=None, | ||
| handler_type=HandlerType.SYNC, | ||
| ), |
For protocol definitions, we want a more rigorous definition (e.g., a clear dataclass defined in `custom_types.py`) rather than just a `list`.
For `handler_type`, we should use `HandlerType.BLOCKING`.
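One way to read this suggestion: replace the untyped `list` payload with a dataclass that names its fields. A sketch, assuming the payload carries the same keys the handler reads (`BlockAllocationPayload` is a hypothetical name, and the commented-out protocol entry reuses the PR's `ProtocolDefinition`/`HandlerType` names schematically):

```python
from dataclasses import dataclass, field


@dataclass
class BlockAllocationPayload:
    """Hypothetical typed payload for custom_types.py; field names mirror
    the keys the handler currently pulls out of each dict."""
    req_id: str
    new_block_ids: list[int] = field(default_factory=list)
    new_token_ids: list[int] = field(default_factory=list)


# The protocol entry could then name the dataclass instead of a bare list:
#
# "REPORT_BLOCK_ALLOCATION": ProtocolDefinition(
#     payload_classes=[BlockAllocationPayload],
#     response_class=None,
#     handler_type=HandlerType.BLOCKING,
# ),
```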
```python
for rec in records:
    block_ids = rec.get("new_block_ids", [])
    token_ids = rec.get("new_token_ids", [])
    logger.info(
```
Let's use `logger.debug` here for now.
```python
MP_LOOKUP_PREFETCH_END = "mp.lookup_prefetch.end"

# vLLM block allocation events
VLLM_BLOCK_ALLOCATION = "vllm.block_allocation"
```
Nit: name it `MP_VLLM_BLOCK_ALLOCATION` to follow the naming conventions above.
```python
@dataclass
class BlockAllocationRecord:
```
Maybe "RequestAllocationRecord"?

What this PR does / why we need it:
Special notes for your reviewers:
If applicable:
Note: Medium Risk

Adds a new `REPORT_BLOCK_ALLOCATION` request type to the multiprocess RPC protocol and wires it through the vLLM adapter and MP server, which could impact protocol compatibility and message handling if clients/servers are version-skewed.

Overview

Adds a new fire-and-forget multiprocess RPC, `RequestType.REPORT_BLOCK_ALLOCATION`, to let vLLM report per-request GPU block/token allocation deltas (`BlockAllocationRecord`) to the LMCache server. The server handles this request by publishing a new observability event (`EventType.MP_VLLM_BLOCK_ALLOCATION`) to the `EventBus`, with logging support in `MPServerLoggingSubscriber`. Tests are extended to cover msgpack serialization for `BlockAllocationRecord`, MQ handling for the new request, and EventBus delivery semantics for the new event.

Written by Cursor Bugbot for commit 8f4542f.
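The fire-and-forget flow described above — the server publishes an event to every subscriber and sends no response back — can be illustrated with a minimal synchronous pub/sub sketch. The `EventBus` class and the event-name constant here are illustrative assumptions; the PR's real `EventBus` API is likely richer:

```python
from collections import defaultdict
from typing import Any, Callable


class EventBus:
    """Minimal synchronous pub/sub sketch of the delivery semantics
    described above (not the PR's actual implementation)."""

    def __init__(self) -> None:
        self._subs: dict[str, list[Callable[[Any], None]]] = defaultdict(list)

    def subscribe(self, event_type: str, handler: Callable[[Any], None]) -> None:
        self._subs[event_type].append(handler)

    def publish(self, event_type: str, payload: Any) -> None:
        # Fire-and-forget: no response travels back; every subscriber
        # registered for this event type receives the payload.
        for handler in self._subs[event_type]:
            handler(payload)


# Hypothetical event-name value, following the MP_ prefix suggested in review.
MP_VLLM_BLOCK_ALLOCATION = "mp.vllm.block_allocation"

bus = EventBus()
seen: list[Any] = []
bus.subscribe(MP_VLLM_BLOCK_ALLOCATION, seen.append)
bus.publish(MP_VLLM_BLOCK_ALLOCATION, {"req_id": "r1", "new_block_ids": [0, 1]})
```

A logging subscriber like `MPServerLoggingSubscriber` would simply be one more handler registered for the event type.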
EventType.MP_VLLM_BLOCK_ALLOCATION) to theEventBus, with logging support inMPServerLoggingSubscriber. Tests are extended to cover msgpack serialization forBlockAllocationRecord, MQ handling for the new request, and EventBus delivery semantics for the new event.Written by Cursor Bugbot for commit 8f4542f. This will update automatically on new commits. Configure here.