
[Chore][Revert] MP adapter signature shim from #3100 (#3111)

Merged

ApostaC merged 1 commit into LMCache:dev from sammshen:revert-3100-mp-adapter-only on Apr 23, 2026

Conversation

@sammshen
Contributor

@sammshen sammshen commented Apr 22, 2026

PR #3100 bundled two unrelated changes: the CI sitecustomize/SCM fix and a backward-compat shim on LMCacheMPScheduler/WorkerAdapter that accepted the old positional (world_size, kv_rank, vllm_block_size) call form used by the vllm-bundled lmcache_mp_connector.

This reverts only the adapter shim (restores the pre-#3100 signature requiring parallel_strategy). The CI sitecustomize and SETUPTOOLS_SCM_PRETEND_VERSION changes from #3100 are kept.

Callers on an un-updated vllm that still pass the old positional args will break until the vllm side is updated to pass parallel_strategy.

What this PR does / why we need it:

Special notes for your reviewers:

If applicable:

  • this PR contains user facing changes - docs added
  • this PR contains unit tests

Note

Medium Risk
Breaking change to LMCacheMPSchedulerAdapter/LMCacheMPWorkerAdapter constructor signatures; older vLLM callers using positional (world_size, kv_rank, vllm_block_size) will fail until updated.

Overview
Restores the pre-#3100 API for vLLM multiprocess adapters by removing the backward-compat constructor shim on LMCacheMPSchedulerAdapter and LMCacheMPWorkerAdapter.

Both constructors now require vllm_block_size and a non-optional parallel_strategy, and no longer accept/derive values from world_size, kv_rank, or tp_size (including dropping related typing/imports).
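To illustrate the reverted call convention, here is a minimal sketch. The `ParallelStrategy` fields and the adapter body are simplified stand-ins (the real lmcache classes also take `server_url`/`context`/timeout handling as described in this PR), not the actual implementation:

```python
from dataclasses import dataclass

@dataclass
class ParallelStrategy:
    """Simplified stand-in for lmcache's ParallelStrategy; the real class
    also carries use_mla and other fields."""
    kv_world_size: int = 1
    kv_worker_id: int = 0
    use_mla: bool = False

class LMCacheMPWorkerAdapter:
    """Sketch of the reverted constructor shape: vllm_block_size and
    parallel_strategy are required, and world_size/kv_rank/tp_size are gone."""
    def __init__(self, server_url, context, model_name,
                 vllm_block_size: int, parallel_strategy: ParallelStrategy):
        self.vllm_block_size = vllm_block_size
        self.parallel_strategy = parallel_strategy

# New-style call: the caller constructs ParallelStrategy itself instead of
# passing world_size/kv_rank positionally.
adapter = LMCacheMPWorkerAdapter(
    "tcp://localhost:5555", None, "llama-3",
    vllm_block_size=16,
    parallel_strategy=ParallelStrategy(kv_world_size=2, kv_worker_id=0),
)
```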

Reviewed by Cursor Bugbot for commit 18fbeab.

Collaborator

@deng451e deng451e left a comment


LGTM


@cursor cursor Bot left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.


```diff
-    tp_size: int = 1,
-    parallel_strategy: Optional[ParallelStrategy] = None,
+    vllm_block_size: int,
+    parallel_strategy: ParallelStrategy,
```

In-repo connector still uses old adapter call convention

High Severity

The signature of LMCacheMPSchedulerAdapter and LMCacheMPWorkerAdapter was reverted to require (vllm_block_size, parallel_strategy), but the in-repo caller lmcache_mp_connector_0180.py (create_scheduler_adapter / create_worker_adapter) still passes (world_size, kv_rank, block_size) positionally. This causes kv_rank (an int) to land on parallel_strategy (expects ParallelStrategy), and block_size to collide with the keyword mq_timeout, producing a TypeError at runtime. The connector file needs to be updated to construct a ParallelStrategy and pass the new signature.
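The failure mode can be reproduced with a self-contained sketch. Note this stand-in raises the `TypeError` via an explicit `isinstance` check for clarity; in the real adapter the error would surface when the int is used as a `ParallelStrategy`:

```python
from dataclasses import dataclass

@dataclass
class ParallelStrategy:
    # Hypothetical stand-in; field names follow the docstring quoted in this PR
    kv_world_size: int = 1
    kv_worker_id: int = 0

DEFAULT_MQ_TIMEOUT = 1.0

class LMCacheMPSchedulerAdapter:
    # Sketch of the reverted signature
    def __init__(self, server_url, context, model_name,
                 vllm_block_size: int,
                 parallel_strategy: ParallelStrategy,
                 mq_timeout: float = DEFAULT_MQ_TIMEOUT):
        if not isinstance(parallel_strategy, ParallelStrategy):
            raise TypeError(
                f"parallel_strategy must be ParallelStrategy, "
                f"got {type(parallel_strategy).__name__}"
            )

# Old positional call (world_size=2, kv_rank=0, block_size=16): world_size
# lands on vllm_block_size, kv_rank lands on parallel_strategy, and
# block_size lands on mq_timeout -- the int triggers the TypeError.
try:
    LMCacheMPSchedulerAdapter("tcp://localhost:5555", None, "llama", 2, 0, 16)
except TypeError as exc:
    print(f"TypeError: {exc}")
```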


Contributor

@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request refactors the LMCacheMPSchedulerAdapter and LMCacheMPWorkerAdapter classes by removing legacy parameters and making vllm_block_size and parallel_strategy mandatory. The review identifies that these changes break existing internal integrations in lmcache_mp_connector_0180.py and that the LMCacheMPWorkerAdapter constructor lacks a required docstring per the project's style guide.

Comment on lines 213 to 222

```diff
 def __init__(
     self,
     server_url: str,
     context: zmq.Context,
     model_name: str,
-    world_size: int = 1,
-    kv_rank: int = 0,
-    vllm_block_size: int = 16,
-    tp_size: int = 1,
-    parallel_strategy: Optional[ParallelStrategy] = None,
+    vllm_block_size: int,
+    parallel_strategy: ParallelStrategy,
     mq_timeout: float = DEFAULT_MQ_TIMEOUT,
     heartbeat_interval: float = DEFAULT_HEARTBEAT_INTERVAL,
 ):
```
Contributor


High severity

This change breaks the internal integration code in lmcache/integration/vllm/lmcache_mp_connector_0180.py. The create_scheduler_adapter function (lines 123-133) still passes arguments using the old signature, which will now cause a TypeError because it attempts to pass an integer (kv_rank) to the parallel_strategy parameter. Since this connector is part of the repository, it should be updated in this PR to maintain a working state.
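One way the connector side could be brought in line, sketched with a hypothetical stand-in `ParallelStrategy` (the actual adapter construction is elided, since this is not the real `create_scheduler_adapter` implementation): translate the legacy `(world_size, kv_rank, block_size)` inputs into the new required arguments before constructing the adapter.

```python
from dataclasses import dataclass

@dataclass
class ParallelStrategy:
    # Hypothetical stand-in for the real lmcache class
    kv_world_size: int = 1
    kv_worker_id: int = 0

def create_scheduler_adapter(server_url, context, model_name,
                             world_size, kv_rank, block_size):
    # Build a ParallelStrategy from the connector's legacy inputs,
    # then pass it under the new required signature, e.g.:
    #   LMCacheMPSchedulerAdapter(server_url, context, model_name,
    #                             vllm_block_size=block_size,
    #                             parallel_strategy=strategy)
    strategy = ParallelStrategy(kv_world_size=world_size, kv_worker_id=kv_rank)
    return strategy  # stand-in return for this sketch

strategy = create_scheduler_adapter("tcp://localhost:5555", None, "m", 4, 1, 16)
```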

Comment on lines 555 to 564

```diff
 def __init__(
     self,
     server_url: str,
     context: zmq.Context,
     model_name: str,
-    world_size: int = 1,
-    kv_rank: int = 0,
-    vllm_block_size: int = 16,
-    tp_size: int = 1,
-    parallel_strategy: Optional[ParallelStrategy] = None,
+    vllm_block_size: int,
+    parallel_strategy: ParallelStrategy,
     mq_timeout: float = DEFAULT_MQ_TIMEOUT,
     heartbeat_interval: float = DEFAULT_HEARTBEAT_INTERVAL,
 ):
```
Contributor


High severity

This change breaks create_worker_adapter in lmcache/integration/vllm/lmcache_mp_connector_0180.py (lines 148-157) because the positional arguments no longer match the new signature. Additionally, per the repository style guide (line 26), this public method should include a docstring describing its arguments.

```python
    def __init__(
        self,
        server_url: str,
        context: zmq.Context,
        model_name: str,
        vllm_block_size: int,
        parallel_strategy: ParallelStrategy,
        mq_timeout: float = DEFAULT_MQ_TIMEOUT,
        heartbeat_interval: float = DEFAULT_HEARTBEAT_INTERVAL,
    ):
        """
        Args:
            server_url: The server URL for the LMCache message queue
            context: The ZMQ context
            model_name: The model name used for LMCache keys
            vllm_block_size: The block size used in vLLM
            parallel_strategy:
                The parallel strategy, which includes use_mla,
                kv_world_size, kv_worker_id and so on
            mq_timeout: Timeout in seconds for message queue requests.
            heartbeat_interval: Interval in seconds between heartbeat pings.
        """
```

References
  1. All new public functions have docstrings (what, args, return, exceptions)

@sammshen sammshen added the full Run comprehensive tests on this PR label Apr 22, 2026
Contributor

@ApostaC ApostaC left a comment


LGTM!

@ApostaC ApostaC enabled auto-merge (squash) April 22, 2026 21:11
@ApostaC ApostaC merged commit afd5c4b into LMCache:dev Apr 23, 2026
39 of 41 checks passed
