[Store] Python store API supports batch operation by xinranwang17 · Pull Request #511 · kvcache-ai/Mooncake

xinranwang17 · 2025-06-17T13:46:23Z

This PR introduces batch APIs (put_batch, get_batch, is_batch_exist) in the Python module to accelerate data transfer. These APIs are concurrently utilized in the Mooncake store implementation as the L3 cache in SGLang.
Related PR in sglang

1. expose batch operation via python api 2. add BatchIsExist support

james0zan · 2025-06-18T02:41:01Z

Since the PR in SGLang requires further refactoring, we can proceed with reviewing and merging this PR in Mooncake as a preparatory step.

zhaoyongke · 2025-06-18T13:30:05Z

Since the PR in SGLang requires further refactoring, we can proceed with reviewing and merging this PR in Mooncake as a preparatory step.

Thx!

xiaguan · 2025-06-20T02:28:19Z

+     * @param keys Key to check
+     * @return Map of keys to booleans
+     */
+    std::unordered_map<std::string, bool> BatchIsExist(


I’d prefer to return `vector that matches up one-to-one with the keys in the input parameters.

According to SGLang integration, is it better to return <key, is_exist> pair so that the existed key can be easily appended instead of addressing them by index again.

I was looking through the sglang code and noticed this part:

mooncake_exist_keys = self.mooncake_l3_kv_pool.is_batch_exist( fragment_keys ) non_exist_keys = [] non_exist_value = [] for i in range(len(fragment_keys)): if not mooncake_exist_keys[fragment_keys[i]]:

I think we could simplify it to:

mooncake_exist_keys = self.mooncake_l3_kv_pool.is_batch_exist( fragment_keys ) non_exist_keys = [] non_exist_value = [] for i in range(len(fragment_keys)): if mooncake_exist_keys[i] == 1:

This way we're using direct indexing instead of querying the hash map, which should be slightly more efficient.

LMCache needs same interface, so I’ve submitted a separate PR for batch exist: #542.

Appreciate it if you could take a look. Thanks!

great work! Since array is more efficient, let's use array as the return value of batch exist interface.

xiaguan · 2025-06-20T02:28:42Z

+     * @param keys Keys to check
+     * @return existence map, true if exists, false if not
+     */
+    std::unordered_map<std::string, ErrorCode> BatchIsExist(


xiaguan · 2025-06-20T02:31:33Z

+        test_client_->BatchIsExist(keys);
+    end = std::chrono::high_resolution_clock::now();
+    LOG(INFO) << "Time taken for BatchIsExist: "
+              << std::chrono::duration_cast<std::chrono::microseconds>(end -


Could you share a quick performance result here?

xinranwang17 · 2025-06-24T06:50:04Z

Since we have had an agreement on using array as the return value of batch exist, this PR will be accept. I'd like to continue contribute other implementation, such as batch put/get interface, in a separate PR.

jeremyzhang866 · 2025-06-25T03:29:31Z

hi @xinranwang17 .When can this pr be merge. thanks

jeremyzhang866 · 2025-06-25T03:30:44Z

hi @xiaguan When does lmcache have such a batch interface. thanks

xinranwang17 · 2025-06-27T09:10:15Z

hi @xinranwang17 .When can this pr be merge. thanks

Part of this PR has been merged here: #542
I have submit another PR(#556) consisting of the rest part.

xiaguan · 2025-06-30T02:25:06Z

hi @xiaguan When does lmcache have such a batch interface. thanks

After this LMCache/LMCache#924 got merged, I will submit mooncake's impl

jeremyzhang866 · 2025-06-30T13:15:39Z

hi @xiaguan When does lmcache have such a batch interface. thanks

After this LMCache/LMCache#924 got merged, I will submit mooncake's impl

could you creat a PR about this so that i can preview. thanks.

xiaguan · 2025-07-01T03:47:23Z

hi @xiaguan When does lmcache have such a batch interface. thanks

After this LMCache/LMCache#924 got merged, I will submit mooncake's impl

could you creat a PR about this so that i can preview. thanks.

LMCache/LMCache#934,

xiaguan · 2025-07-01T03:47:42Z

since #556 is merged, close this pr

Python store API supports batch operation

4286794

1. expose batch operation via python api 2. add BatchIsExist support

xinranwang17 changed the title ~~Python store API supports batch operation~~ [Store] Python store API supports batch operation Jun 17, 2025

Update store_py.cpp

2a4e266

stmatengss requested a review from maobaolong June 17, 2025 16:27

huangtingwei9988 mentioned this pull request Jun 18, 2025

Support l3 cache (mooncake store) for hiradix cache sgl-project/sglang#7211

Merged

10 tasks

xiaguan reviewed Jun 20, 2025

View reviewed changes

xinranwang17 mentioned this pull request Jun 24, 2025

feat(store): add batch exist support for master #542

Merged

xiaguan mentioned this pull request Jun 25, 2025

feat(store): add zero copy batch put and get for python binding #551

Merged

xinranwang17 mentioned this pull request Jun 25, 2025

support batch put/get api in python module #556

Merged

xiaguan closed this Jul 1, 2025

Conversation

xinranwang17 commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

james0zan commented Jun 18, 2025

Uh oh!

zhaoyongke commented Jun 18, 2025

Uh oh!

xiaguan Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

xinranwang17 Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

xiaguan Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

xiaguan Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

xinranwang17 Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

xiaguan Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

xiaguan Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

xinranwang17 commented Jun 24, 2025

Uh oh!

jeremyzhang866 commented Jun 25, 2025

Uh oh!

jeremyzhang866 commented Jun 25, 2025

Uh oh!

xinranwang17 commented Jun 27, 2025

Uh oh!

xiaguan commented Jun 30, 2025

Uh oh!

jeremyzhang866 commented Jun 30, 2025

Uh oh!

xiaguan commented Jul 1, 2025

Uh oh!

xiaguan commented Jul 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

xinranwang17 commented Jun 17, 2025 •

edited

Loading