[Bug Fix] missing index/KV transfer for MTP layer in NSA disaggregation by zRzRzRzRzRzRzR · Pull Request #23539 · sgl-project/sglang

zRzRzRzRzRzRzR · 2026-04-23T06:50:51Z

Motivation

In PD disaggregation with NSA + MTP, only the target model's NSA state buffers are registered for transfer. The draft model's NSATokenToKVPool buffers are never appended to kv_args, so the MTP layer's index/KV state is not sent from prefill to decode, causing wrong speculative decoding results.

Modifications

In DecodePreallocQueue and PrefillBootstrapQueue, when the main pool is NSA, also append draft_token_to_kv_pool.get_state_buf_infos() to kv_args if the draft pool is also NSA.

gemini-code-assist · 2026-04-23T06:50:54Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

ShangmingCai

Nice catch. Thx for the fix.

ShangmingCai · 2026-04-23T07:13:36Z

/tag-and-rerun-ci

ShangmingCai · 2026-04-23T08:00:47Z

/rerun-stage stage-c-test-8-gpu-h20

github-actions · 2026-04-23T08:01:14Z

✅ Triggered stage-c-test-8-gpu-h20 to run independently (skipping dependencies). View workflow run

ShangmingCai · 2026-04-23T11:33:45Z

/rerun-stage stage-c-test-8-gpu-h20

github-actions · 2026-04-23T11:34:12Z

✅ Triggered stage-c-test-8-gpu-h20 to run independently (skipping dependencies). View workflow run

ShangmingCai · 2026-04-24T05:14:47Z

/rerun-stage stage-c-test-8-gpu-h20

github-actions · 2026-04-24T05:15:13Z

✅ Triggered stage-c-test-8-gpu-h20 to run independently (skipping dependencies). View workflow run

kpham-sgl · 2026-04-24T07:39:44Z

+                if self.draft_token_to_kv_pool is not None and isinstance(
+                    self.draft_token_to_kv_pool, NSATokenToKVPool
+                ):


Two small questions here

Do we need draft_token_to_kv_pool to also be NSATokenToKVPool?

Need to check if hasattr(self.draft_token_to_kv_pool,"get_state_buf_infos")

Make sense. Do we need to consider the Non-MTP spec decode cases? Not sure if this is a common use-case. @zRzRzRzRzRzRzR

The isinstance(self.draft_token_to_kv_pool, NSATokenToKVPool) guard makes this PR a no-op for non-NSA draft pools, so non-MTP spec decode cases aren't affected. The fix kicks in only when the draft pool is also NSA, which today is the MTP-on-NSA path. If a future non-MTP spec decode also runs an NSA draft, the same logic applies and would Just Work.

JustinTong0323 · 2026-04-29T09:29:30Z

/rerun-failed-ci

JustinTong0323 · 2026-04-29T18:42:43Z

/rerun-failed-ci

ShangmingCai · 2026-04-30T03:55:34Z

PD-related CI has passed. Let's merge.

…on (sgl-project#23539)

Fix missing index/KV transfer for MTP layer in NSA disaggregation

5f9eb42

zRzRzRzRzRzRzR requested review from ByronHsu, ShangmingCai and hnyls2002 as code owners April 23, 2026 06:50

ShangmingCai approved these changes Apr 23, 2026

View reviewed changes

github-actions Bot added the run-ci label Apr 23, 2026

Merge branch 'main' into glm-pd

59a8c5f

JustinTong0323 assigned Qiaolin-Yu and JustinTong0323 Apr 23, 2026

Merge branch 'main' into glm-pd

ddbe2c0

Merge branch 'main' into glm-pd

35b40f6

kpham-sgl self-assigned this Apr 24, 2026

kpham-sgl reviewed Apr 24, 2026

View reviewed changes

ShangmingCai and others added 4 commits April 24, 2026 19:19

Merge branch 'main' into glm-pd

fcfee1c

Merge branch 'main' into glm-pd

f1323e5

Merge branch 'main' into glm-pd

5ae5fec

Merge branch 'sgl-project:main' into glm-pd

f3a4c01

ShangmingCai merged commit d040333 into sgl-project:main Apr 30, 2026
223 of 254 checks passed

zRzRzRzRzRzRzR deleted the glm-pd branch April 30, 2026 05:25

vguduruTT pushed a commit to vguduruTT/sglang that referenced this pull request May 2, 2026

[Bug Fix] missing index/KV transfer for MTP layer in NSA disaggregati…

6a06dc4

…on (sgl-project#23539)

ShangmingCai mentioned this pull request May 7, 2026

[PD] Abort prefill KV transfer before page reuse #24580

Open

Conversation

zRzRzRzRzRzRzR commented Apr 23, 2026

Motivation

Modifications

Uh oh!

gemini-code-assist Bot commented Apr 23, 2026

Uh oh!

ShangmingCai left a comment

Choose a reason for hiding this comment

Uh oh!

ShangmingCai commented Apr 23, 2026

Uh oh!

ShangmingCai commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

ShangmingCai commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

ShangmingCai commented Apr 24, 2026

Uh oh!

github-actions Bot commented Apr 24, 2026

Uh oh!

kpham-sgl Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

ShangmingCai Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zRzRzRzRzRzRzR Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

JustinTong0323 commented Apr 29, 2026

Uh oh!

JustinTong0323 commented Apr 29, 2026

Uh oh!

ShangmingCai commented Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ShangmingCai Apr 24, 2026 •

edited

Loading