[Bug Fix] Ensure prefill_info_table is populated before honoring disagg_prefill_dp_rank by ByronHsu · Pull Request #22990 · sgl-project/sglang

ByronHsu · 2026-04-16T17:48:15Z

Motivation

If the first request to the PD engine carries disagg_prefill_dp_rank, the request fails with:

Prefill server with bootstrap_addr: {self.bootstrap_addr} is healthy before

However, if the first request does not contain disagg_prefill_dp_rank but a later request does, the subsequent requests work because the first request triggers the prefill info query and caches it.

Root cause

In DecodePreallocQueue._resolve_prefill_dp_rank, the req.disagg_prefill_dp_rank early-return is checked before self.kv_manager.prefill_info_table.get(_bootstrap_addr(req)). When the client explicitly supplies disagg_prefill_dp_rank, we short-circuit and never trigger the slow path that queries and caches the prefill info. The subsequent prefill-server health check then has no cached info to validate against, producing the "healthy before" error.

Modifications

Move prefill_info = self.kv_manager.prefill_info_table.get(_bootstrap_addr(req)) to the top of _resolve_prefill_dp_rank. If the lookup returns None, return None so the request falls through to the slow path (_ensure_prefill_info), which queries and caches the prefill info. Only after prefill info is available do we honor the client-provided req.disagg_prefill_dp_rank.

Reproduction

# Prefill
python3 -m sglang.launch_server --model-path Qwen/Qwen3-30B-A3B \
    --tp 4 --dp 4 --enable-dp-attention --disaggregation-mode prefill

# Decode
python3 -m sglang.launch_server --model-path Qwen/Qwen3-30B-A3B \
    --tp 4 --dp 4 --enable-dp-attention --disaggregation-mode decode \
    --base-gpu-id 4 --port 30010

# Load balancer
python -m sglang_router.launch_router --mini-lb --pd-disaggregation \
    --prefill http://127.0.0.1:30000 --decode http://127.0.0.1:30010 \
    --host 0.0.0.0 --port 8000

Send the very first request with disagg_prefill_dp_rank set — before the fix, this fails with the "healthy before" error; after the fix, it succeeds:

curl -X POST http://127.0.0.1:8000/generate \
    -H 'Content-Type: application/json' \
    -d '{
        "text": "Hello World How are you?",
        "sampling_params": {"max_new_tokens": 128, "temperature": 0.0},
        "stream": false,
        "routed_dp_rank": 0,
        "disagg_prefill_dp_rank": 0
    }'

Checklist

Format your code according to the Code Formatting with Pre-Commit.
Add unit tests as outlined in the Running Unit Tests.
Update documentation / benchmarks / examples as needed.
Ensure all CI checks pass before submission.
For adding new models / features, please use slash commands.

gemini-code-assist · 2026-04-16T17:48:20Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

If the first request to the PD engine carries `disagg_prefill_dp_rank`, `_resolve_prefill_dp_rank` returns it immediately without ever populating `kv_manager.prefill_info_table`. This causes the prefill server health check to fail with "Prefill server with bootstrap_addr: ... is healthy before" because the prefill info was never queried/cached. Move the `prefill_info_table.get(...)` lookup to the top so that the slow path runs (and caches the prefill info) on the first request, even when the client supplies an explicit `disagg_prefill_dp_rank`. Made-with: Cursor

ByronHsu · 2026-04-16T18:07:15Z

                    continue
-                nd = self.device_pool.kv_buffer[layer_id][naive_locs[b, i].long()]
-                kd = self.device_pool.kv_buffer[layer_id][kernel_locs[b, i].long()]
+                naive_data = self.device_pool.kv_buffer[layer_id][


fix lint error on main

ShangmingCai

LGTM

…gg_prefill_dp_rank (sgl-project#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

…gg_prefill_dp_rank (#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

…gg_prefill_dp_rank (sgl-project#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

ByronHsu requested review from ShangmingCai and hnyls2002 as code owners April 16, 2026 17:48

ByronHsu force-pushed the byron/fix-resolve-prefill-dp-rank-order branch from 407822c to 8f024b7 Compare April 16, 2026 18:06

ByronHsu commented Apr 16, 2026

View reviewed changes

ShangmingCai approved these changes Apr 17, 2026

View reviewed changes

ShangmingCai merged commit cf9845f into sgl-project:main Apr 17, 2026
61 of 69 checks passed

whybeyoung pushed a commit to whybeyoung/sglang that referenced this pull request Apr 17, 2026

[Bug Fix] Ensure prefill_info_table is populated before honoring disa…

b88eeac

…gg_prefill_dp_rank (sgl-project#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

ByronHsu added a commit that referenced this pull request Apr 17, 2026

[Bug Fix] Ensure prefill_info_table is populated before honoring disa…

94315af

…gg_prefill_dp_rank (#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

ByronHsu mentioned this pull request Apr 17, 2026

[miles] Pick up two dp rank fixes from main #23101

Merged

jmamou pushed a commit to jmamou/sglang that referenced this pull request Apr 20, 2026

[Bug Fix] Ensure prefill_info_table is populated before honoring disa…

08775db

…gg_prefill_dp_rank (sgl-project#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Apr 22, 2026

[Bug Fix] Ensure prefill_info_table is populated before honoring disa…

9ec1d77

…gg_prefill_dp_rank (sgl-project#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

zhangying098 pushed a commit to zhangying098/sglang that referenced this pull request Apr 23, 2026

[Bug Fix] Ensure prefill_info_table is populated before honoring disa…

c2d8576

…gg_prefill_dp_rank (sgl-project#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

kyx1999 pushed a commit to KMSorSMS/sglang that referenced this pull request Apr 27, 2026

[Bug Fix] Ensure prefill_info_table is populated before honoring disa…

d6f6a69

…gg_prefill_dp_rank (sgl-project#22990) Co-authored-by: Byron Hsu <byron+per@periodiclabs.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug Fix] Ensure prefill_info_table is populated before honoring disagg_prefill_dp_rank#22990

[Bug Fix] Ensure prefill_info_table is populated before honoring disagg_prefill_dp_rank#22990
ShangmingCai merged 1 commit intosgl-project:mainfrom
ByronHsu:byron/fix-resolve-prefill-dp-rank-order

ByronHsu commented Apr 16, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented Apr 16, 2026

Uh oh!

ByronHsu Apr 16, 2026

Uh oh!

ShangmingCai left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ByronHsu commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Root cause

Modifications

Reproduction

Checklist

Uh oh!

gemini-code-assist Bot commented Apr 16, 2026

Uh oh!

ByronHsu Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

ShangmingCai left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ByronHsu commented Apr 16, 2026 •

edited

Loading