[PD] Support decode pp for PD disaggregation#14265
Conversation
Signed-off-by: Shangming Cai <csmthu@gmail.com>
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
|
/tag-and-rerun-ci |
|
/tag-and-rerun-ci |
Signed-off-by: Shangming Cai <csmthu@gmail.com>
Signed-off-by: Shangming Cai <csmthu@gmail.com>
|
LTGM, but I try with this command python -m sglang.launch_server decode and get errors: [2025-12-05 03:44:56 PP0 TP3] Scheduler hit an exception: Traceback (most recent call last): same errors with PP2 DP8 |
Signed-off-by: Shangming Cai <csmthu@gmail.com>
Signed-off-by: Shangming Cai <csmthu@gmail.com>
I also encountered the same problem when using the PP2TP4 decode instance |
|
@maoqiuli @nihao1997 Sorry for causing the misunderstanding, the sceduler loop for disaggregated decode hasn't been merged into main yet. You can check this branch for a preview: openanolis#13 |
|
@ShangmingCai Thank you very much! I will try it out. |
Signed-off-by: Shangming Cai <csmthu@gmail.com>
Signed-off-by: Shangming Cai <csmthu@gmail.com>

Motivation
Modifications
Accuracy Tests
Benchmarking and Profiling
Checklist