Revert several PRs#14958
Conversation
This reverts commit 70758d4.
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
|
/tag-and-rerun-ci |
|
/tag-and-rerun-ci |
|
|
I had a PR #14937 just merged for fixing this. Not sure the CI failure on B200 is the same root cause or something else is still wrong for B200? |
|
@ZailiWang I've also reverted your pr |
|
/rerun-stage unit-test-backend-4-gpu-b200 |
|
/rerun-stage unit-test-backend-8-gpu-b200 |
|
✅ Triggered Check the Actions tab for progress. |
|
✅ Triggered Check the Actions tab for progress. |
|
/rerun-stage unit-test-backend-4-gpu-b200 |
|
/rerun-stage unit-test-backend-8-gpu-b200 |
|
✅ Triggered Check the Actions tab for progress. |
|
✅ Triggered Check the Actions tab for progress. |
|
/tag-and-rerun-ci |
|
/rerun-stage unit-test-backend-4-gpu-b200 |
|
/rerun-stage unit-test-backend-8-gpu-b200 |
|
✅ Triggered Check the Actions tab for progress. |
|
✅ Triggered Check the Actions tab for progress. |
|
/rerun-stage unit-test-backend-8-gpu-b200 |
|
✅ Triggered Check the Actions tab for progress. |
|
/tag-and-rerun-ci |
1 similar comment
|
/tag-and-rerun-ci |
…n_eagle3_npu * 'main' of https://github.com/sgl-project/sglang: (121 commits) Super tiny add gsp-fast-prepare (sgl-project#14992) Super tiny fix confusing slash_command_handler hint (sgl-project#14976) Super tiny remove unused argument (sgl-project#14966) [registry] Add a strict mode to model registration (sgl-project#14933) Feature/Fix multi lora scheduler blocking issue and evict LoRA None lastly (sgl-project#14795) Tune triton fused moe for the case of glm-4.6-fp8 b200 tp4 (sgl-project#15020) [model-gateway] refactor: unify worker management into modular workflow structure (sgl-project#15010) Update ci permission (sgl-project#15014) Refactor of http and engine entrypoints to allow custom override (sgl-project#14869) Add KV4-capable backend flashmla and update server args (sgl-project#14989) Revert several PRs (sgl-project#14958) Super tiny extract route_typed_request_once (sgl-project#14951) Fix CI by reverting incorrect metric check logic (sgl-project#15004) [model-gateway] refactor: workflow engine cleanup and minor optimization (sgl-project#15001) [model-gateway] fix: handle workflow deadlock and optimize cycle detection (sgl-project#15000) [model-gateway] feat: add DAG parallel execution support and workflow optimization (sgl-project#14999) [model-gateway] refactor: extract workflow engine to src/workflow module (sgl-project#14996) Update CODEOWNERS for multimodal_gen (sgl-project#14995) [diffusion] docker: Tiny fix Docker Hub link in installation documentation (sgl-project#14987) [PD] Add decode PP event loop for PD disaggregation (sgl-project#14945) ... # Conflicts: # python/sglang/srt/model_executor/piecewise_cuda_graph_runner.py
There was a problem hiding this comment.
This blog is removed. The link in https://lmsys.org/blog/2025-12-10-rfork/ is broken.
Co-authored-by: fzyzcjy <ch271828n@outlook.com>
Co-authored-by: fzyzcjy <ch271828n@outlook.com>
This reverts commit 70758d4.
Motivation
that commit has broken b200 cis
Modifications
Accuracy Tests
Benchmarking and Profiling
Checklist