Re-enable unit-test-deepep-8-gpu and unit-test-backend-4-gpu-gb200#17438
Merged
Re-enable unit-test-deepep-8-gpu and unit-test-backend-4-gpu-gb200#17438
Conversation
Both runners have been fixed: - 8-GPU H200 runner: IBGDA environment issues resolved (#17175) - 4-GPU GB200 runner: repaired and working (#17367) Successful run: https://github.com/sgl-project/sglang/actions/runs/21139104876/job/60851136785
Contributor
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
Collaborator
Author
|
/rerun-stage unit-test-backend-4-gpu-gb200 |
Contributor
|
✅ Triggered |
Contributor
Collaborator
Author
|
/rerun-stage unit-test-deepep-8-gpu |
Contributor
|
✅ Triggered |
Contributor
Uncomment the suite that was disabled in #17175. The IBGDA/cudaHostRegister environment issues on the 8-GPU runner have been fixed.
Collaborator
Author
|
/rerun-stage unit-test-deepep-8-gpu |
Contributor
|
✅ Triggered |
Contributor
When call-gate fails, all stage-b jobs are skipped. Without this fix, wait-for-stage-b would run and wait forever because it expects 23 matrix jobs but only sees 4 skipped jobs (one per matrix). Add call-gate to the needs and check its result to skip wait-for-stage-b when call-gate fails.
Same issue as wait-for-stage-b: when call-gate fails, stage-a-test-1 is skipped, but wait-for-stage-a would still run and treat skipped as success. Add call-gate to needs and skip wait-for-stage-a when call-gate fails.
merrymercy
approved these changes
Jan 21, 2026
Johnsonms
pushed a commit
to Johnsonms/sglang
that referenced
this pull request
Feb 14, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
unit-test-deepep-8-gpu(8-GPU H200 runner fixed, unit-test-deepep-8-gpu: test_deepep_large.py failing with IBGDA/cudaHostRegister errors #17175)unit-test-backend-4-gpu-gb200(GB200 runner repaired, Disable unit-test-backend-4-gpu-gb200 job #17367)Successful GB200 run: https://github.com/sgl-project/sglang/actions/runs/21139104876/job/60851136785