Skip to content

[Fix] Try to fix error caused by latest cutedsl packages #25690

Merged
hnyls2002 merged 4 commits into
mainfrom
fix-ci
May 18, 2026
Merged

[Fix] Try to fix error caused by latest cutedsl packages #25690
hnyls2002 merged 4 commits into
mainfrom
fix-ci

Conversation

@Fridge003

@Fridge003 Fridge003 commented May 18, 2026

Copy link
Copy Markdown
Collaborator

Motivation

Ref: https://github.com/sgl-project/sglang/actions/runs/26055697810/job/76604443741
vllm-project/vllm#40082 (comment)

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

  1. Ping Merge Oncalls to start the process. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
  4. After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

CI States

Latest PR Test (Base): ⏳ Run #26065692957
Latest PR Test (Extra): ⏳ Run #26065692857

@gemini-code-assist

Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@github-actions github-actions Bot added the dependencies Pull requests that update a dependency file label May 18, 2026
@Fridge003

Copy link
Copy Markdown
Collaborator Author

/rerun-test test_streaming_session.py

@github-actions

github-actions Bot commented May 18, 2026

Copy link
Copy Markdown
Contributor

🚀 1-gpu-h100 (1 test): ✅ View workflow run

cd test/ && python3 registered/sessions/test_streaming_session.py

@hnyls2002 hnyls2002 merged commit b79e4b1 into main May 18, 2026
101 of 128 checks passed
@hnyls2002 hnyls2002 deleted the fix-ci branch May 18, 2026 23:51
Fridge003 added a commit that referenced this pull request May 19, 2026
Kangyan-Zhou added a commit to Kangyan-Zhou/sglang that referenced this pull request May 21, 2026
…l_dependency.sh

nvidia-cutlass-dsl[cu13] has additive PyPI extras: both -libs-base AND
-libs-cu13 are installed together, writing to the same site-packages
paths with conflicting content. This causes a GPUModuleOp TypeError at
kernel-compile time (vllm-project/vllm#40082).

The correct libs package depends on the GPU family, not just CUDA version:

  Blackwell (IS_BLACKWELL=1, CU13):
    -libs-cu13 must win. It carries the sm_110 arch alias that the
    CUDA-12.9-built -libs-base wheel lacks.
    Fix: purge -libs-base, force-reinstall -libs-cu13.

  Non-Blackwell CU13 (H100, H200):
    -libs-base must win. Forcing only -libs-cu13 introduces a
    CUDBG_EXCEPTION_WARP_ILLEGAL_ADDRESS regression in LoRA CUDA-graph
    capture (sgl-project#25743).
    Fix: purge -libs-cu13, force-reinstall -libs-base.

  Non-CU13: only -libs-base installed (no [cu13] extra), no conflict.

Add fix_cutlass_dsl_libs() called from main() after download_flashinfer_cache,
mirroring the position of the original purge_cutlass_libs_base() from sgl-project#25690.
Shunkangz pushed a commit to Shunkangz/sglang that referenced this pull request May 27, 2026
alphabetc1 pushed a commit to alphabetc1/sglang that referenced this pull request Jun 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bypass-fastfail dependencies Pull requests that update a dependency file run-ci run-ci-extra

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants