docs(cookbook): add Qwen3.6-27B dense variant by JustinTong0323 · Pull Request #23486 · sgl-project/sglang

JustinTong0323 · 2026-04-22T15:14:49Z

Summary

Qwen3.6 ships a 27B dense variant (Qwen3.6-27B / Qwen3.6-27B-FP8) alongside the existing 35B-A3B MoE. Update the cookbook page and deployment snippet to cover both.

- Rewrite intro / available-models / hardware-requirements sections for both variants
- Add a Model Size radio (35B-A3B MoE / 27B Dense) to the deployment snippet; --model-path is derived from the selected baseName, and the same flags carry over
- Drop the stale sglang[all] install hint in favor of plain uv pip install sglang (matches the install doc; [all] only pulls diffusion/tracing/http2, irrelevant for an autoregressive VLM)
- Note in Section 4 that 27B swaps only --model-path
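
A minimal sketch of how the model path could be derived from the Model Size selection. This is not the code in qwen36-deployment.jsx: the `BASE_NAMES` mapping, the `Qwen/` organization prefix, the `Qwen3.6-35B-A3B` base name, and both helper functions are assumptions made for illustration. Only the `-FP8` suffix, the 27B names, and TP=2 come from this PR.

```python
import shlex

# Hypothetical mirror of the "Model Size" radio: each option maps to a baseName.
# "Qwen3.6-35B-A3B" is an assumed name; the PR only spells out the 27B names.
BASE_NAMES = {
    "35B-A3B MoE": "Qwen3.6-35B-A3B",
    "27B Dense": "Qwen3.6-27B",
}


def model_path(size: str, fp8: bool = False, org: str = "Qwen") -> str:
    """Build the --model-path value from the selected size and precision."""
    suffix = "-FP8" if fp8 else ""
    return f"{org}/{BASE_NAMES[size]}{suffix}"  # "Qwen/" prefix is an assumption


def launch_command(size: str, fp8: bool = False, tp: int = 2) -> str:
    """Return a launch command in which only --model-path depends on the size."""
    args = [
        "python", "-m", "sglang.launch_server",
        "--model-path", model_path(size, fp8),
        "--tp-size", str(tp),  # TP=2 matches the H200 setup in the test plan
    ]
    return shlex.join(args)


if __name__ == "__main__":
    for size in BASE_NAMES:
        print(launch_command(size, fp8=True))
```

Switching between the two sizes changes a single token in the printed command, which is the behavior described for Section 4.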

Test plan

Verified on H200 TP=2 on top of #23467 (FP8 MLP shard-matching fix) via benchmark/mmmu/bench_sglang.py with --reasoning-parser qwen3-thinking --tool-call-parser qwen25, concurrency 48:

| Model | MMMU val accuracy (n=900) |
| --- | --- |
| Qwen3.6-27B (BF16) | 55.1% |
| Qwen3.6-27B-FP8 | 53.0% |

The 2.1-point gap is within the Wilson 95% CI on n=900 (±3.2 points), so FP8 is at parity with BF16, i.e. no accuracy regression from the FP8 fix that landed in #23467.
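
For reference, the interval quoted above can be reproduced with the standard Wilson score formula. The sketch below is not part of the benchmark script; the function name is hypothetical and it takes the reported accuracies as proportions rather than raw correct counts, which the PR does not list.

```python
import math


def wilson_interval(p_hat: float, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 gives ~95%)."""
    denom = 1.0 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half = z * math.sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


if __name__ == "__main__":
    n = 900  # MMMU val size used in the test plan
    for name, acc in [("BF16", 0.551), ("FP8", 0.530)]:
        low, high = wilson_interval(acc, n)
        print(f"{name}: {acc:.3f}  CI=[{low:.3f}, {high:.3f}]  half-width={(high - low) / 2:.3f}")
```

With these inputs the half-width comes out at a little over 3.2 points, and the FP8 point estimate falls inside the BF16 interval, which is the sense in which the two runs are treated as equivalent here.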

- mint dev preview renders the updated Qwen3.6 page (both variants in Available Models / Hardware tables; Model Size radio toggles the command)
- @babel/parser parses the updated qwen36-deployment.jsx
- Generator output for default selections matches the documented deploy command in Section 4

Qwen3.6 ships a 27B dense variant alongside the existing 35B-A3B MoE. Update the cookbook page and deployment snippet:

- Rewrite intro / available-models / hardware-requirements sections to cover both variants.
- Add a Model Size radio (35B-A3B MoE / 27B Dense) to the deployment snippet; model path is derived from the selected baseName.
- Drop the stale 'sglang[all]' install hint in favor of plain 'uv pip install sglang' (matches the install doc; '[all]' pulls diffusion/tracing/http2, irrelevant for an autoregressive VLM).
- Note in Section 4 that 27B swaps only '--model-path'; other flags carry over.

Verified with the sanity + MMMU runs against the FP8 fix from sgl-project#23467 on H200 TP=2:

BF16 MMMU 900/val: 55.1%
FP8 MMMU 900/val: 53.0% (within Wilson 95% CI, i.e. parity)

cs-cat · 2026-04-22T16:34:47Z

The MMMU result is very different from the official result (82.9). Was this test run on the sglang main branch?

JustinTong0323 requested a review from wisclmy0611 as a code owner April 22, 2026 15:14

mickqian merged commit de962f3 into sgl-project:main Apr 22, 2026
42 checks passed

zhangying098 pushed a commit to zhangying098/sglang that referenced this pull request Apr 23, 2026

docs(cookbook): add Qwen3.6-27B dense variant (sgl-project#23486)

5fb0c58

mickqian merged 1 commit into sgl-project:main from JustinTong0323:feat/qwen36-27b-docs
