[Fix] Fixed Qwen3.5 dense model load weight issue (#232). by IzacharyI · Pull Request #239 · zejunchen-zejun/sglang

IzacharyI · 2026-04-03T11:27:04Z

Motivation

Fixed dense model load weight issue (#232) .

Modifications

Cherry-pick PR 21019 load weight config and add BF16 qkv z b a fusion and PTPC quant config

Accuracy Tests

397B-A17B BF16：

397B-A17B PTPC FP8：

27B BF16 TP2

27B FP8 TP2

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.

- Cherry-pick PR sgl-project#21019 load weight func - Add BF16 qkv z b a fusion and PTPC quant config

[Fix] Fixed Qwen3.5 dense model load weight issue (sgl-project#232).

d68fe2a

- Cherry-pick PR sgl-project#21019 load weight func - Add BF16 qkv z b a fusion and PTPC quant config

qichu-yun merged commit 773c750 into zejunchen-zejun:Qwen3.5_v0.5.9 Apr 3, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix] Fixed Qwen3.5 dense model load weight issue (#232).#239

[Fix] Fixed Qwen3.5 dense model load weight issue (#232).#239
qichu-yun merged 1 commit intozejunchen-zejun:Qwen3.5_v0.5.9from
IzacharyI:Qwen3.5_v0.5.9

IzacharyI commented Apr 3, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

IzacharyI commented Apr 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

IzacharyI commented Apr 3, 2026 •

edited

Loading