
Support gpt oss moe lora #21375

Open
gongyisheng wants to merge 4 commits into sgl-project:main from gongyisheng:sglang-gpt-oss-moe-lora

Conversation

@gongyisheng
Contributor

@gongyisheng gongyisheng commented Mar 25, 2026

Motivation

Support MoE LoRA for gpt-oss in miles.

Modifications

This PR includes two parts:

  • Adapt to gpt-oss-specific weight names
    • python/sglang/srt/server_args.py: use Triton as the MoE backend when a gpt-oss model is served with a LoRA adapter
    • python/sglang/srt/lora/mem_pool.py: support the gpt-oss MoE architecture, which has no shared experts
    • python/sglang/srt/lora/utils.py: add the gpt-oss MoE layer names
    • python/sglang/srt/utils/hf_transformers_utils.py: add gpt-oss-specific configs
  • Fixes for load_lora
    • python/sglang/srt/managers/tp_worker.py: support loading LoRA in a TP environment
    • python/sglang/srt/managers/io_struct.py: change a type hint, since serialized_tensors is already a list[str]
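The backend override in server_args.py described above can be sketched roughly as follows. This is a hedged illustration, not the actual sglang code: the function name `select_moe_backend` and the architecture string `GptOssForCausalLM` are assumptions for the example; the real check lives inside the server-args validation logic.

```python
# Hypothetical sketch of the MoE-backend override the PR describes:
# when a gpt-oss model is combined with LoRA adapters, fall back to the
# Triton MoE backend, since other MoE backends lack LoRA support.
# select_moe_backend and GptOssForCausalLM are illustrative names only.
from typing import Optional


def select_moe_backend(
    model_arch: str,
    lora_paths: Optional[list],
    requested_backend: str,
) -> str:
    # Force Triton for gpt-oss + LoRA; otherwise honor the user's choice.
    if model_arch == "GptOssForCausalLM" and lora_paths:
        return "triton"
    return requested_backend
```

Usage would mirror server startup: with `--lora-paths` set on a gpt-oss model, `select_moe_backend("GptOssForCausalLM", ["adapter_a"], "flashinfer")` returns `"triton"`, while any other model keeps its requested backend.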

TODO:

  • add unit tests

Accuracy Tests

N/A

Benchmarking and Profiling

N/A

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@gemini-code-assist
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@gongyisheng
Contributor Author

@yushengsu-thu review request

@yushengsu-thu yushengsu-thu self-assigned this Apr 9, 2026
