[NPU] solve accuracy problem for stablelm-2-1-6b for npu #17470

Merged
iforgetmyname merged 9 commits into sgl-project:main from McZyWu:main0121-stablelm2-1-6b
Jan 24, 2026
@McZyWu
Contributor

@McZyWu McZyWu commented Jan 21, 2026

Motivation

Fix the accuracy of stablelm-2-1-6b on NPU. Before this change, the measured accuracy was 0.0.

Modifications

Cast to a higher-precision dtype before applying rotary position embedding (RoPE), so that the RoPE computation does not lose precision.
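
The changed code is not shown in this description. As a rough illustration of why raising the precision before RoPE can matter, the sketch below rotates a single pair of values two ways: once with every intermediate rounded to fp16, and once in full precision with a single rounding at the end. It simulates fp16 rounding with the standard-library `struct` module instead of real NPU tensors. The function names, the choice of fp16, and the sample values are assumptions for illustration only, not taken from this PR.

```python
import math
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def rotate_fp16(x1, x2, angle):
    """Rotate the pair (x1, x2) with every intermediate rounded to fp16."""
    c, s = to_fp16(math.cos(angle)), to_fp16(math.sin(angle))
    out1 = to_fp16(to_fp16(x1 * c) - to_fp16(x2 * s))
    out2 = to_fp16(to_fp16(x1 * s) + to_fp16(x2 * c))
    return out1, out2


def rotate_upcast(x1, x2, angle):
    """Rotate in full precision and round to fp16 once, at the end."""
    c, s = math.cos(angle), math.sin(angle)
    return to_fp16(x1 * c - x2 * s), to_fp16(x1 * s + x2 * c)


def max_abs_error(rotate, x1, x2, angles):
    """Largest deviation from the float64 reference over all angles."""
    worst = 0.0
    for angle in angles:
        c, s = math.cos(angle), math.sin(angle)
        ref = (x1 * c - x2 * s, x1 * s + x2 * c)
        got = rotate(x1, x2, angle)
        worst = max(worst, abs(got[0] - ref[0]), abs(got[1] - ref[1]))
    return worst


# Hypothetical inputs: one fp16 pair, rotated by 2048 increasing angles.
x1, x2 = to_fp16(0.3), to_fp16(-1.7)
angles = [pos * 0.01 for pos in range(2048)]
err_low = max_abs_error(rotate_fp16, x1, x2, angles)
err_up = max_abs_error(rotate_upcast, x1, x2, angles)
print(f"fp16 intermediates: {err_low:.2e}, upcast: {err_up:.2e}")
```

The upcast path rounds only once, so its error can never exceed that of the path that rounds at every step. In a real model the same pattern would be applied to whole tensors, with a cast back to the original dtype after the rotation.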

Accuracy Tests

Accuracy: 20%

[three images attached in the original PR]

## Start command

python -m sglang.launch_server \
--model-path /xxxxxxxxx/xxxxxxxxx/model path for stablelm-2-1-6b \
--trust-remote-code \
--mem-fraction-static 0.9 \
--host 127.0.0.1 \
--port 8700 \
--tp-size 1 \
--device npu \
--attention-backend ascend \
--max-running-requests 64 \
--disable-radix-cache \
--disable-overlap-schedule \
--watchdog-timeout 9000 \
--base-gpu-id xxxx \
--disable-cuda-graph \
--enable-torch-compile \
--chunked-prefill-size -1

## test_gsm8k command

Set the model path for stablelm-2-1-6b in test/registered/ascend/llm_models/test_ascend_stablelm-2-1_6b.py, then run:

python test/registered/ascend/llm_models/test_ascend_stablelm-2-1_6b.py

@xiaobaicxy
Contributor

Please run pre-commit and update the PR description

@github-actions github-actions Bot added the npu label Jan 21, 2026
@McZyWu
Contributor Author

McZyWu commented Jan 21, 2026

> Please run pre-commit and update the PR description

Sure. Thank you.

@iforgetmyname iforgetmyname self-assigned this Jan 21, 2026
Comment thread python/sglang/srt/models/stablelm.py Outdated
Comment thread python/sglang/srt/models/stablelm.py Outdated
@McZyWu McZyWu changed the title from "solve accuracy problem for model stable-2-1-6b for npu" to "[NPU] solve accuracy problem for model stable-2-1-6b for npu" Jan 21, 2026
@iforgetmyname
Collaborator

/tag-and-rerun-ci

@iforgetmyname iforgetmyname changed the title from "[NPU] solve accuracy problem for model stable-2-1-6b for npu" to "[NPU] solve accuracy problem for model stablemate-2-1-6b for npu" Jan 24, 2026
@iforgetmyname iforgetmyname changed the title from "[NPU] solve accuracy problem for model stablemate-2-1-6b for npu" to "[NPU] solve accuracy problem for stablelm-2-1-6b for npu" Jan 24, 2026
@iforgetmyname iforgetmyname merged commit b4a611f into sgl-project:main Jan 24, 2026
284 of 297 checks passed
Johnsonms pushed a commit to Johnsonms/sglang that referenced this pull request Feb 14, 2026