Piecewise Cuda Graph Support for gpt-oss model by Oasis-Git · Pull Request #13045 · sgl-project/sglang

Oasis-Git · 2025-11-11T04:22:15Z

Motivation

Support Piecewise cuda graph for gpt-oss series model.

Modifications

MoE backend Select: With piecewise cuda graph, we can achieve similar performance with auto backend compared with triton backend.
Adjust the position of enable_piecewise_cudagraph to avoid circular import

Accuracy Tests

In benchmark & profilling section

Benchmarking and Profiling

For gsm 8k test:

piecewise cuda graph support with auto backend

Accuracy: 0.525
Invalid: 0.158
Latency: 34.185 s
Output throughput: 16977.166 token/s

triton backend

Accuracy: 0.522
Invalid: 0.149
Latency: 34.580 s
Output throughput: 17012.391 token/s

auto backend only

Accuracy: 0.512
Invalid: 0.157
Latency: 250.572 s
Output throughput: 2301.222 token/s

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.

Signed-off-by: Oasis-Git <ayw.sirius19@gmail.com>

gemini-code-assist · 2025-11-11T04:22:19Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

BBuf · 2025-11-11T06:23:25Z

            if self.moe_runner_backend == "auto":
-                if is_blackwell_supported() and is_mxfp4_quant_format:
+                if self.enable_piecewise_cuda_graph:
+                    self.moe_runner_backend = "auto"


Repeating code?

BBuf

LGTM.

ispobock · 2025-11-12T17:33:37Z


            if self.moe_runner_backend == "auto":
-                if is_blackwell_supported() and is_mxfp4_quant_format:
+                if self.enable_piecewise_cuda_graph:


If we enable both piecewise cuda graph and flashinfer_mxfp4, which moe_runner_backend will use?

I see. Here is potential problems. Will fix it soon

gpt oss without ep

d92ea13

Signed-off-by: Oasis-Git <ayw.sirius19@gmail.com>

Oasis-Git requested review from BBuf, Edwardf0t1, Fridge003, HaiShaw, Ying1123, ch-wan, hnyls2002, ispobock, kushanam and merrymercy as code owners November 11, 2025 04:22

github-actions Bot added the deepseek label Nov 11, 2025

Oasis-Git mentioned this pull request Nov 11, 2025

[Feature] Roadmap for Prefill (Piecewise) CUDA Graph #11490

Closed

34 tasks

BBuf reviewed Nov 11, 2025

View reviewed changes

BBuf approved these changes Nov 11, 2025

View reviewed changes

hebiao064 added the run-ci label Nov 11, 2025

Oasis-Git and others added 2 commits November 11, 2025 20:30

Merge branch 'main' into gpt-oss

42e39e5

Merge branch 'main' into gpt-oss

ebc32ba

ispobock reviewed Nov 12, 2025

View reviewed changes

Merge branch 'sgl-project:main' into gpt-oss

59ce1c6

ispobock approved these changes Nov 15, 2025

View reviewed changes

ispobock merged commit eae59b3 into sgl-project:main Nov 15, 2025
51 of 61 checks passed

Oasis-Git deleted the gpt-oss branch November 22, 2025 06:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Piecewise Cuda Graph Support for gpt-oss model#13045

Piecewise Cuda Graph Support for gpt-oss model#13045
ispobock merged 4 commits intosgl-project:mainfrom
Oasis-Git:gpt-oss

Oasis-Git commented Nov 11, 2025 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented Nov 11, 2025

Uh oh!

BBuf Nov 11, 2025

Uh oh!

BBuf left a comment

Uh oh!

ispobock Nov 12, 2025

Uh oh!

Oasis-Git Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Oasis-Git commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Uh oh!

gemini-code-assist Bot commented Nov 11, 2025

Uh oh!

BBuf Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

BBuf left a comment

Choose a reason for hiding this comment

Uh oh!

ispobock Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

Oasis-Git Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Oasis-Git commented Nov 11, 2025 •

edited

Loading