[CPU] Add apply_routed_scaling_factor_on_output support for biased_grouped_topk fusion by jianan-gu · Pull Request #22413 · sgl-project/sglang

jianan-gu · 2026-04-09T03:38:05Z

This PR:

removes the restriction on apply_routed_scaling_factor_on_output in the CPU path of the fused biased_grouped_topk_cpu kernel
adds fp32 dtype support for gating_output
refines the topk expert numbers
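
For readers unfamiliar with the option, here is a minimal pure-Python sketch of what applying the routed scaling factor on the output means for a single token. It is not the CPU kernel from this PR. The function name biased_grouped_topk_ref, the sigmoid scoring, the sum-of-top-2 group score, and the order of renormalization and scaling are all assumptions, modeled on the commonly used DeepSeek-style biased grouped top-k routing; the actual fused kernel may differ in detail.

```python
import math


def biased_grouped_topk_ref(
    gating_output,
    correction_bias,
    topk,
    num_expert_group,
    topk_group,
    renormalize=True,
    routed_scaling_factor=1.0,
    apply_routed_scaling_factor_on_output=False,
):
    """Hypothetical single-token reference; illustrative only, not the fused CPU kernel."""
    num_experts = len(gating_output)
    assert num_experts % num_expert_group == 0
    group_size = num_experts // num_expert_group

    # Assumption: sigmoid scoring. The bias only affects which experts are chosen.
    scores = [1.0 / (1.0 + math.exp(-float(x))) for x in gating_output]
    biased = [s + b for s, b in zip(scores, correction_bias)]

    # Assumption: each group is ranked by the sum of its two largest biased scores.
    group_scores = []
    for g in range(num_expert_group):
        members = sorted(biased[g * group_size:(g + 1) * group_size], reverse=True)
        group_scores.append(sum(members[:2]))
    ranked_groups = sorted(range(num_expert_group), key=lambda g: -group_scores[g])
    kept_groups = set(ranked_groups[:topk_group])

    # Pick the top-k experts among the kept groups, ranked by biased score.
    candidates = [e for e in range(num_experts) if e // group_size in kept_groups]
    topk_ids = sorted(candidates, key=lambda e: -biased[e])[:topk]

    # The returned weights use the unbiased scores.
    weights = [scores[e] for e in topk_ids]
    if renormalize:
        total = sum(weights)
        weights = [w / total for w in weights]
    if apply_routed_scaling_factor_on_output:
        # The scaling is folded into the top-k weights instead of being applied later.
        weights = [w * routed_scaling_factor for w in weights]
    return weights, topk_ids


# Example inputs: 8 experts in 4 groups of 2, zero bias.
logits = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -2.0, 1.0]
bias = [0.0] * 8
scaled_weights, scaled_ids = biased_grouped_topk_ref(
    logits, bias, topk=2, num_expert_group=4, topk_group=2,
    routed_scaling_factor=2.5, apply_routed_scaling_factor_on_output=True,
)
plain_weights, plain_ids = biased_grouped_topk_ref(
    logits, bias, topk=2, num_expert_group=4, topk_group=2,
    routed_scaling_factor=2.5, apply_routed_scaling_factor_on_output=False,
)
```

With the flag set, the selected expert ids are unchanged and each weight is multiplied by routed_scaling_factor, so under this sketch the caller would not need to scale the MoE output again.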

gemini-code-assist · 2026-04-09T03:38:09Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

mingfeima

minor changes needed to simplify the code.

mingfeima · 2026-04-09T07:46:45Z

/tag-and-rerun-ci

Kangyan-Zhou · 2026-04-09T18:12:44Z

/rerun-failed-ci

jianan-gu · 2026-04-10T02:45:42Z

/rerun-failed-ci

jianan-gu · 2026-04-10T03:01:56Z

The latest Xeon CI is broken by PR #20796.


* fix topk softmax performance issue (sgl-project#14702) * [CPU] Add apply_routed_scaling_factor_on_output support for biased_grouped_topk fusion (sgl-project#22413) Co-authored-by: Ma Mingfei <mingfei.ma@intel.com> * add kernel biased_topk_cpu * add kernel hash_topk_cpu --------- Co-authored-by: Ma Mingfei <mingfei.ma@intel.com> Co-authored-by: jianan-gu <jianan.gu@intel.com>

jianan-gu added 2 commits April 8, 2026 10:38

init pr

6dfe6f9

refine ut

ead0531

jianan-gu requested review from BBuf, Edwardf0t1, FlamingoPg, Fridge003, HaiShaw, Ying1123, ch-wan, ispobock, merrymercy and yizhang2077 as code owners April 9, 2026 03:38

github-actions Bot added the sgl-kernel label Apr 9, 2026

jianan-gu changed the title ~~[CPU] Add apply_routed_scaling_factor support for biased_grouped_topk~~ [CPU] Add apply_routed_scaling_factor_on_output support for biased_grouped_topk fusion Apr 9, 2026

add more size

0c6291d

mingfeima requested changes Apr 9, 2026


Comment thread sgl-kernel/csrc/cpu/common.h Outdated

refine micro

6b8ce5b

jianan-gu requested a review from mingfeima April 9, 2026 05:32

mingfeima approved these changes Apr 9, 2026


Merge branch 'main' into jianan/bias_gtopk

cfe4b50

github-actions Bot added the run-ci label Apr 9, 2026

jianan-gu and others added 2 commits April 10, 2026 08:18

Merge branch 'main' into jianan/bias_gtopk

da6bd04

Merge branch 'main' into jianan/bias_gtopk

8dfc5ee

mingfeima merged commit 2ab1415 into sgl-project:main Apr 10, 2026
53 of 63 checks passed

Fridge003 pushed a commit that referenced this pull request Apr 11, 2026

[CPU] Add apply_routed_scaling_factor_on_output support for biased_gr…

b8ef7fb

…ouped_topk fusion (#22413) Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>

pyc96 pushed a commit to pyc96/sglang that referenced this pull request Apr 14, 2026

[CPU] Add apply_routed_scaling_factor_on_output support for biased_gr…

05f21cc

…ouped_topk fusion (sgl-project#22413) Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>

yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Apr 22, 2026

[CPU] Add apply_routed_scaling_factor_on_output support for biased_gr…

d1fe952

…ouped_topk fusion (sgl-project#22413) Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>

mingfeima merged 7 commits into sgl-project:main from jianan-gu:jianan/bias_gtopk

3 participants
