Conversation
5a84411 to
97b04d4
Compare
merrymercy
requested changes
Jan 16, 2024
Ying1123
reviewed
Jan 16, 2024
Ying1123
reviewed
Jan 17, 2024
Ying1123
reviewed
Jan 17, 2024
Contributor
|
|
yanbing-j
added a commit
to yanbing-j/sglang
that referenced
this pull request
May 30, 2025
… and decode attention kernel (sgl-project#9) * Add intel_amx backend for attention, including extend and decode * update
5 tasks
pengxin99
pushed a commit
to pengxin99/sglang
that referenced
this pull request
Jun 19, 2025
sleepcoo
added a commit
to shuaills/sglang
that referenced
this pull request
Jun 24, 2025
5 tasks
5 tasks
yichiche
pushed a commit
to yichiche/sglang
that referenced
this pull request
Jul 30, 2025
Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
siuhunh
pushed a commit
to xing-wenjin/sglang
that referenced
this pull request
Aug 1, 2025
…agregation add llm-datadist feature to realize pd disaggregate
yichiche
pushed a commit
to yichiche/sglang
that referenced
this pull request
Aug 7, 2025
Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche
pushed a commit
to yichiche/sglang
that referenced
this pull request
Aug 11, 2025
Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
5 tasks
5 tasks
someoneexistsontheinternet
pushed a commit
to someoneexistsontheinternet/sglang
that referenced
this pull request
Oct 23, 2025
Merge in 0 4 9 post2
kalyank007
pushed a commit
to kalyank007/sglang
that referenced
this pull request
Nov 7, 2025
…unked prefill size for OOM scenarios (sgl-project#9)
5 tasks
amd-youchen
referenced
this pull request
in amd-youchen/sglang
Nov 13, 2025
[Fix] fix fuse share expert function in MI35X
yhyang201
pushed a commit
that referenced
this pull request
Dec 13, 2025
* test: add EPD disaggregation integration tests * fix comment for encoder-only * revert http_server warmup for vlm
fstandhartinger
pushed a commit
to fstandhartinger/sglang
that referenced
this pull request
Jan 13, 2026
Devstral tools
tpoisonooo
pushed a commit
to tpoisonooo/sglang
that referenced
this pull request
Feb 12, 2026
5 tasks
chx96642264
pushed a commit
to chx96642264/sglang
that referenced
this pull request
Mar 11, 2026
lawrence-harmonic
added a commit
to lawrence-harmonic/sglang
that referenced
this pull request
Mar 19, 2026
mmangkad
pushed a commit
to mmangkad-dev/sglang
that referenced
this pull request
Apr 3, 2026
nit: modify weight loader warning msg
rucnyz
added a commit
to rucnyz/sglang
that referenced
this pull request
Apr 30, 2026
…s 28 xfers v9 pool-binding-shift trace produces real differentiation: - Phase B (KV-bound 8K random): L1+L2 -37% mean TTFT vs stock - Phase C (mixed 4K random): L1+L2 -38% median E2E vs stock - Cross-pool transfers: stock 0, L1-only 0, L2-only 0, L1+L2 28 Two surprising findings documented: 1. Layer 2 alone fires zero transfers — Layer 1 retention is what makes Layer 2 cross the firing threshold. 2. Phase A regresses with L1 (-20% TPS) because K_big=8192 hurts on prefix-friendly GSP. Consistent with A2's K_big=0-wins finding. Adaptive K_big control marked as follow-up. Settings status: Setting 1 marked **DONE v6 NULL + v9 PASS**. All 4 user-requested follow-ups (sgl-project#9 Q3.A 4-arm, sgl-project#10 Sweep 1 multi-seed, sgl-project#11 Setting 4 fallback rule, sgl-project#12 Setting 1 v9 trace) now complete.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
add support for gemini backend