Added the ability to Modify the Context Length #210
Merged
comaniac merged 2 commits into sgl-project:main on Feb 21, 2024
Fixes issue #159
You can now specify how much context you want the model to have.
For example, with Mixtral 8x7B AWQ:
python -m sglang.launch_server --model-path /path/to/bagel_mixtral_AWQ --port 30000 --tp 2
With the adjustment:
python -m sglang.launch_server --model-path /path/to/bagel_mixtral_AWQ --port 30000 --tp 2 --context-length 8192
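To illustrate how an optional flag like this can override a model's default, here is a minimal sketch. It is not the code from this PR: `build_parser`, `resolve_context_length`, the default values for `--port` and `--tp`, and the model default of 32768 are all hypothetical and only mirror the flags shown in the commands above.

```python
import argparse
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: it only mirrors the flags used in the commands above
    # and is not sglang's actual argument definition.
    parser = argparse.ArgumentParser()
    parser.add_argument("--model-path", required=True)
    parser.add_argument("--port", type=int, default=30000)  # assumed default
    parser.add_argument("--tp", type=int, default=1)  # assumed default
    # None means "no override requested".
    parser.add_argument("--context-length", type=int, default=None)
    return parser


def resolve_context_length(cli_value: Optional[int], model_default: int) -> int:
    # Fall back to the model's own limit when the flag is omitted.
    if cli_value is None:
        return model_default
    if cli_value <= 0:
        raise ValueError("--context-length must be a positive integer")
    return cli_value


# Illustrative model default; a real server would read it from the model config.
MODEL_DEFAULT_CONTEXT = 32768

args = build_parser().parse_args(
    ["--model-path", "/path/to/bagel_mixtral_AWQ", "--tp", "2",
     "--context-length", "8192"]
)
context_length = resolve_context_length(args.context_length, MODEL_DEFAULT_CONTEXT)
print(context_length)
```

Keeping `None` as the flag's default lets the sketch distinguish "not provided" from an explicit value, so the first command above would keep the model's own limit and the second would use 8192.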