Added the ability to Modify the Context Length #210

Merged
comaniac merged 2 commits into sgl-project:main from psych0v0yager:variable_ctx
Feb 21, 2024

Conversation

@psych0v0yager (Contributor) commented Feb 20, 2024

Fixes issue #159

You can now specify the context length the model should use with the new --context-length flag.

For example, Mixtral 8x7B AWQ with the default context length:

python -m sglang.launch_server --model-path /path/to/bagel_mixtral_AWQ --port 30000 --tp 2

Rank 1: max_total_num_token=135505, max_prefill_num_token=32768, context_len=32768, model_mode=[]
Rank 0: max_total_num_token=135505, max_prefill_num_token=32768, context_len=32768, model_mode=[]

With the --context-length adjustment:

python -m sglang.launch_server --model-path /path/to/bagel_mixtral_AWQ --port 30000 --tp 2 --context-length 8192

Rank 0: max_total_num_token=135505, max_prefill_num_token=22584, context_len=8192, model_mode=[]
Rank 1: max_total_num_token=135505, max_prefill_num_token=22584, context_len=8192, model_mode=[]
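To illustrate the behavior described above, here is a minimal sketch of how a context-length override flag can be wired in and take precedence over the value baked into a model's config. This is illustrative only; the function and parameter names below are hypothetical and are not sglang's actual internals.

```python
import argparse
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical launcher arguments mirroring the commands shown above.
    parser = argparse.ArgumentParser(description="Illustrative launch script")
    parser.add_argument("--model-path", type=str, required=True)
    parser.add_argument("--port", type=int, default=30000)
    parser.add_argument("--tp", type=int, default=1)
    parser.add_argument(
        "--context-length",
        type=int,
        default=None,
        help="Override the model's default context length (e.g. 8192).",
    )
    return parser


def resolve_context_len(config_context_len: int, override: Optional[int]) -> int:
    # If the user passed --context-length, it wins; otherwise fall back to
    # the model config's value (32768 for the Mixtral example above).
    return override if override is not None else config_context_len
```

Under this sketch, launching without the flag resolves to the model's 32768-token default, while passing `--context-length 8192` yields the reduced `context_len=8192` seen in the second log output.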

Contributor

@comaniac comaniac left a comment

LGTM

@comaniac comaniac linked an issue Feb 21, 2024 that may be closed by this pull request
@comaniac comaniac merged commit 9de9a46 into sgl-project:main Feb 21, 2024

Development

Successfully merging this pull request may close these issues.

initialise model with max_model_len

2 participants