Enhance VL embedding model with video input support and revise warm-up strategy by yuhao318 · Pull Request #16635 · sgl-project/sglang

yuhao318 · 2026-01-07T07:23:48Z

Motivation

This PR extends the input schema of the embedding model to accommodate video inputs, and refines the warmup mechanism of the vision-language model (VLM) to ensure compatibility with VL embedding model initialization.

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments (/tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci) or contact authorized users to do so.
After green CI and required approvals, ask Merge Oncalls to merge.

gemini-code-assist · 2026-01-07T07:23:51Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

ispobock · 2026-01-08T09:06:27Z

/tag-and-rerun-ci

…arm-up strategy (#16635) Co-authored-by: Mick <mickjagger19@icloud.com>

yuhao318 added 5 commits January 6, 2026 14:05

Add condition for generation models in HTTP server

3fe69b1

Add video input for MultimodalEmbeddingInput

1318cf6

Add video input for serving_embedding

4845c73

Add video input for tokenizer_manager

cf8d997

Add video input for conversation

b868ec0

yuhao318 requested review from CatherineSue, JustinTong0323, Ying1123, hnyls2002, ispobock, merrymercy, slin1237 and xiezhq-hermann as code owners January 7, 2026 07:23

github-actions Bot added the run-ci label Jan 8, 2026

mickqian added 2 commits January 8, 2026 20:51

lint

4fd41fd

Merge branch 'main' into add_qwen3vl_embedding_support

e43f290

mickqian merged commit d2ea44f into sgl-project:main Jan 8, 2026
134 of 156 checks passed

hnyls2002 pushed a commit that referenced this pull request Jan 8, 2026

VLM: enhance VL embedding model with video input support and revise w…

a18166c

…arm-up strategy (#16635) Co-authored-by: Mick <mickjagger19@icloud.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance VL embedding model with video input support and revise warm-up strategy#16635

Enhance VL embedding model with video input support and revise warm-up strategy#16635
mickqian merged 7 commits intosgl-project:mainfrom
yuhao318:add_qwen3vl_embedding_support

yuhao318 commented Jan 7, 2026

Uh oh!

gemini-code-assist Bot commented Jan 7, 2026

Uh oh!

ispobock commented Jan 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yuhao318 commented Jan 7, 2026

Motivation

Checklist

Review Process

Uh oh!

gemini-code-assist Bot commented Jan 7, 2026

Uh oh!

ispobock commented Jan 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants