
Enhance VL embedding model with video input support and revise warm-up strategy #16635

Merged
mickqian merged 7 commits into sgl-project:main from yuhao318:add_qwen3vl_embedding_support on Jan 8, 2026
Conversation

@yuhao318
Contributor

@yuhao318 yuhao318 commented Jan 7, 2026

Motivation

This PR extends the embedding model's input schema to accept video inputs, and revises the vision-language model (VLM) warm-up so that it is compatible with VL embedding model initialization.

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments (/tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci) or contact authorized users to do so.
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@gemini-code-assist
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@ispobock
Collaborator

ispobock commented Jan 8, 2026

/tag-and-rerun-ci

@github-actions github-actions Bot added the run-ci label Jan 8, 2026
@mickqian mickqian merged commit d2ea44f into sgl-project:main Jan 8, 2026
134 of 156 checks passed
hnyls2002 pushed a commit that referenced this pull request Jan 8, 2026
…arm-up strategy (#16635)

Co-authored-by: Mick <mickjagger19@icloud.com>


3 participants