Skip to content

[Model] Let more models to support the score template.#31335

Merged
noooop merged 28 commits intovllm-project:mainfrom
noooop:more_score_template
Jan 5, 2026
Merged

[Model] Let more models to support the score template.#31335
noooop merged 28 commits intovllm-project:mainfrom
noooop:more_score_template

Conversation

@noooop
Copy link
Copy Markdown
Collaborator

@noooop noooop commented Dec 25, 2025

TL;DR

examples/pooling/score/qwen3_reranker_offline.py
examples/pooling/score/qwen3_reranker_online.py
- Qwen/Qwen3-Reranker-0.6B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-4B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-BB: qwen3_reranker.jinja
examples/pooling/score/using_template_offline.py
examples/pooling/score/using_template_online.py
- BAAI/bge-reranker-v2-gemma: bge-reranker-v2-gemma.jinja
- tomaarsen/Qwen3-Reranker-0.6B-seq-cls: qwen3_reranker.jinja
- tomaarsen/Qwen3-Reranker-4B-seq-cls: qwen3_reranker.jinja
- tomaarsen/Qwen3-Reranker-8B-seq-cls: qwen3_reranker.jinja
- mixedbread-ai/mxbai-rerank-base-v2: mxbai_rerank_v2.jinja
- mixedbread-ai/mxbai-rerank-large-v2: mxbai_rerank_v2.jinja
- nvidia/llama-nemotron-rerank-1b-v2: nemotron-rerank.jinja
- Qwen/Qwen3-Reranker-0.6B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-4B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-BB: qwen3_reranker.jinja

Purpose

Following #30550
Let more models to support the score template.

Add more models to Testing and Examples

  • bge_reranker_v2_gemma √
  • mxbai_rerank √
  • qwen3_reranker √
  • nemotron_rerank √

Test Plan

tests/models/language/pooling_mteb_test/

Test Result

pass


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@mergify
Copy link
Copy Markdown

mergify bot commented Dec 25, 2025

Documentation preview: https://vllm--31335.org.readthedocs.build/en/31335/

@mergify mergify bot added documentation Improvements or additions to documentation qwen Related to Qwen models labels Dec 25, 2025
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request enhances vLLM's support for reranker models by introducing score templates for several new models. The changes include adding new, well-documented example scripts for both offline and online scoring, creating specific Jinja templates for different models, and refactoring existing examples to be more flexible and user-friendly. The overall quality of the changes is high, with clear documentation and good coding practices. I have reviewed the changes for high and critical severity issues and found none. The PR is a valuable addition that improves the usability of vLLM for reranking tasks.

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
@noooop noooop force-pushed the more_score_template branch from 4b3cd5d to 9cf0a7c Compare December 25, 2025 10:35
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
@noooop noooop added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 29, 2025
@noooop noooop marked this pull request as ready for review December 29, 2025 09:09
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
@mergify mergify bot added the ci/build label Dec 30, 2025
@vllm-project vllm-project deleted a comment from mergify bot Dec 30, 2025
@vllm-project vllm-project deleted a comment from mergify bot Dec 30, 2025
@vllm-project vllm-project deleted a comment from mergify bot Dec 30, 2025
@noooop noooop removed the ready ONLY add when PR is ready to merge/full CI is needed label Dec 30, 2025
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
@mergify
Copy link
Copy Markdown

mergify bot commented Dec 30, 2025

Hi @noooop, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?
mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:
# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint

@noooop noooop requested a review from DarkLight1337 December 30, 2025 06:46
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
@noooop noooop added the rl Related to RL workflows label Dec 31, 2025
@noooop
Copy link
Copy Markdown
Collaborator Author

noooop commented Dec 31, 2025

Thanks for the review and Happy New Year.

@noooop noooop added ready ONLY add when PR is ready to merge/full CI is needed and removed rl Related to RL workflows labels Dec 31, 2025
@noooop noooop removed the ready ONLY add when PR is ready to merge/full CI is needed label Dec 31, 2025
@mergify
Copy link
Copy Markdown

mergify bot commented Dec 31, 2025

Hi @noooop, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?
mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:
# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint

@noooop
Copy link
Copy Markdown
Collaborator Author

noooop commented Dec 31, 2025

Perhaps we should upgrade sentence-transformers in another PR

Let's do this next year.

@mergify
Copy link
Copy Markdown

mergify bot commented Jan 5, 2026

Hi @noooop, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?
mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:
# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
@noooop noooop force-pushed the more_score_template branch from de308fb to a3db23b Compare January 5, 2026 07:49
@noooop noooop enabled auto-merge (squash) January 5, 2026 09:50
@noooop noooop added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 5, 2026
@noooop noooop merged commit 911d38e into vllm-project:main Jan 5, 2026
59 checks passed
@noooop noooop deleted the more_score_template branch January 5, 2026 11:57
LucasWilkinson pushed a commit to neuralmagic/vllm that referenced this pull request Jan 6, 2026
…31335)

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026
…31335)

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026
…31335)

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
…31335)

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
…31335)

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build documentation Improvements or additions to documentation qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants