[Model] Let more models to support the score template.#31335
[Model] Let more models to support the score template.#31335noooop merged 28 commits intovllm-project:mainfrom
Conversation
|
Documentation preview: https://vllm--31335.org.readthedocs.build/en/31335/ |
There was a problem hiding this comment.
Code Review
This pull request enhances vLLM's support for reranker models by introducing score templates for several new models. The changes include adding new, well-documented example scripts for both offline and online scoring, creating specific Jinja templates for different models, and refactoring existing examples to be more flexible and user-friendly. The overall quality of the changes is high, with clear documentation and good coding practices. I have reviewed the changes for high and critical severity issues and found none. The PR is a valuable addition that improves the usability of vLLM for reranking tasks.
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
4b3cd5d to
9cf0a7c
Compare
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
ℹ️ About Codex in GitHub
Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
|
Hi @noooop, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
|
Thanks for the review and Happy New Year. |
|
Hi @noooop, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
|
Perhaps we should upgrade sentence-transformers in another PR Let's do this next year. |
|
Hi @noooop, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
de308fb to
a3db23b
Compare
…31335) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
…31335) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
…31335) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
…31335) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
…31335) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
TL;DR
examples/pooling/score/qwen3_reranker_offline.py
examples/pooling/score/qwen3_reranker_online.py
- Qwen/Qwen3-Reranker-0.6B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-4B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-BB: qwen3_reranker.jinja
examples/pooling/score/using_template_offline.py
examples/pooling/score/using_template_online.py
- BAAI/bge-reranker-v2-gemma: bge-reranker-v2-gemma.jinja
- tomaarsen/Qwen3-Reranker-0.6B-seq-cls: qwen3_reranker.jinja
- tomaarsen/Qwen3-Reranker-4B-seq-cls: qwen3_reranker.jinja
- tomaarsen/Qwen3-Reranker-8B-seq-cls: qwen3_reranker.jinja
- mixedbread-ai/mxbai-rerank-base-v2: mxbai_rerank_v2.jinja
- mixedbread-ai/mxbai-rerank-large-v2: mxbai_rerank_v2.jinja
- nvidia/llama-nemotron-rerank-1b-v2: nemotron-rerank.jinja
- Qwen/Qwen3-Reranker-0.6B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-4B: qwen3_reranker.jinja
- Qwen/Qwen3-Reranker-BB: qwen3_reranker.jinja
Purpose
Following #30550
Let more models to support the score template.
Add more models to Testing and Examples
Test Plan
tests/models/language/pooling_mteb_test/
Test Result
pass
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.