[Bugfix] Enable audio transcription endpoint for Gemma 4#43609
[Bugfix] Enable audio transcription endpoint for Gemma 4#43609SoluMilken wants to merge 4 commits into
Conversation
|
Documentation preview: https://vllm--43609.org.readthedocs.build/en/43609/ |
There was a problem hiding this comment.
Code Review
This pull request implements speech-to-text capabilities for the Gemma 4 model, including transcription and translation support. The changes involve updating the supported models documentation, adding a new test file for transcription, and implementing the SupportsTranscription interface in the Gemma 4 model executor. Review feedback highlighted a recurring typo where the end-of-turn token was incorrectly written as <turn|> instead of <|turn|>, which needs to be corrected in both the model implementation and the associated tests to ensure proper tokenizer behavior.
e6b6bca to
492d3ed
Compare
Isotr0py
left a comment
There was a problem hiding this comment.
Overall look reasonable. But would like to let @NickLucche have a second eye too.
|
Thanks @darklight for helping request the right reviewers, and a huge thanks to @isotrop for the super quick review! 🙌 |
0c9b311 to
c6edc42
Compare
Co-authored-by: OpenAI Codex Signed-off-by: SoluMilken <ypiheyn.imm02g@g2.nctu.edu.tw>
Signed-off-by: SoluMilken <ypiheyn.imm02g@g2.nctu.edu.tw>
Signed-off-by: SoluMilken <ypiheyn.imm02g@g2.nctu.edu.tw>
Signed-off-by: SoluMilken <ypiheyn.imm02g@g2.nctu.edu.tw>
c6edc42 to
24652b4
Compare
|
Try to fix the failed CI buildkite/ci/pr/basic-models-tests-extra-initialization-2 by this PR #43831 |
|
This pull request has merge conflicts that must be resolved before it can be |
Purpose
Fix #40994.
Test Plan
Launch a server
Call transcription API
Call translation API
Test Result
Call transcription API
Call translation API
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.