Skip to content

[refactor] [asr] Add transcription adapter for extensible ASR models support#22181

Merged
mickqian merged 6 commits intosgl-project:mainfrom
AgainstEntropy:refactor/transcription-adapter
Apr 8, 2026
Merged

[refactor] [asr] Add transcription adapter for extensible ASR models support#22181
mickqian merged 6 commits intosgl-project:mainfrom
AgainstEntropy:refactor/transcription-adapter

Conversation

@AgainstEntropy
Copy link
Copy Markdown
Collaborator

Motivation

Follow up of #22073. To remove hardcoded model family detection and branching.
See reviews in #22073 and #22089.

Modifications

Mainly scoped under python/sglang/srt/entrypoints/openai/
Will coordinate with #22089 to resolve conflicts and dependencies.

Accuracy Tests

Same as #22073

Speed Tests and Profiling

To be added.

Checklist

Review and Merge Process

  1. Ping Merge Oncalls to start the process. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
  4. After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@github-actions github-actions Bot added documentation Improvements or additions to documentation Multi-modal multi-modal language model labels Apr 6, 2026
@@ -1329,13 +1328,13 @@ def is_generation_model(model_architectures: List[str], is_embedding: bool = Fal
"Qwen3VLMoeForConditionalGeneration",
"Qwen3_5ForConditionalGeneration",
"Qwen3_5MoeForConditionalGeneration",
"Qwen3ASRForConditionalGeneration",
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe detect this by whether we have dedicated processor/adaptor for this model, instead of maintaining a hard-coded list, in the future

@mickqian
Copy link
Copy Markdown
Collaborator

mickqian commented Apr 7, 2026

/tag-and-rerun-ci

@github-actions github-actions Bot added the run-ci label Apr 7, 2026
- Introduced `TranscriptionAdapter` abstract class for model-specific transcription logic.
- Added `Qwen3ASRAdapter` and `WhisperAdapter` implementations for respective ASR models.
- Implemented adapter registration via `@register_transcription_adapter` decorator.
- Updated `OpenAIServingTranscription` to utilize the adapter framework for processing requests.
@AgainstEntropy AgainstEntropy force-pushed the refactor/transcription-adapter branch from 2971bbb to a51a32f Compare April 7, 2026 23:44
@mickqian mickqian merged commit a5ed507 into sgl-project:main Apr 8, 2026
365 of 402 checks passed
@AgainstEntropy AgainstEntropy deleted the refactor/transcription-adapter branch April 9, 2026 23:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation Multi-modal multi-modal language model run-ci

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants