[Model Runner v2] Migration from v1 to v2, with Qwen and DSv2 MOE models [3/N]#42667

Merged
njhill merged 9 commits into main from wentao-mrv2-migration-moe on Jun 12, 2026

Conversation

@yewentao256

Member

Purpose

Makes progress on #41286.

Test

Covered in CI

Signed-off-by: yewentao256 <zhyanwentao@126.com>

@claude claude Bot left a comment

Claude Code Review

This repository is configured for manual code reviews. Comment @claude review to trigger a review and subscribe this PR to future pushes, or @claude review once for a one-time review.

Tip: disable this comment in your organization's Code Review settings.

@yewentao256 yewentao256 added the ready ONLY add when PR is ready to merge/full CI is needed label May 14, 2026
@yewentao256 yewentao256 changed the title from "[Model Runner v2] Migration from v1 to v2, with Qwen and DSv2 MOE models" to "[Model Runner v2] Migration from v1 to v2, with Qwen and DSv2 MOE models [3/N]" May 14, 2026

@gemini-code-assist gemini-code-assist Bot left a comment

Contributor

Code Review

This pull request enables the V2 model runner for specific Mixture-of-Experts (MoE) architectures by adding 'DeepseekV2ForCausalLM' and 'Qwen2MoeForCausalLM' to the allowed architectures list. The logic in _is_default_v2_model_runner_model was refactored to allow these MoE models while maintaining the exclusion of quantized models. The test suite was also updated with new cases for DeepSeek, Qwen MoE, and PowerMoE to verify the configuration changes. I have no feedback to provide as there were no review comments to evaluate.
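
The gating behavior described in the summary above (an allow-list of architectures, with quantized models still excluded) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code from the PR: `ModelInfo`, `V2_DEFAULT_ARCHITECTURES`, and `is_default_v2_candidate` are hypothetical names, and only the two architecture strings are taken from the summary.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical allow-list; only these two names come from the review summary.
V2_DEFAULT_ARCHITECTURES = frozenset({
    "DeepseekV2ForCausalLM",
    "Qwen2MoeForCausalLM",
})


@dataclass(frozen=True)
class ModelInfo:
    """Hypothetical stand-in for a model config; not a vLLM class."""
    architectures: tuple[str, ...]
    quantization: Optional[str] = None  # e.g. "fp8"; None means unquantized


def is_default_v2_candidate(info: ModelInfo) -> bool:
    """Return True if the model would default to the V2 runner in this sketch."""
    # Quantized models stay excluded, as the summary states.
    if info.quantization is not None:
        return False
    # Any allow-listed architecture qualifies, including the MoE ones.
    return any(arch in V2_DEFAULT_ARCHITECTURES for arch in info.architectures)
```

Under these assumptions, an unquantized `Qwen2MoeForCausalLM` model passes the check, while the same architecture with a quantization method set does not.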

@mergify mergify Bot added the qwen Related to Qwen models label May 14, 2026
@yewentao256 yewentao256 added the ready-run-all-tests Trigger CI with all tests for wide-ranging PRs label May 14, 2026
@njhill njhill added the v2 label May 20, 2026
@mergify

mergify Bot commented Jun 3, 2026

Contributor

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @yewentao256.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify Bot added the needs-rebase label Jun 3, 2026
Signed-off-by: yewentao256 <zhyanwentao@126.com>
@mergify mergify Bot removed the needs-rebase label Jun 3, 2026
Signed-off-by: yewentao256 <zhyanwentao@126.com>

@yewentao256 yewentao256 left a comment

Member Author

Should be ready to merge now; all CI failures are unrelated.

@njhill njhill left a comment

Member

Thanks @yewentao256! Let's wait to see how the current CI run completes...

And we should make sure that this doesn't go into the 0.23.0 release.

@yewentao256 yewentao256 left a comment

Member Author

Thanks @njhill. Right, it won't, as we already cut the release commit on Sunday night.

@mergify

mergify Bot commented Jun 11, 2026

Contributor

Hi @yewentao256, the pre-commit checks have failed. Please run:

uv pip install "pre-commit>=4.5.1"
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

@yewentao256 yewentao256 left a comment

Member Author

The buildkite/ci/pr/e2e-core-1-gpu failure has been fixed in main and no other issues were found, so I'm removing the ready-run-all-tests label.

@yewentao256 yewentao256 removed the ready-run-all-tests Trigger CI with all tests for wide-ranging PRs label Jun 12, 2026
@njhill njhill enabled auto-merge (squash) June 12, 2026 20:41
@njhill njhill merged commit 78739c1 into main Jun 12, 2026
74 checks passed
@njhill njhill deleted the wentao-mrv2-migration-moe branch June 12, 2026 20:44
Saddss pushed a commit to Saddss/vllm that referenced this pull request Jun 14, 2026
…els [3/N] (vllm-project#42667)

Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>