
Adding Support for Qwen3.5#43830

Merged
vasqu merged 26 commits into huggingface:main from bozheng-hit:qwen3_5
Feb 9, 2026

Conversation

@bozheng-hit
Contributor

This PR adds support for the upcoming Qwen3.5 series models. For information about Qwen, please visit:
👉https://qwen.ai

Special thanks to @JJJYmmm for helping complete the code in this PR. We also appreciate the valuable feedback and thorough review provided by @vasqu and @ArthurZucker ! 🙏

@fpshuang

This comment was marked as off-topic.

@1-bytes

1-bytes commented Feb 9, 2026

Please approve this soon ~ looking forward to it!

@vasqu vasqu enabled auto-merge (squash) February 9, 2026 10:57
@github-actions
Contributor

github-actions bot commented Feb 9, 2026

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, qwen3_5, qwen3_5_moe

@vasqu vasqu merged commit fc91372 into huggingface:main Feb 9, 2026
25 checks passed
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jiosephlee pushed a commit to jiosephlee/transformers_latest that referenced this pull request Feb 11, 2026
* rebase main

* remove redundant init

* fix

* remove qwen3vlmoe mapping since resolved

* add auto image processor

* fix

* fix

* update qwen3_next_style rmsnorm

* update text config check

* simplify vision model

* simplify vision config

* inherit pretrainedmodel and qwen3next decoder layer forward

* simplify config

* fix apply_rotary_pos_emb import

* move to latest main, update vision output and fix rope validation

* fix text-only model loading

* fix config

* quick fixes

* fix rope ignore keys

* oops

* add test suite

* style

* docs

* ok

* last consistency fix

---------

Co-authored-by: bozheng-hit <dsoul0621@gmail.com>
Co-authored-by: JJJYmmm <92386084+JJJYmmm@users.noreply.github.com>
Co-authored-by: vasqu <antonprogamer@gmail.com>
@MaoJianwei

Been waiting for this for a long time.

@BestJuly

BestJuly commented Mar 3, 2026

Hi @bozheng-hit, thanks for the PR. I have a quick question: we noticed that the dtypes of A_log and out_norm in the HF released checkpoint differ from Qwen3-Next. Should these two parameters also be stored in FP32? Or is FP32 only required for computation, while storage could remain in BF16 as in Qwen3-Next?

The current implementation in MCore uses BF16 for storage, and computation for these parts is done in FP32. But if the storage dtypes are also required to be FP32, some changes would be needed (draft PR). Thank you.
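For context, the pattern under discussion can be sketched as follows. This is a minimal illustration, not the actual transformers or MCore implementation; the layer and parameter names here are hypothetical, modeled on the A_log parameter mentioned above. It shows the BF16-storage / FP32-computation split: the parameter lives in BF16, and is upcast to FP32 only for the numerically sensitive step.

```python
import torch
import torch.nn as nn


class DecaySketch(nn.Module):
    """Hypothetical layer illustrating BF16 storage with FP32 computation."""

    def __init__(self, num_heads: int):
        super().__init__()
        # Stored in BF16 (as in Qwen3-Next). Whether Qwen3.5 additionally
        # requires FP32 *storage* here is exactly the open question above.
        self.A_log = nn.Parameter(torch.zeros(num_heads, dtype=torch.bfloat16))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Upcast to FP32 only for the sensitive exp/decay computation...
        decay = -torch.exp(self.A_log.float())
        # ...then cast back to the activation dtype for the rest of the model.
        return x * decay.to(x.dtype)
```

If FP32 storage turned out to be required, the `dtype=torch.bfloat16` in the parameter definition would change to `torch.float32` and the `.float()` upcast would become a no-op, at the cost of larger checkpoints.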


9 participants