Qwen3.6 integration by qgallouedec · Pull Request #5642 · huggingface/trl

qgallouedec · 2026-04-25T20:50:49Z

Qwen3.6 (Qwen/Qwen3.6-27B, Qwen/Qwen3.6-35B-A3B) reuses the Qwen3_5Moe* architecture but ships a slightly different chat template (adds preserve_thinking flag, tweaks tool-arg stringification). Exact-string template matching in chat_template_utils.py therefore fails for Qwen3.6 tokenizers.

Changes

Chat templates: add qwen3_6.jinja (verbatim from upstream) and qwen3_6_training.jinja (prefix-preserving + {% generation %} markers).
chat_template_utils.py: register both templates; route Qwen3.6 to the existing qwen3_5_schema in add_response_schema (output format is unchanged); route to qwen3_6_training_chat_template in get_training_chat_template.
scripts/generate_tiny_models.py: add Qwen/Qwen3.6-35B-A3B to the VLM loop (pushed as tiny-Qwen3_5MoeForConditionalGeneration-3.6 to leave room for future Qwen3.5-MoE variants); reuse the Qwen3.5 dense gotchas (force one full-attention layer, fp32-restore for linear-attn weights) and add MoE-specific shrinks.
Tests: parametrize the new tiny model in test_chat_template_utils, test_data_utils.TestApplyChatTemplate, and the SFT/DPO/GRPO/RLOO test_(train|training)_vlm cases.

Note

Medium Risk
Medium risk because it extends chat-template matching and training template swapping for a new model family, which can affect tool-calling formatting and assistant-only loss masking across Qwen variants if the template detection/mapping is wrong.

Overview
Adds Qwen3.6 support by bundling upstream qwen3_6.jinja plus a new training-patched qwen3_6_training.jinja (prefix-preserving tool-call rendering and {% generation %} markers for assistant-only loss).

Updates chat_template_utils.py to recognize Qwen3.6 templates for add_response_schema (reusing the existing Qwen3.5 response schema) and to return the new Qwen3.6 training template from get_training_chat_template.

Extends the tiny-model generator and test matrix to include a Qwen3.6 VLM tiny model (tiny-Qwen3_5MoeForConditionalGeneration-3.6), with MoE-specific config downsizing and docs updated to list Qwen3.6 as supported/tested.

^{Reviewed by Cursor Bugbot for commit 9a66674. Bugbot is set up for automated code reviews on this repo. Configure here.}

HuggingFaceDocBuilderDev · 2026-04-25T20:53:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

AmineDiro

super cool to suport it this quick !

Qwen3.6

e85e9d4

qgallouedec added 2 commits April 25, 2026 21:00

style + doc

99fe25c

fix cast

9a66674

AmineDiro approved these changes Apr 25, 2026

View reviewed changes

qgallouedec merged commit 2f10689 into main Apr 26, 2026
11 of 13 checks passed

qgallouedec deleted the qwen3.6 branch April 26, 2026 15:16

qgallouedec added a commit that referenced this pull request Apr 27, 2026

Qwen3.6 integration (#5642)

39bafd4

qgallouedec mentioned this pull request May 25, 2026

Tracking: Add {% generation %} chat templates for common model families #5471

Open

24 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen3.6 integration#5642

Qwen3.6 integration#5642
qgallouedec merged 3 commits into
mainfrom
qwen3.6

qgallouedec commented Apr 25, 2026 •

edited by cursor Bot

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Apr 25, 2026

Uh oh!

AmineDiro left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

qgallouedec commented Apr 25, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Uh oh!

HuggingFaceDocBuilderDev commented Apr 25, 2026

Uh oh!

AmineDiro left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qgallouedec commented Apr 25, 2026 •

edited by cursor Bot

Loading