feat: add Qwen2.5 training chat template with generation markers by RudrenduPaul · Pull Request #5522 · huggingface/trl

RudrenduPaul · 2026-04-11T06:23:24Z

What does this PR do?

Adds a training-compatible chat template for Qwen2.5 models (e.g. Qwen/Qwen2.5-0.5B-Instruct), following the same pattern introduced for LLaMA 3 in #5493 and GPT-OSS.

Files added:

trl/chat_templates/qwen2_5.jinja — exact copy of the official Qwen2.5 ChatML tokenizer template (sourced from Qwen/Qwen2.5-0.5B-Instruct)
trl/chat_templates/qwen2_5_training.jinja — training variant with {% generation %} / {% endgeneration %} markers wrapping all assistant output (both plain and tool-call branches) for assistant-only loss masking in SFT

Changes to existing files:

trl/chat_template_utils.py: loads both templates, registers Qwen2.5 in get_training_chat_template(), updates docstring
tests/test_chat_template_utils.py: adds trl-internal-testing/tiny-Qwen2ForCausalLM-2.5 to TestGetTrainingChatTemplate

The training template handles all three assistant message cases: (1) plain text response, (2) tool calls with content, (3) tool calls without content — all wrapped in the generation markers.

Closes part of #5471 (tracking issue for training chat templates).

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you update the documentation with your changes?
Did you write any new necessary tests?

AI writing disclosure

I used AI assistance (Claude Code) to draft and implement this change.

Who can review?

@qgallouedec

Note

Low Risk
Low risk: additive template support and a small selector change, covered by existing chat-template compatibility tests. Main risk is mismatching the upstream Qwen2.5 template, which could affect prompt rendering for those models.

Overview
Adds Qwen2.5 support to get_training_chat_template() by loading a new qwen2_5.jinja base template and returning a new qwen2_5_training.jinja variant that wraps all assistant output (including tool-call branches) in {% generation %} / {% endgeneration %} for assistant-only loss masking.

Extends TestGetTrainingChatTemplate to include a Qwen2.5 tiny tokenizer, ensuring the patched template remains prefix-preserving and behavior-compatible with the original rendering.

^{Reviewed by Cursor Bugbot for commit bd28f10. Bugbot is set up for automated code reviews on this repo. Configure here.}

Adds qwen2_5.jinja (official Qwen2.5 ChatML template) and qwen2_5_training.jinja (training variant with {% generation %} / {% endgeneration %} markers for assistant-only loss masking in SFT). Registers both in chat_template_utils.py and adds Qwen2.5 to the TestGetTrainingChatTemplate test suite. Built by Rudrendu Paul, developed with Claude Code

qgallouedec · 2026-04-12T01:25:15Z

@codex review

HuggingFaceDocBuilderDev · 2026-04-12T01:27:29Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

chatgpt-codex-connector · 2026-04-12T01:29:09Z

Codex Review: Didn't find any major issues. 🚀

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 814cb0d. Configure here.}

qgallouedec

thanks!

RudrenduPaul and others added 2 commits April 10, 2026 23:23

Merge branch 'main' into feat/qwen2-5-training-chat-template

8392789

qgallouedec reviewed Apr 12, 2026

View reviewed changes

Comment thread trl/chat_template_utils.py Outdated

Apply suggestions from code review

a363314

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

qgallouedec reviewed Apr 12, 2026

View reviewed changes

Comment thread trl/chat_template_utils.py

Apply suggestions from code review

814cb0d

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

cursor Bot reviewed Apr 12, 2026

View reviewed changes

Comment thread trl/chat_template_utils.py

fix

bd28f10

qgallouedec approved these changes Apr 12, 2026

View reviewed changes

qgallouedec merged commit d6d5efc into huggingface:main Apr 12, 2026
1 check passed

This was referenced Apr 12, 2026

feat: add Phi-3 training chat template with generation markers #5526

Merged

feat: add DeepSeek-V3 training chat template with generation markers #5527

Merged

qgallouedec mentioned this pull request Apr 22, 2026

Tracking: Add {% generation %} chat templates for common model families #5471

Open

24 tasks

aazizyan mentioned this pull request May 7, 2026

Add Qwen2.5 response schema #5728

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Qwen2.5 training chat template with generation markers#5522

feat: add Qwen2.5 training chat template with generation markers#5522
qgallouedec merged 5 commits into
huggingface:mainfrom
RudrenduPaul:feat/qwen2-5-training-chat-template

RudrenduPaul commented Apr 11, 2026 •

edited by cursor Bot

Loading

Uh oh!

qgallouedec commented Apr 12, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 12, 2026

Uh oh!

chatgpt-codex-connector Bot commented Apr 12, 2026

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

qgallouedec left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

RudrenduPaul commented Apr 11, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

AI writing disclosure

Who can review?

Uh oh!

qgallouedec commented Apr 12, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 12, 2026

Uh oh!

chatgpt-codex-connector Bot commented Apr 12, 2026

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

RudrenduPaul commented Apr 11, 2026 •

edited by cursor Bot

Loading