feat: add Phi-3 training chat template with generation markers by RudrenduPaul · Pull Request #5526 · huggingface/trl

RudrenduPaul · 2026-04-12T20:02:35Z

What does this PR do?

Adds a training-compatible chat template for Microsoft Phi-3 models (e.g. microsoft/Phi-3-mini-4k-instruct), following the same pattern introduced for LLaMA 3 in #5493, GPT-OSS, and Qwen2.5 in #5522.

Files added:

trl/chat_templates/phi3.jinja — exact copy of the official Phi-3 tokenizer template (sourced from microsoft/Phi-3-mini-4k-instruct)
trl/chat_templates/phi3_training.jinja — training variant with {% generation %} / {% endgeneration %} markers wrapping all assistant output for assistant-only loss masking in SFT

Changes to existing files:

trl/chat_template_utils.py: loads both templates, registers Phi-3 in get_training_chat_template(), updates docstring to mention Phi-3
tests/test_chat_template_utils.py: adds trl-internal-testing/tiny-Phi3ForCausalLM to TestGetTrainingChatTemplate

The training template handles the Phi-3 assistant message format (<|assistant|>\n{content}<|end|>\n), wrapping the content in {% generation %} / {% endgeneration %} markers so SFT trainers can apply loss only on assistant tokens.

Closes part of #5471 (tracking issue for training chat templates).

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you update the documentation with your changes?
Did you write any new necessary tests?

AI writing disclosure

I used AI assistance (Claude Code) to draft and implement this change.

Who can review?

@qgallouedec

Note

Low Risk
Low risk: adds a new model-specific training chat template and wires it into template selection logic, with minimal impact on existing models. Main risk is incorrect template matching/formatting for Phi-3 affecting SFT masking for that model only.

Overview
Adds Phi-3 support to get_training_chat_template() by loading new phi3.jinja and a phi3_training.jinja variant that wraps assistant output (and final EOS when appropriate) in {% generation %} markers for assistant-only loss masking.

Updates documentation to describe the new Phi-3 training template, and extends TestGetTrainingChatTemplate to include a Phi-3 tiny tokenizer in the existing behavior/masking test suite.

^{Reviewed by Cursor Bugbot for commit 91f5b79. Bugbot is set up for automated code reviews on this repo. Configure here.}

Add training chat template for Microsoft Phi-3 with {% generation %} markers for SFT assistant-only loss masking. Part of huggingface#5471. Built by Rudrendu Paul, developed with Claude Code

qgallouedec

thanks, can you add phi3 to chat_template/README as well

HuggingFaceDocBuilderDev · 2026-04-14T15:02:30Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec

Per @qgallouedec review: add phi3 entry to chat_templates/README.md. Fix prefix-preserving issue flagged by Cursor — remove else/eos_token branch that breaks prefix preservation (matching llama3/qwen2_5 pattern). Merge upstream/main to resolve conflict. Built by Rudrendu Paul, developed with Claude Code

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 18f5cfa. Configure here.}

RudrenduPaul · 2026-04-17T02:42:57Z

Thanks @qgallouedec for the approval! I'll add Phi-3 to chat_templates/README in the next push.

Re: the Cursor Bugbot finding — it's a valid call. The {% else %}{{ eos_token }}{% endif %} block in phi3_training.jinja does break the prefix-preserving property, and the other training templates (llama3_training, qwen2_5_training, qwen3_training, gptoss_training) all omit that else branch for exactly that reason. I'll align phi3_training.jinja with the rest of the training-template family by removing the else branch, so EOS is only emitted via the per-message <|end|> and <|endoftext|> markers already in the template. Will push the update along with the README change.

RudrenduPaul · 2026-04-19T04:31:44Z

Both follow-ups addressed:

Phi-3 added to chat_templates/README — the template entry is now in the README alongside Llama 3, Qwen2.5, and DeepSeek-V3.
{% else %}{{ eos_token }}{% endif %} branch removed — phi3_training.jinja now omits the else branch, matching the prefix-preserving pattern of llama3_training, qwen2_5_training, and qwen3_training.

Happy to adjust anything further.

feat: add Phi-3 training chat template with generation markers

1966924

Add training chat template for Microsoft Phi-3 with {% generation %} markers for SFT assistant-only loss masking. Part of huggingface#5471. Built by Rudrendu Paul, developed with Claude Code

cursor Bot reviewed Apr 12, 2026

View reviewed changes

Comment thread trl/chat_templates/phi3_training.jinja

qgallouedec approved these changes Apr 14, 2026

View reviewed changes

cursor Bot reviewed Apr 16, 2026

View reviewed changes

Comment thread trl/chat_templates/phi3_training.jinja

qgallouedec mentioned this pull request Apr 22, 2026

Tracking: Add {% generation %} chat templates for common model families #5471

Open

24 tasks

qgallouedec and others added 5 commits April 22, 2026 13:46

Merge branch 'main' into feat/phi-3-training-chat-template

88f696d

fix ref

ac8dca5

fix generation markers

9b609c5

fix tiny

353cc1b

Merge branch 'main' into feat/phi-3-training-chat-template

91f5b79

qgallouedec merged commit 644d173 into huggingface:main Apr 22, 2026
11 of 12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Phi-3 training chat template with generation markers#5526

feat: add Phi-3 training chat template with generation markers#5526
qgallouedec merged 7 commits into
huggingface:mainfrom
RudrenduPaul:feat/phi-3-training-chat-template

RudrenduPaul commented Apr 12, 2026 •

edited by cursor Bot

Loading

Uh oh!

Uh oh!

qgallouedec left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Apr 14, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

RudrenduPaul commented Apr 17, 2026

Uh oh!

RudrenduPaul commented Apr 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

RudrenduPaul commented Apr 12, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

AI writing disclosure

Who can review?

Uh oh!

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Apr 14, 2026

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

RudrenduPaul commented Apr 17, 2026

Uh oh!

RudrenduPaul commented Apr 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

RudrenduPaul commented Apr 12, 2026 •

edited by cursor Bot

Loading