Tracking: Add `{% generation %}` chat templates for common model families

### Context

SFT with `assistant_only_loss=True` requires the chat template to include `{% generation %}` / `{% endgeneration %}` markers so that `return_assistant_tokens_mask=True` can produce correct masks. Very few models ship these markers natively.

TRL should provide training chat templates with these markers for known model families (via `get_training_chat_template()`), and the SFT trainer auto-applies them when `assistant_only_loss=True`.

This issue tracks adding training templates for all model families with chat template support.

### What needs to happen for each model

1. Create `<model>_training.jinja`: the original template with `{% generation %}` / `{% endgeneration %}` added around assistant output (and prefix-preservation fixes if needed)
2. Load it in `chat_template_utils.py` and add a branch in `get_training_chat_template()`
3. Add a test verifying text output is identical to the original and masks are correct

### Model families

#### Causal LMs with chat template

- [x] Qwen2.5 https://github.com/huggingface/trl/pull/5522
- [x] Qwen3 https://github.com/huggingface/trl/pull/5470
- [x] Qwen3-2507 (Instruct) https://github.com/huggingface/trl/pull/5574
- [x] Qwen3.5 https://github.com/huggingface/trl/pull/5824
- [x] Qwen3.6 https://github.com/huggingface/trl/pull/5642
- [x] Cohere https://github.com/huggingface/trl/pull/5627
- [x] Cohere2 https://github.com/huggingface/trl/pull/5675
- [x] DeepSeek-V3 https://github.com/huggingface/trl/pull/5527
- [x] Gemma https://github.com/huggingface/trl/pull/5523
- [x] Gemma2 https://github.com/huggingface/trl/pull/5523
- [x] GLM-4-MoE #5519
- [x] GPT-OSS #5484
- [x] Llama 3 #5493
- [x] Phi-3 https://github.com/huggingface/trl/pull/5526
- [x] Phi-3.5 https://github.com/huggingface/trl/pull/5746
- [ ] FalconMamba https://github.com/huggingface/trl/pull/5723

#### VLMs

- [x] Gemma3 https://github.com/huggingface/trl/pull/5685
- [x] Qwen3-VL https://github.com/huggingface/trl/pull/5764
- [ ] Qwen2-VL
- [ ] Qwen2.5-VL
- [ ] SmolVLM
- [ ] Idefics3
- [ ] LLaVA
- [ ] LLaVA-Next

#### No chat template (no action needed)

- Bloom, GPT2, GPTNeoX, OPT, T5

### Notes

- VLMs currently don't support `assistant_only_loss` in SFT (blocked by a separate check). These should still be tracked so templates are ready when support lands.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking: Add `{% generation %}` chat templates for common model families #5471

Context

What needs to happen for each model

Model families

Causal LMs with chat template

VLMs

No chat template (no action needed)

Notes

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Tracking: Add {% generation %} chat templates for common model families #5471

Description

Context

What needs to happen for each model

Model families

Causal LMs with chat template

VLMs

No chat template (no action needed)

Notes

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Tracking: Add `{% generation %}` chat templates for common model families #5471