Refactor CLI [14/N] : Remove TrainingArguments import from core trainers by albertvillanova · Pull Request #5257 · huggingface/trl

albertvillanova · 2026-03-10T06:49:36Z

Remove TrainingArguments import from core trainers.

This PR is part of a refactoring to reduce CLI latency.

This PR refactors multiple trainer configuration files to remove direct dependencies on transformers.TrainingArguments and instead rely on the project's own _BaseConfig for managing valid dictionary fields. This change improves modularity and reduces external coupling across the configuration classes.

See upstream issue:

Lazy loading is not working properly transformers#44273

Dependency refactoring:

Removed the import of TrainingArguments from transformers in all core trainer config files (dpo_config.py, grpo_config.py, reward_config.py, rloo_config.py, sft_config.py) to eliminate an unnecessary dependency.
Changed _VALID_DICT_FIELDS initialization in all trainer config classes to use _BaseConfig._VALID_DICT_FIELDS instead of TrainingArguments._VALID_DICT_FIELDS, ensuring consistency and independence from external library internals.

Note

Low Risk
Low-risk refactor that only changes where _VALID_DICT_FIELDS is sourced for several config dataclasses; behavior should remain equivalent because _BaseConfig inherits from TrainingArguments, but any divergence in _BaseConfig._VALID_DICT_FIELDS could affect CLI argument parsing/serialization.
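
One way to guard against the divergence mentioned above is a small check that compares the two field lists. The sketch below is hypothetical: `diverging_fields` is a helper invented for illustration, and the classes are stand-ins with made-up field names rather than the real transformers or TRL classes.

```python
def diverging_fields(base_cls, upstream_cls):
    """Return field names that appear in only one of the two _VALID_DICT_FIELDS lists."""
    return set(base_cls._VALID_DICT_FIELDS) ^ set(upstream_cls._VALID_DICT_FIELDS)


# Stand-in for the upstream arguments class (illustrative field names).
class UpstreamArgs:
    _VALID_DICT_FIELDS = ["accelerator_config", "fsdp_config"]


# A base config that simply inherits the list: no divergence.
class InheritingBase(UpstreamArgs):
    pass


# A base config that overrides the list: diverges from upstream.
class OverridingBase(UpstreamArgs):
    _VALID_DICT_FIELDS = ["accelerator_config"]


print(diverging_fields(InheritingBase, UpstreamArgs))
print(diverging_fields(OverridingBase, UpstreamArgs))
```

An empty result means both classes accept the same dictionary-valued fields, which is the equivalence the risk note relies on.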

Overview
Reduces direct transformers coupling in trainer config modules by removing TrainingArguments imports from DPOConfig, GRPOConfig, RewardConfig, RLOOConfig, and SFTConfig.

Each config now builds _VALID_DICT_FIELDS from _BaseConfig._VALID_DICT_FIELDS (plus model_init_kwargs) instead of TrainingArguments._VALID_DICT_FIELDS, centralizing config/CLI field validation behind TRL’s base config.

Written by Cursor Bugbot for commit 7b59a70. This will update automatically on new commits.

HuggingFaceDocBuilderDev · 2026-03-10T06:52:31Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec

yes

Move TrainingArguments import from base trainers

7b59a70

albertvillanova changed the title ~~Refactor CLI [14/N] : Move TrainingArguments import from base trainers~~ Refactor CLI [14/N] : Remove TrainingArguments import from base trainers Mar 10, 2026

qgallouedec approved these changes Mar 10, 2026

View reviewed changes

albertvillanova changed the title ~~Refactor CLI [14/N] : Remove TrainingArguments import from base trainers~~ Refactor CLI [14/N] : Remove TrainingArguments import from core trainers Mar 10, 2026

albertvillanova merged commit 41602bc into huggingface:main Mar 10, 2026
12 checks passed

albertvillanova mentioned this pull request Mar 16, 2026

Remove TrainingArguments import from experimental trainers #5290

Merged

songhappy pushed a commit to songhappy/trl that referenced this pull request Apr 20, 2026

Refactor CLI [14/N] : Remove TrainingArguments import from core train…

53115f4

…ers (huggingface#5257)

albertvillanova mentioned this pull request Apr 28, 2026

qa: more lazy loading huggingface/transformers#45599

Draft

albertvillanova merged 1 commit into huggingface:main from albertvillanova:refactor-cli-14
