-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Road to v1 #4374
Copy link
Copy link
Closed
24 / 2624 of 26 issues completedLabels
👶 good first issueGood for newcomersGood for newcomers🧒 good second issueGood for contributors with basic project familiarityGood for contributors with basic project familiarity
Metadata
Metadata
Assignees
Labels
👶 good first issueGood for newcomersGood for newcomers🧒 good second issueGood for contributors with basic project familiarityGood for contributors with basic project familiarity
Type
Fields
Give feedbackNo fields configured for issues without a type.
The purpose of this issue is to list the tasks that need to be completed before we reach v1. This list is evolving and is modified based on recent discussions and progress.
Documentation
how_to_train.mdRemove how_to_train.md: outdated training FAQ #4267using_llama_models.mdRemove using_llama_models.md: outdated Llama2-specific documentation #4268logging.mdRemove logging.md: trainer-specific metrics documentation #4269peft_integration.md#4376conceptual_guides/experimentalinto its own section inexperimental#4377Examples
Tests
test/slowto their respective trainer test files #4401Main codebase
trl.rewardsmodule Add accuracy reward #4270RichProgressCallbackin scripts (trl.scripts).log_example_reports.pyRemove unused log_example_reports.py script #4241commandsdirectory Remove unused commands directory #4258examples/research_projectsRemove unused commands directory #4258trl.extra.dataset_formattingDeprecate unused dataset_formatting module #4242 Remove deprecated functions and parameters #4651BestOfNSampler. DeprecateBestOfNSampler#4291 ⚰️ Remove deprecated #4301flash-attntokernels#4380masked_mean,masked_varandmasked_whitentoppo.py#4403Async generation for Online methodsBump transformers to v5we will keep v4 compatibilityMoving experimental features to experimental submodule
Discussed in #4223 for trainers
trl.experimental#4312trl.experimental#4400WinRateCallbackto experimental #4558