Skip to content

Rewrite peft_integration.md #4376

@qgallouedec

Description

@qgallouedec

This section of the documentation is widely outdated and rely only on PPO.

Ideally, we should have a clear documentation that shows how to use peft with SFT, DPO and GRPO at least, via the peft_config argument. We could have additional subsection about QLoRA and prompt-tuning.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions