Skip to content

Does unsloth support/plan to support RLOOTrainer? #663

@asmith26

Description

@asmith26

Hi,

I recently came across this really interesting blog on Putting RL back in RLHF.

It looks like unsloth supports many of the huggingface Trainer APIs, just wondering if it supports/plans to support this new RLOOTrainer? (Possibly related: #392)

Many thanks for any help, and this amazing lib!!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions