Support for SFTTrainer / PPOTrainer / DPOTrainer with LLaVA-like models #1784

@fangkuoyu

TRL's SFTTrainer supports LLaVA (Large Language and Vision Assistant), as described in "Vision Language Models Explained".

Is there any plan to release PPOTrainer and DPOTrainer for LLaVA? If not, could someone explain the concerns about implementing those trainers or suggest any alternatives? Thanks!

Labels: 👁️ VLM (Related to Visual Language Models)