🐯 Integrate Liger GRPO Loss to GRPO Trainer#3184

Merged
kashif merged 20 commits into huggingface:main from shivam15s:shisahni/liger_grpo
Apr 3, 2025

@shivam15s shivam15s commented Mar 31, 2025

Contributor

What does this PR do?

Integrates LigerFusedLinearGRPOLoss into GRPOTrainer.

It currently works with DP and DDP. FSDP support will follow in a subsequent PR.
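
For readers who want to try the integration, here is a minimal sketch of how the Liger loss path might be requested. It assumes the switch is exposed as a `use_liger_loss` field on `GRPOConfig`, which this description does not state, and that both `trl` and `liger-kernel` are installed. The output directory and the helper name `build_liger_grpo_trainer` are hypothetical.

```python
# Sketch only: assumes the switch is GRPOConfig(use_liger_loss=True) and that
# trl and liger-kernel are installed. Neither detail is stated in this thread.

LIGER_GRPO_KWARGS = {
    "output_dir": "grpo-liger-demo",  # hypothetical output path
    "use_liger_loss": True,           # assumed flag name
}


def build_liger_grpo_trainer(model, reward_funcs, train_dataset):
    """Build a GRPOTrainer with the Liger loss path requested."""
    # Imported lazily so this module can be loaded without trl installed.
    from trl import GRPOConfig, GRPOTrainer

    config = GRPOConfig(**LIGER_GRPO_KWARGS)
    return GRPOTrainer(
        model=model,
        reward_funcs=reward_funcs,
        args=config,
        train_dataset=train_dataset,
    )
```

Since only DP and DDP are mentioned as working, a run launched under FSDP should not be expected to work with this setting yet.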

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@kashif kashif self-assigned this Mar 31, 2025
Comment thread trl/trainer/grpo_config.py Outdated
Comment thread trl/trainer/grpo_trainer.py
Comment thread trl/trainer/grpo_trainer.py Outdated
@shivam15s shivam15s marked this pull request as ready for review April 2, 2025 22:11
@shivam15s shivam15s changed the title from "Integrate Liger GRPO Loss to GRPO Trainer" to "🐯 Integrate Liger GRPO Loss to GRPO Trainer" Apr 2, 2025
@shivam15s

Contributor Author

liger-kernel v0.5.6 is out with the changes needed for GRPO, so we can officially test this integration!
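
One way to guard against an older install before enabling the integration is a version check like the sketch below. It uses only the standard library. The helper names are hypothetical, and the parser is deliberately simple: it reads leading integer components and ignores pre-release suffixes such as `rc1`.

```python
from importlib.metadata import PackageNotFoundError, version

MIN_LIGER_VERSION = (0, 5, 6)  # release named in the comment above


def parse_version(text):
    """Turn '0.5.6' or '0.5.6.post1' into a tuple of leading integer parts."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component, e.g. 'post1'
        parts.append(int(digits))
    return tuple(parts)


def liger_supports_grpo(installed=None):
    """Return True if liger-kernel is at MIN_LIGER_VERSION or newer.

    Pass a version string to check it directly; with no argument the
    installed distribution is looked up, and a missing package gives False.
    """
    if installed is None:
        try:
            installed = version("liger-kernel")
        except PackageNotFoundError:
            return False
    return parse_version(installed) >= MIN_LIGER_VERSION
```

Calling `liger_supports_grpo()` with no argument inspects the current environment, so it can gate the Liger path at startup.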

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec qgallouedec left a comment

Member

Nice!!

Comment thread tests/slow/test_grpo_slow.py Outdated
@kashif kashif merged commit 793735a into huggingface:main Apr 3, 2025
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025
Co-authored-by: Ubuntu <azureuser@liger-ci-h100-vm.kvghai4yzzmufguwws3040dwlf.dx.internal.cloudapp.net>
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

4 participants