🐯 [Liger] add native liger-kernel ORPO loss by kashif · Pull Request #2482 · huggingface/trl

kashif · 2024-12-15T12:55:12Z

What does this PR do?

Adds support for Liger ORPO loss kernel to the ORPO Trainer natively.

HuggingFaceDocBuilderDev · 2024-12-15T12:58:48Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2024-12-15T15:34:21Z

2 questions/remarks:

can you run benchmark so that we can (1) quantify the improvement and (2) check that results with and without liger are the same
we could have an additional tag for the hub when a model is trained with liger

qgallouedec · 2024-12-15T15:48:46Z

I think we should bump liger version to v0.5 (it doesn't include the loss before), see https://github.com/linkedin/Liger-Kernel/releases/tag/v0.5.0

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

…d an error

SumanthRH

Thanks for this PR! I stumbled upon this and wanted to highlight an important point (maybe this change is in progress already, in which case, great!)

SumanthRH · 2024-12-17T10:47:26Z

-            loss = loss_fct(logits, labels)
-            return loss
+        if self.args.use_liger_loss:
+            # skip the lm head and get the last hidden state


Nice!

I guess we don't have much of an option beyond using a config parameter for now.

Given that we run forward pass on a submodule, it would be very nice to have some validation so that there are no unexpected failures etc with different distributed training settings. But in this case, I feel there might be compatibility issues with FSDP given the limitation from the docs: https://pytorch.org/docs/stable/fsdp.html

"FSDP does not support running the forward pass of a submodule that is contained in an FSDP instance. This is because the submodule’s parameters will be sharded, but the submodule itself is not an FSDP instance, so its forward pass will not all-gather the full parameters appropriately."

(might be fixed by just making the base model attribute an FSDP instance as well, coz why not)

Beyond that this looks fine! I have a couple nits (don't matter that much):

Does model.get_decoder() work all the time btw for AutoModelForCausalLM instances? Was wondering if that's a cleaner solution for getting the base model attribute. But I think some base model classes have some further wrapping over the actual decoder (to format outputs, etc) https://github.com/huggingface/transformers/blob/a7f5479b45a8040392af80bf1107a2bdd796931c/src/transformers/models/opt/modeling_opt.py#L1044

Maybe the config is base_model_attribute_name since its the attribute name of the base model in the CausalLM object?

thanks @SumanthRH yes you are right get_decoder() will work

Yes next is to verify the distributed training cases

kashif · 2024-12-18T10:46:56Z

waiting on linkedin/Liger-Kernel#486

kashif · 2024-12-19T10:09:55Z

waiting on #2502

qgallouedec · 2024-12-19T10:33:44Z

@kashif can you share the curves once it's ready?

kashif · 2024-12-29T14:45:28Z

tests fail as they need: linkedin/Liger-Kernel#503

qgallouedec

lgtm!

shimizust · 2025-05-22T00:00:25Z

Hi @kashif @qgallouedec thanks for the effort here! I was wondering what is the status? Can this be merged? Looks like one test failed, but not sure if it was a real issue

qgallouedec · 2026-05-04T22:30:37Z

closing as stale

add native liger-kernl orpo loss

b480fff

kashif requested a review from qgallouedec December 15, 2024 12:55

qgallouedec reviewed Dec 15, 2024

View reviewed changes

Comment thread tests/test_orpo_trainer.py Outdated

qgallouedec reviewed Dec 15, 2024

View reviewed changes

Comment thread trl/trainer/orpo_trainer.py

kashif and others added 2 commits December 15, 2024 16:52

Update tests/test_orpo_trainer.py

44aa20c

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

passing self.args.use_liger_loss without liger installed should raise…

7682e31

…d an error

qgallouedec reviewed Dec 15, 2024

View reviewed changes

Comment thread trl/trainer/orpo_trainer.py Outdated

qgallouedec reviewed Dec 15, 2024

View reviewed changes

Comment thread trl/trainer/orpo_trainer.py

kashif added 2 commits December 15, 2024 17:46

update liger version

c383bf6

make import more readable

220f754

kashif changed the title ~~[Liger] add native liger-kernl orpo loss~~ [Liger] add native liger-kernel orpo loss Dec 15, 2024

SumanthRH reviewed Dec 16, 2024

View reviewed changes

Comment thread trl/trainer/orpo_trainer.py Outdated

skip the lm_head when use_liger_loss is true

b3f3270

SumanthRH reviewed Dec 17, 2024

View reviewed changes

use get_decoder()

afaf5a8

qgallouedec mentioned this pull request Dec 17, 2024

[Tracking issue] Integrate native liger-kernel losses #2495

Closed

7 tasks

make it a bit more robust

5776a4e

austin362667 reviewed Dec 17, 2024

View reviewed changes

Comment thread tests/test_orpo_trainer.py Outdated

Merge branch 'main' into liger-orpo

aa3c3b7

kashif added 5 commits December 19, 2024 11:35

Merge branch 'main' into liger-orpo

6f7918f

add back missing line

568e21a

pass is_enc_dec

f4979b0

call orpo_loss_fn with shifted inputs

5c6744f

Merge branch 'main' into liger-orpo

5fae1b2

kashif added 3 commits December 28, 2024 16:41

add back the orpo nll labels

e1918b7

call with nll_target

5ee37a6

Merge branch 'main' into liger-orpo

4861e8f

fix enc-dec

f6ffbf6

kashif commented Feb 13, 2025

View reviewed changes

Comment thread setup.py Outdated

kashif added 5 commits February 13, 2025 14:11

Update setup.py

ac2328b

Merge branch 'main' into liger-orpo

e3b4731

use fields

851ff26

undo change

9c317a5

undo change

f2ed765

kashif requested a review from qgallouedec February 20, 2025 12:23

qgallouedec approved these changes Feb 20, 2025

View reviewed changes

Merge branch 'main' into liger-orpo

f0eb1af

qgallouedec changed the title ~~[Liger] add native liger-kernel orpo loss~~ 🐯 [Liger] add native liger-kernel ORPO loss Aug 20, 2025

Merge branch 'main' into liger-orpo

83f7560

qgallouedec closed this May 4, 2026

albertvillanova added the 😴 stale No update from the author, will be closed soon label Jun 3, 2026

Conversation

kashif commented Dec 15, 2024

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Dec 15, 2024

Uh oh!

qgallouedec commented Dec 15, 2024

Uh oh!

Uh oh!

Uh oh!

qgallouedec commented Dec 15, 2024

Uh oh!

Uh oh!

Uh oh!

SumanthRH left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

SumanthRH Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

kashif Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kashif commented Dec 18, 2024

Uh oh!

kashif commented Dec 19, 2024

Uh oh!

qgallouedec commented Dec 19, 2024

Uh oh!

kashif commented Dec 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

shimizust commented May 22, 2025

Uh oh!

qgallouedec commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

kashif commented Dec 29, 2024 •

edited

Loading