fix chosen_nll_loss in chunked losses by kashif · Pull Request #486 · linkedin/Liger-Kernel

kashif · 2024-12-17T17:35:13Z

Summary

Fix the nll loss in the the chunked loses when the model is a decoder only model, by shifting the logits and targets

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

shivam15s

Great catch, and thanks for the quick PR! With the TRL fixes we discussed today, we should be able to closely match the loss curve obtained from TRL Trainer

shivam15s · 2024-12-17T23:12:53Z

            compute_nll_loss=compute_nll_loss,
+            is_encoder_decoder=is_encoder_decoder,
        )
        chosen_nll_loss = (


I believe we also have to fix how we do normalization. My hunch is that's the reason for failing tests

shivam15s · 2024-12-17T23:14:44Z

+            if not is_encoder_decoder:
+                shifted_logits = log_probs_chunk[:len_chosen_chunk, :-1].contiguous()
+                shifted_target = target_chunk[:len_chosen_chunk, 1:].contiguous()
+            else:
+                shifted_logits = log_probs_chunk[:len_chosen_chunk].contiguous()
+                shifted_target = target_chunk[:len_chosen_chunk].contiguous()
+


The shifted logits/target should also be used to calculate chosen/rejected logps and in general for everything that follows to compute the loss.

This reverts commit 61eefe9.

This reverts commit 61eefe9. ## Summary   ## Testing Done   - Hardware Type: <BLANK> - [ ] run `make test` to ensure correctness - [ ] run `make checkstyle` to ensure code style - [ ] run `make test-convergence` to ensure convergence

kashif added 4 commits December 17, 2024 18:34

fix chosen_nll_loss in chunked loses

d8e457b

remove unused

8273696

fix get_batch_logps

5d60f34

formatting

f892e35

kashif mentioned this pull request Dec 18, 2024

🐯 [Liger] add native liger-kernel ORPO loss huggingface/trl#2482

Closed

kashif added 4 commits December 18, 2024 13:29

simplify chunk_forward

de267ab

add back float() and normalize correctly

5e1a092

fix tests

a688826

assume ref model is the same family as model

a7ce258

kashif changed the title ~~fix chosen_nll_loss in chunked loses~~ fix chosen_nll_loss in chunked losses Dec 18, 2024

undo change

09a3918

shivam15s approved these changes Dec 18, 2024

View reviewed changes

shivam15s merged commit 61eefe9 into linkedin:main Dec 18, 2024

shivam15s added a commit that referenced this pull request Dec 19, 2024

Revert "fix chosen_nll_loss in chunked losses (#486)"

79b64ff

This reverts commit 61eefe9.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix chosen_nll_loss in chunked losses#486

fix chosen_nll_loss in chunked losses#486
shivam15s merged 9 commits into
linkedin:mainfrom
kashif:fix-orpo-nll

kashif commented Dec 17, 2024 •

edited

Loading

Uh oh!

shivam15s left a comment

Uh oh!

shivam15s Dec 17, 2024

Uh oh!

shivam15s Dec 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kashif commented Dec 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

shivam15s left a comment

Choose a reason for hiding this comment

Uh oh!

shivam15s Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

shivam15s Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kashif commented Dec 17, 2024 •

edited

Loading