[WIP] Add patience argument to run_language_modeling script#2840

Closed
thesamuel wants to merge 0 commits into huggingface:master from thesamuel:samgehman/add-patience-to-run-language-modeling
Conversation

@thesamuel thesamuel commented Feb 13, 2020

Summary

Often, we want to stop training if loss does not improve for a number of epochs. This PR adds a "patience" argument, which is a limit on the number of times we can get a non-improving eval loss before stopping training early.

It is implemented by other NLP frameworks, such as AllenNLP (see trainer.py and metric_tracker.py).

Motivation

This feature allows faster fine-tuning by breaking out of the training loop early and spares users the toil of checking metrics on TensorBoard.

Caveats

Often, models are evaluated once per epoch, but run_lm_finetuning.py has an option to evaluate after a set number of model update steps (dictated by --logging_steps if --evaluate_during_training is true). Because of this, I've elected to tie patience to the number of evaluations without improvement in loss.
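The mechanism described above can be sketched as a small tracker. This is a hypothetical illustration of the idea, not the actual PR code: the class name `PatienceTracker` and its methods are invented for this example, and training stops after `patience` consecutive evaluations with no improvement in eval loss.

```python
class PatienceTracker:
    """Tracks consecutive evaluations without an improvement in eval loss.

    Hypothetical sketch of the patience mechanism described in this PR;
    names and structure are illustrative, not the actual implementation.
    """

    def __init__(self, patience: int):
        self.patience = patience
        self.best_loss = float("inf")
        self.bad_evals = 0  # consecutive non-improving evaluations

    def should_stop(self, eval_loss: float) -> bool:
        """Record one evaluation; return True when training should stop."""
        if eval_loss < self.best_loss:
            self.best_loss = eval_loss
            self.bad_evals = 0
        else:
            self.bad_evals += 1
        return self.bad_evals >= self.patience
```

In the training loop this would be consulted at each evaluation point (every `--logging_steps` steps when `--evaluate_during_training` is set), breaking out of the loop as soon as `should_stop` returns True.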

To-do

  • Add tests
  • Fix long lines

@LysandreJik (Member) left a comment
This is really cool!

@LysandreJik LysandreJik requested a review from julien-c February 18, 2020 21:13
@julien-c (Member) left a comment

I feel like at this point, this should be inside a Trainer class similar to what @jplu is doing for TF (shared between scripts) (and similar to what @srush is doing w/ Lightning), but in the meantime, LGTM. Thanks!


thesamuel commented Feb 21, 2020

Sounds great! I'll go ahead and fix the code quality check.

@thesamuel thesamuel force-pushed the samgehman/add-patience-to-run-language-modeling branch from 8cc8568 to a5989b0 Compare March 18, 2020 19:49
@thesamuel thesamuel closed this May 6, 2020
@thesamuel thesamuel force-pushed the samgehman/add-patience-to-run-language-modeling branch from b51db35 to 877fc56 Compare May 6, 2020 21:13
@thesamuel thesamuel deleted the samgehman/add-patience-to-run-language-modeling branch May 6, 2020 21:14

thesamuel commented May 6, 2020

Since run_language_modeling.py now uses the Trainer class, I'll likely create a new PR that adds patience to Trainer.


3 participants