Conversation
Codecov Report
@@            Coverage Diff            @@
##            master    #6197    +/-  ##
========================================
- Coverage       93%      93%      -0%
========================================
  Files          159      159
  Lines        11378    11375       -3
========================================
- Hits         10623    10591      -32
- Misses         755      784      +29
def on_train_end(self) -> None:
    assert self.trainer.current_epoch == self.expected_end_epoch, 'Early Stopping Failed'
I would drop this and instead check in the test that the trainer epoch is as expected, so there is no random interference.
That's what I originally did, but because of how DDP Spawn works, the local trainer's current epoch doesn't seem to be kept in sync, which is fair (since it's only kept in sync during training on the spawned processes). This is why I had to move the check into on_train_end, which runs within the spawn process!
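A minimal sketch of why the check had to move: state mutated inside a worker process is never written back to the parent's copy of the object. The `Trainer` class here is a hypothetical stand-in, and the sketch uses the "fork" start method to stay self-contained (DDP Spawn uses "spawn", where the same isolation applies).

```python
import multiprocessing as mp


class Trainer:
    """Hypothetical stand-in for a trainer object shared with a worker."""

    def __init__(self):
        self.current_epoch = 0


def run_training(trainer, queue):
    # Runs in the child process: this mutation only affects the child's copy.
    trainer.current_epoch = 5
    queue.put(trainer.current_epoch)


ctx = mp.get_context("fork")
trainer = Trainer()
queue = ctx.SimpleQueue()
proc = ctx.Process(target=run_training, args=(trainer, queue))
proc.start()
child_epoch = queue.get()  # 5 -- the child saw the update
proc.join()

# The parent's copy was never synced back, so any assertion on
# trainer.current_epoch made here (outside the worker) would see 0.
print(child_epoch, trainer.current_epoch)
```

This is why an epoch assertion has to live in a hook that executes inside the spawned process, rather than in the test body in the parent.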
We could, but I'd need to separate out the tests. I don't think it's really worth it because it would cause a lot of duplication.
* Fix for multiple callbacks
* Add CHANGELOG.md
* Remove old params
* Skip tests on windows using ddp
* Change name of the variable to not clash with should stop, which is separate
* Apply suggestions from code review
* Fix params

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
What does this PR do?
Fixes #6194
We recently modified the behaviour of the early stopping callback in the accelerator refactor, which led to the bug mentioned above. The callback defaulted the stop flag to False, overwriting a True that other callbacks could already have set.
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:
Did you have fun?
Make sure you had fun coding 🙃