FEA Callbacks base infrastructure + progress bars by jeremiedbb · Pull Request #27663 · scikit-learn/scikit-learn

jeremiedbb · 2023-10-25T16:49:56Z

Extracted from #22000

This PR implements a smaller portion of #22000, with only the base infrastructure for callbacks and the implementation of a single callback (progress bars). It targets the callbacks branch and the goal is to have the full callback implementation done in several smaller PRs merged in this branch before we merge it in main.

This PR proposes significant changes compared to #22000 that should imo improve a lot its change of getting merged 😄
The main improvement is that it no longer requires writing on the disk, but instead relies on multiprocessing.Managers and queues. It simplifies the code a lot.

In #22000 I adapted some estimators to work with the callbacks which I did not include here to keep the PR as light as possible. You can however experiment the callbacks on estimators that I wrote for testing purpose:

from sklearn.callback import ProgressBar
from sklearn.callback.tests._utils import Estimator, MetaEstimator
est = Estimator()
meta_est = MetaEstimator(est, n_jobs=2)
meta_est._set_callbacks(ProgressBar())
meta_est.fit(None, None)

You add sleep calls in these testing estimators to simulate how it changes when the computations take longer.

The plan is to then have several PRs to implement the other callbacks, adapt a few estimators to work with the callbacks, add some documentation and examples, add more tests...

adrinjalali

A very shallow review.

adrinjalali · 2024-01-31T11:44:44Z

sklearn/base.py

    params_set = new_object.get_params(deep=False)

+    # attach callbacks to the new estimator
+    if hasattr(estimator, "_callbacks"):


This can also break quite easily if the callback object keeps references to attributes of the old estimator. Why aren't we creating a copy here?

sklearn/base.py

adrinjalali · 2024-01-31T16:58:14Z

sklearn/base.py

+                try:
+                    return fit_method(estimator, *args, **kwargs)
+                finally:
+                    estimator._eval_callbacks_on_fit_end()


this is becoming larger than just a validation wrapper. We can simplify debugging and magic by having a BaseEstimator.fit which calls self._fit(...) and does all the common stuff before and after. That seems a lot better to understand and debug.

The motivation when I introduced _fit_context was not only for validation but to have a generic context manager to handle everything we need to do before and after fit. That's why I gave it this generic name.

Although having a consistent framework where we'd have a BaseEstimator.fit and every child estimator implements _fit is appealing, I think it goes far beyond the scope of this PR and requires rewriting a lot of estimators.

Btw do you why BaseEstimator does not implement fit in the first place ?

Also, note that _fit_context also handles partial_fit, but I don't think we want BaseEstimator to implement partial_fit

BaseEstimator doesn't implement fit cause we don't generally have methods which raise NotImplementedError. They're simply not there. But now that we have all this work, we can certainly have it in BaseEstimator, and children only implement a __sklearn_fit__ kind of method instead.

I still think it's outside the scope of this PR. Using the existing context manager is just 1 line addition whereas implementing __sklearn_fit__ means countless PRs :)

sklearn/utils/__init__.py

rth · 2024-03-05T18:48:41Z

+1 to merge this. I personally find that it would be more valuable to mark it as experimental (it's private anyway so far) let the users use this for a version or two, aggregate their feedback and iterate in future details if needed.

Overall the API sounds reasonable to me. Rather than to approve a SLEP on this, only then to realize that users would have preferred something else, or that there are some weird edge cases for some estimator that need special handling.

jeremiedbb · 2024-03-05T22:20:20Z

Note that this PRis targeting the callbacks branch, not main so merging this is not a big commitment anyway 😄
And it would allow me to implement the rest that I did not include in this PR to keep it as small as possible.

I personally find that it would be more valuable to mark it as experimental

I agree and we already discussed that with @glemaitre. I plan to do that in a follow up PR.

jeremiedbb · 2024-03-05T22:20:41Z

I still need to figure out the issue with the CI though
EDIT: good now

Scot-Survivor · 2024-06-28T15:52:44Z

Any news on this being merged?

glemaitre · 2024-06-28T15:54:23Z

Any news on this being merged?

We currently working on the design and need an agreement among core developers.

ignaceHelsen · 2024-10-31T12:48:04Z

Any news on this? It would be amazing would this be added.

GaelVaroquaux · 2025-12-11T20:29:50Z

I think that the relevant PR is now #28760

ogrisel · 2026-01-21T14:20:07Z

@jeremiedbb @FrancoisPgm shall we close this?

jeremiedbb · 2026-01-21T15:00:07Z

I was just keeping it open to compare 2 alternatives but I guess this one is completely outdated now. Let's close

jeremiedbb added 30 commits December 16, 2021 20:08

callback API

272e75f

cln nmf and test reconstruction attributes

584bdf7

cln snapshot + test snapshot + uuid for computation tree

bb32ff3

cln

7a1825d

black

3e3b25f

lint

26dbb69

wip

eb7b824

Merge branch 'master' into callback-api

9b913fd

class

f78442e

more tests

34bab15

cln

596a58e

wip

4f9363c

Merge remote-tracking branch 'upstream/main' into callback-api

030f68b

wip

35c5284

wip

115e184

wip

bdb4990

Merge remote-tracking branch 'upstream/main' into callback-api

d1bb5eb

wip

7a43c30

Merge remote-tracking branch 'upstream/main' into callback-api

573fd5d

wip

a218068

update poor_score

f794694

Merge remote-tracking branch 'upstream/main' into pr/jeremiedbb/22000

ab74f19

wip

37e569b

wip

d7208fa

Merge remote-tracking branch 'upstream/main' into pr/jeremiedbb/22000

774ff69

cln

b8ac1a5

Merge remote-tracking branch 'upstream/main' into pr/jeremiedbb/22000

e544cc4

wip

b644430

wip

3ab3d7f

wip

39c04cc

adrinjalali reviewed Jan 31, 2024

View reviewed changes

jeremiedbb added 11 commits February 9, 2024 16:32

Merge branch 'callbacks' into base

e13516d

mixin for callback propagation

a0667c4

rename _skl_callbacks

2fdbda3

clone callbacks

aea9af7

some renaming and cleanup

44b615a

Merge branch 'callbacks' into base

fabe932

Merge branch 'callbacks' into base (continued)

07a6875

Merge branch 'callbacks' into base

02ecb2e

fix imports

6433ba3

Merge remote-tracking branch 'upstream/callbacks' into base

052f9d2

update lock files

268d5cf

jeremiedbb mentioned this pull request Mar 1, 2024

SLEP023: Callback API scikit-learn/enhancement_proposals#90

Merged

jeremiedbb added 5 commits March 6, 2024 13:41

Merge remote-tracking branch 'upstream/callbacks' into base

2381645

debug ci

d392b63

iter

9177757

iter

436bcad

iter

5bf6608

jeremiedbb mentioned this pull request Apr 3, 2024

FEA Callbacks base infrastructure + progress bars #28760

Merged

jeremiedbb closed this Jan 21, 2026

github-project-automation bot moved this from Being dropped to Done in Callbacks Jan 21, 2026

Uh oh!

Conversation

jeremiedbb commented Oct 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

adrinjalali Jan 31, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

adrinjalali Jan 31, 2024

Choose a reason for hiding this comment

Uh oh!

jeremiedbb Feb 9, 2024

Choose a reason for hiding this comment

Uh oh!

jeremiedbb Feb 9, 2024

Choose a reason for hiding this comment

Uh oh!

adrinjalali Feb 13, 2024

Choose a reason for hiding this comment

Uh oh!

jeremiedbb Feb 19, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rth commented Mar 5, 2024

Uh oh!

jeremiedbb commented Mar 5, 2024

Uh oh!

jeremiedbb commented Mar 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Scot-Survivor commented Jun 28, 2024

Uh oh!

glemaitre commented Jun 28, 2024

Uh oh!

ignaceHelsen commented Oct 31, 2024

Uh oh!

GaelVaroquaux commented Dec 11, 2025

Uh oh!

ogrisel commented Jan 21, 2026

Uh oh!

jeremiedbb commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

jeremiedbb commented Oct 25, 2023 •

edited

Loading

jeremiedbb commented Mar 5, 2024 •

edited

Loading