FIX Forward sample weight to the scorer in grid search by antoinebaker · Pull Request #30743 · scikit-learn/scikit-learn

antoinebaker · 2025-01-31T10:58:15Z

Reference Issues/PRs

Part of meta-issue #16298.

What does this implement/fix? Explain your changes.

*SearchCV metaestimators currently do not forward sample_weight to the scorer, as a result they can fail the sample_weight equivalence check even if the underlying subestimator and scorer handle sample_weight correctly.
This PR forwards sample_weight to the scorer when fitting with sample_weight, and adds a more stringent sample_weight equivalence test by checking all scores stored in cv_results_.

…weight

github-actions · 2025-01-31T10:59:30Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: c2a8290. Link to the linter CI: here}

OmarManzoor

LGTM. Thanks @antoinebaker

sklearn/model_selection/tests/test_search.py

Co-authored-by: Omar Salman <omar.salman2007@gmail.com>

ogrisel

Thanks for the PR. Besides the decision not to route to scorer that do not accept weights, this looks good to me, and it's nice to see that it fixes the failure of the RidgeCV common test as expected.

Note that I think that it's important to fix this when metadata routing is disabled to make it easier to test that we get the same behavior when routing is enabled or disabled once the default routing policy for weights is implemented.

sklearn/model_selection/_search.py

ogrisel · 2025-02-21T14:11:51Z

Note that check_sample_weight_equivalence_* are not run on GridSearchCV and the like, since has_fit_parameter(estimator, "sample_weight") returns False for those.

It's a bit unfortunate because this is a false negative, but I don't see an easy way around this. One option would be to introduce a dedicated accept_sample_weight estimator tag, but it would need to be set dynamically for meta-estimators.

Maybe we can re-explore this once we have made progress on implementing a default routing policy when metadata routing is enabled, and rely on routing inspection instead.

adrinjalali

This LGTM. Note that ideally you'd want to send the sample_weight to all scorers in a multimetric scorer which support it, not only when all of them support it.

But I'm happy either way, since it's already an improvement.

doc/whats_new/upcoming_changes/sklearn.model_selection/30743.fix.rst

sklearn/metrics/_scorer.py

sklearn/model_selection/_search.py

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

antoinebaker · 2025-02-27T14:26:29Z

This LGTM. Note that ideally you'd want to send the sample_weight to all scorers in a multimetric scorer which support it, not only when all of them support it.

But I'm happy either way, since it's already an improvement.

@adrinjalali WDYT about #30743 (comment) ? It could solve this issue, but on the downside it seems like an antipattern. Whenever we call a multiscorer, we will need to remember to format the kwargs in some way depending on the metadata routing config.

Actually thinking about it, we could also do the following: when metadata routing config is disabled, if kwargs is keyed by the scorer names then use that, if not use kwargs in the old way.

adrinjalali

The change in this review, plus this diff, is the kinda thing we could do. I'm impartial on whether we wanna do it or not, I'm happy either way:

diff --git a/sklearn/metrics/_scorer.py b/sklearn/metrics/_scorer.py
index 3990389218..b03bb482c3 100644
--- a/sklearn/metrics/_scorer.py
+++ b/sklearn/metrics/_scorer.py
@@ -130,9 +130,22 @@ class _MultimetricScorer:
             routed_params = process_routing(self, "score", **kwargs)
         else:
             # they all get the same args, and they all get them all
+            # except sample_weight. Only the ones having `sample_weight` in their
+            # signature will receive it.
+            # This does not work for metadata other than sample_weight, and for those
+            # users have to enable metadata routing.
+            common_kwargs = {
+                arg: value
+                for arg, value in kwargs.items()
+                if arg != "sample_weight"
+            }
             routed_params = Bunch(
-                **{name: Bunch(score=kwargs) for name in self._scorers}
+                **{name: Bunch(score=common_kwargs) for name in self._scorers}
             )
+            if "sample_weight" in kwargs:
+                for scorer in routed_params.values():
+                    if scorer._accept_sample_weight():
+                        scorer.score["sample_weight"] = kwargs["sample_weight"]
 
         for name, scorer in self._scorers.items():
             try:

sklearn/metrics/_scorer.py

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

antoinebaker · 2025-02-28T10:28:37Z

For the multiscorer case, I followed @adrinjalali suggestion #30743 (review), when calling the multiscorer the passed kwargs stayed as before (for example containing sample_weight), it's up to the multiscorer to format the routed_params appropriately and in particular to forward the sample_weight individually to each scorer.

ogrisel

LGTM as well.

ogrisel · 2025-03-03T17:14:34Z

sklearn/model_selection/_search.py

+                        f"The scoring {name}={scorer} does not support sample_weight, "
+                        "which may lead to statistically incorrect results when "
+                        f"fitting {self} with sample_weight. "
+                    )


Note: I think we should find a way to issue a similar warning when metadata routing is enabled. But I think we can keep the scope of this PR to the case where it is disabled and implement a solution in a subsequent, once the default routing policy is implemented.

Note that we have a similar problem with the warning raised by CalibratedClassifierCV.

antoinebaker added 4 commits January 15, 2025 15:13

cross ref scoring choice

727618c

Merge remote-tracking branch 'upstream/main' into grid_search_sample_…

4c315ff

…weight

Merge remote-tracking branch 'upstream/main' into grid_search_sample_…

72cca0c

…weight

forward sample weight to scorer

c711fda

github-actions bot added the module:model_selection label Jan 31, 2025

antoinebaker changed the title ~~FIX Forward sample weight to the scorer in gird search~~ FIX Forward sample weight to the scorer in grid search Jan 31, 2025

antoinebaker added 3 commits February 4, 2025 17:05

changelog

2f92a43

enforce y tags

b3bf43f

clean xfail

0e95af4

antoinebaker marked this pull request as ready for review February 5, 2025 16:14

OmarManzoor reviewed Feb 10, 2025

View reviewed changes

sklearn/model_selection/tests/test_search.py Outdated Show resolved Hide resolved

sklearn/model_selection/tests/test_search.py Outdated Show resolved Hide resolved

Apply suggestions from code review

3f051dd

Co-authored-by: Omar Salman <omar.salman2007@gmail.com>

OmarManzoor approved these changes Feb 10, 2025

View reviewed changes

OmarManzoor added the Waiting for Second Reviewer First reviewer is done, need a second one! label Feb 10, 2025

ogrisel reviewed Feb 21, 2025

View reviewed changes

sklearn/model_selection/_search.py Outdated Show resolved Hide resolved

antoinebaker added 4 commits February 25, 2025 09:31

simplify

ccb451f

inspect sample_weight support

40cacc1

fix docstring

02e42cd

warns user instead

f90d446

adrinjalali approved these changes Feb 27, 2025

View reviewed changes

Apply suggestions from code review

197baa6

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

adrinjalali reviewed Feb 27, 2025

View reviewed changes

sklearn/metrics/_scorer.py Outdated Show resolved Hide resolved

antoinebaker and others added 2 commits February 28, 2025 11:06

Update sklearn/metrics/_scorer.py

efe216b

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

fwd sample_weight to each scorer

5569934

antoinebaker added 2 commits February 28, 2025 11:31

typo

1e080ac

fix multimetric test

9ddddb8

Merge branch 'main' into grid_search_sample_weight

c2a8290

adrinjalali approved these changes Mar 3, 2025

View reviewed changes

ogrisel approved these changes Mar 3, 2025

View reviewed changes

ogrisel merged commit 7b09f95 into scikit-learn:main Mar 3, 2025
33 checks passed

ogrisel mentioned this pull request Mar 27, 2025

List of estimators with known incorrect handling of sample_weight #16298

Open

54 tasks

lesteve mentioned this pull request Jul 31, 2025

_MultimetricScorer deals with _accept_sample_weights inconsistently #31599

Open

Uh oh!

Conversation

antoinebaker commented Jan 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Uh oh!

github-actions bot commented Jan 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

OmarManzoor left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ogrisel commented Feb 21, 2025

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

antoinebaker commented Feb 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

antoinebaker commented Feb 28, 2025

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

ogrisel Mar 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

antoinebaker commented Jan 31, 2025 •

edited

Loading

github-actions bot commented Jan 31, 2025 •

edited

Loading

antoinebaker commented Feb 27, 2025 •

edited

Loading