[MRG+1] Improve the error message for some metrics when the shape of sample_weight is inappropriate by qinhanmin2014 · Pull Request #9903 · scikit-learn/scikit-learn

qinhanmin2014 · 2017-10-11T03:06:49Z

Reference Issue

proposed by @lesteve in #9786 (comment).

in this PR we spotted a place where check_consistent_lengths(X, y) was used where check_consistent_lengths(X, y, sample_weight) should have called it would be good to double-check that this error is not present in some other places in our codebase.

Fixes #9870

What does this implement/fix? Explain your changes.

Currently, many metrics do not explicitly check the shape of sample_weight. Instead, they rely on certain statement to block the code from running through, this may cause:
(1)Users can't get meaningful error message (e.g., now you may get Axis must be specified when shapes of a and weights differ or even operands could not be broadcast together with shapes (2,1) (3,1))
(2)Sometimes all the statements fail to block the code and you even can't get an erorr (e.g., roc_auc_score previously)
The PR fixes the problem and improves the common test to ensure that meaningful error message is raised by all metrics with sample_weight.

Any other comments?

cc @jnothman @lesteve

lesteve · 2017-10-11T07:23:54Z

That's pretty nice you did it via the common tests, well done! LGTM.

A small tip for PEP8, you should spend some time configuring your editor to have on-the-fly flake8 checks, this saves some time in the long run!

Another small tip you can link to a given comment in an issue e.g. #9786 (comment) (I edited your message accordingly). This makes it a lot easier to find the comment you are referring to.

Note what I had originally in mind was something a bit more generic than just doing it for metrics i.e. checking that each time a function signature had sample_weight, there was a check_consistent_length(..., ..., sample_weight) call within its body. Not sure how easy this though.

qinhanmin2014 · 2017-10-11T07:37:19Z

@lesteve Thanks a lot for the detailed instruction :)

Note what I had originally in mind was something a bit more generic than just doing it for metrics i.e. checking that each time a function signature had sample_weight, there was a check_consistent_length(..., ..., sample_weight) call within its body. Not sure how easy this though.

I think at least it won't be difficult if we go through all the public functions. I'll do this in a couple of days and open PR/issue if necessary.

TomDLT · 2017-10-11T09:17:59Z

That's pretty nice you did it via the common tests, well done! LGTM.

Indeed ! Thanks @qinhanmin2014

…sample_weight is inappropriate (#9903)

qinhanmin2014 · 2017-10-15T08:32:56Z

ping @lesteve I tried my best to go through all the public methods (still hard to guarantee the completeness).

Note what I had originally in mind was something a bit more generic than just doing it for metrics i.e. checking that each time a function signature had sample_weight, there was a check_consistent_length(..., ..., sample_weight) call within its body.

We do not always use check_consistent_length to check the shape of sample_weight. Sometimes, we use an if statement, it enables us to raise even more informative error message than check_sample_weight, e.g., Shapes of X and sample_weight do not match.

Overall, I do not find any public function that can run through with inappropriate shape of sample_weight, but there are indeed some functions which do not check the shape of sample_weight, thus resulting in unclear error message. These include

DummyClassifier
KernelRidge
LinearRegression, RANSACRegressor, Ridge, RidgeClassifier
MultinomialNB, ComplementNB, BernoulliNB

Below are the modules I go through which seems related to sample_weight. Some modules, e.g., sklearn.ensemble, seems to pass sample_weight directly to the underlying estimators, thus are not included.

sklearn.cluster, sklearn.dummy, sklearn.isotonic
sklearn.kernel_ridge, sklearn.linear_mode, sklearn.naive_bayes
sklearn.svm, sklearn.tree

WDYT? Does it worth a fix? Thanks :)

Update: I have opened #9926 for further discussion.

…sample_weight is inappropriate (scikit-learn#9903)

qinhanmin2014 added 2 commits October 11, 2017 10:49

better error message

b3c0285

prp8 fix

e7f2529

lesteve changed the title ~~[MRG] Improve the error message for some metrics when the shape of sample_weight is inappropriate~~ [MRG+1] Improve the error message for some metrics when the shape of sample_weight is inappropriate Oct 11, 2017

TomDLT pushed a commit that referenced this pull request Oct 11, 2017

[MRG+1] Improve the error message for some metrics when the shape of …

6e75058

…sample_weight is inappropriate (#9903)

TomDLT merged commit 6e75058 into scikit-learn:master Oct 11, 2017

qinhanmin2014 deleted the metric_err_message branch October 11, 2017 09:20

qinhanmin2014 mentioned this pull request Oct 15, 2017

Ensure that the shape of sample_weight is checked in all the functions #9926

Closed

maskani-moh pushed a commit to maskani-moh/scikit-learn that referenced this pull request Nov 15, 2017

[MRG+1] Improve the error message for some metrics when the shape of …

68c3876

…sample_weight is inappropriate (scikit-learn#9903)

jwjohnson314 pushed a commit to jwjohnson314/scikit-learn that referenced this pull request Dec 18, 2017

[MRG+1] Improve the error message for some metrics when the shape of …

28bd2cf

…sample_weight is inappropriate (scikit-learn#9903)

lucyleeow mentioned this pull request Feb 8, 2025

Remove median_absolute_error from METRICS_WITHOUT_SAMPLE_WEIGHT #30787

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MRG+1] Improve the error message for some metrics when the shape of sample_weight is inappropriate#9903

[MRG+1] Improve the error message for some metrics when the shape of sample_weight is inappropriate#9903
TomDLT merged 2 commits intoscikit-learn:masterfrom
qinhanmin2014:metric_err_message

qinhanmin2014 commented Oct 11, 2017 •

edited by lesteve

Loading

Uh oh!

lesteve commented Oct 11, 2017 •

edited

Loading

Uh oh!

qinhanmin2014 commented Oct 11, 2017

Uh oh!

TomDLT commented Oct 11, 2017

Uh oh!

qinhanmin2014 commented Oct 15, 2017 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

qinhanmin2014 commented Oct 11, 2017 • edited by lesteve Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issue

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

lesteve commented Oct 11, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qinhanmin2014 commented Oct 11, 2017

Uh oh!

TomDLT commented Oct 11, 2017

Uh oh!

qinhanmin2014 commented Oct 15, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qinhanmin2014 commented Oct 11, 2017 •

edited by lesteve

Loading

lesteve commented Oct 11, 2017 •

edited

Loading

qinhanmin2014 commented Oct 15, 2017 •

edited

Loading