[MRG+1] API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score by qinhanmin2014 · Pull Request #13157 · scikit-learn/scikit-learn

qinhanmin2014 · 2019-02-13T12:55:30Z

Closes #12772
Wondering if someone has a better way :)
In the original issue, I asked why we prefer uniform_average but received no reply. I guess we chose uniform_average to stay consistent with the other regression metrics.
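For readers unfamiliar with the option under discussion, here is a small sketch of how the `multioutput` choices in `r2_score` differ. The toy arrays are made up for illustration; the behaviour shown assumes a scikit-learn release in which `r2_score` defaults to `multioutput="uniform_average"`.

```python
import numpy as np
from sklearn.metrics import r2_score

# Toy two-output regression data (illustrative numbers only).
y_true = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 60.0], [4.0, 30.0]])
y_pred = np.array([[1.1, 12.0], [1.9, 25.0], [3.2, 40.0], [3.8, 33.0]])

# One R^2 value per output column.
per_output = r2_score(y_true, y_pred, multioutput="raw_values")

# Plain mean of the per-output scores.
uniform = r2_score(y_true, y_pred, multioutput="uniform_average")

# Per-output scores weighted by the variance of each true output.
weighted = r2_score(y_true, y_pred, multioutput="variance_weighted")

# No explicit choice: falls back to the function's default.
default = r2_score(y_true, y_pred)

print(per_output, uniform, weighted, default)
```

Because the second output has a much larger variance and a worse fit, the two aggregations give noticeably different numbers on this data.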


jnothman · 2019-02-13T20:59:26Z

You should note this in the docstring in any case. An alternative might be to require multioutput to be explicitly specified in r2_score.

qinhanmin2014 · 2019-02-14T01:21:04Z

> An alternative might be to require multioutput to be explicitly specified in r2_score.

You mean adding a multioutput parameter to RegressorMixin.score? Does this violate our API design?

jnothman · 2019-02-14T02:55:29Z

No, I mean that r2_score would no longer accept being called on multioutput without an explicit choice.

qinhanmin2014 · 2019-02-14T03:27:20Z

> No, I mean that r2_score would no longer accept being called on multioutput without an explicit choice.

You mean we'll raise an error for calls like r2_score(y_true, y_pred) when y_type is continuous-multioutput? I doubt that's a good idea, and it's not related to this PR, right?
(1) If we do so, we'll need to update the other regression metrics.
(2) If we do so, we'll need to introduce a multioutput parameter in RegressorMixin.score.
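To make the proposal concrete, here is a rough sketch of what "raise unless multioutput is explicit" could look like. `strict_r2_score` is a hypothetical wrapper written for illustration; it is not part of scikit-learn and not what this PR implements.

```python
import numpy as np
from sklearn.metrics import r2_score
from sklearn.utils.multiclass import type_of_target


def strict_r2_score(y_true, y_pred, multioutput=None):
    """Hypothetical wrapper: refuse multioutput targets without an explicit choice."""
    if multioutput is None:
        # type_of_target reports "continuous-multioutput" for 2D float targets
        # with more than one column and non-integer values.
        if type_of_target(y_true) == "continuous-multioutput":
            raise ValueError(
                "multioutput must be set explicitly for multioutput targets"
            )
        return r2_score(y_true, y_pred)
    return r2_score(y_true, y_pred, multioutput=multioutput)


# Illustrative data: non-integer floats so the target is detected as continuous.
y_true = np.array([[0.5, 1.5], [1.2, 3.1], [2.7, 4.4]])
y_pred = y_true + 0.1

try:
    strict_r2_score(y_true, y_pred)
except ValueError as exc:
    print("refused:", exc)

print(strict_r2_score(y_true, y_pred, multioutput="uniform_average"))
```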

jnothman · 2019-02-14T04:07:45Z

Well, it solves this problem since users of r2_score would become aware that the equivalence to .score cannot be taken for granted...? I think forcing the user to be explicit has been helpful in precision_score etc. with respect to `average`. It was the motivation for average='binary'.
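As a point of comparison, the `precision_score` behaviour referred to here can be seen in a short example. The labels are made up; the sketch relies on `precision_score` defaulting to `average='binary'` and rejecting multiclass targets under that default.

```python
from sklearn.metrics import precision_score

# Three-class toy labels (illustrative only).
y_true = [0, 1, 2, 2, 1, 0]
y_pred = [0, 2, 2, 2, 1, 1]

# With the default average='binary', a multiclass target is rejected,
# which forces the caller to pick an averaging strategy.
try:
    precision_score(y_true, y_pred)
    raised = False
except ValueError:
    raised = True

# An explicit choice works.
macro = precision_score(y_true, y_pred, average="macro")
print(raised, macro)
```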


qinhanmin2014 · 2019-02-14T04:18:05Z

> Well, it solves this problem since users of r2_score would become aware that the equivalence to .score cannot be taken for granted...?

So with your proposal:
(1) We'll need to update other regression metrics.
(2) Users can't use RegressorMixin.score to evaluate multioutput problems?

And this is not backward compatible, so I doubt it's worthwhile.

qinhanmin2014 · 2019-02-14T04:21:12Z

> (2) Users can't use RegressorMixin.score to evaluate multioutput problems?

Maybe you mean that we should set multioutput explicitly in RegressorMixin.score?

jnothman · 2019-02-14T04:57:43Z

> Maybe you mean that we should set multioutput explicitly in RegressorMixin.score?

We already do that??? But we don't document it...

qinhanmin2014 · 2019-02-14T07:27:25Z

> We already do that??? But we don't document it...

But this is actually a bug introduced in #5143, right?

I still doubt whether it's worthwhile to make every regression metric that supports multioutput raise an error when the multioutput parameter is not set explicitly (and y_type is multioutput). Is this your final decision?

jnothman · 2019-02-14T08:01:33Z

No I suppose it doesn't really solve the problem... Documentation does.

qinhanmin2014 · 2019-02-14T08:06:26Z

> No I suppose it doesn't really solve the problem... Documentation does.

@jnothman Could you please summarize your proposal here? I'm starting to get confused. Thanks.

qinhanmin2014 · 2019-03-11T12:52:50Z

I'm surprised to find that we implement the score method in MultiOutputRegressor and use multioutput='uniform_average' there :)
I'd prefer to include this in 0.21. @jnothman Is it possible to regard it as a bug fix and change it without a deprecation cycle? If not, what's your proposal here?

jnothman · 2019-03-12T00:41:15Z

Can we get by with documenting the current behaviour?

qinhanmin2014 · 2019-03-12T03:13:26Z

@jnothman current behavior:
(1) r2_score default multioutput="uniform_average"
(2) MultiOutputRegressor default multioutput="uniform_average"
(3) RegressorMixin (i.e., all other regressors) default multioutput="variance_weighted"
Do you think that's acceptable? (Without (2), maybe acceptable.)
I think we can regard it as a bug fix because it was erroneously introduced when changing the default value of a parameter.
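To see why the mismatch between these defaults matters, the two aggregations can be written out by hand. This is a dependency-free sketch with made-up data and hypothetical helper names; it follows the usual definitions (plain mean of per-output R² versus per-output R² weighted by each output's total sum of squares) rather than reproducing the library code.

```python
def sums_of_squares(y_true, y_pred):
    """Return a list of (ss_res, ss_tot) pairs, one per output column."""
    n_outputs = len(y_true[0])
    stats = []
    for j in range(n_outputs):
        col_true = [row[j] for row in y_true]
        col_pred = [row[j] for row in y_pred]
        mean = sum(col_true) / len(col_true)
        ss_res = sum((t - p) ** 2 for t, p in zip(col_true, col_pred))
        ss_tot = sum((t - mean) ** 2 for t in col_true)
        stats.append((ss_res, ss_tot))
    return stats


def uniform_average(stats):
    # Every output counts equally.
    return sum(1 - res / tot for res, tot in stats) / len(stats)


def variance_weighted(stats):
    # Outputs with a larger total sum of squares count more.
    total = sum(tot for _, tot in stats)
    return sum((tot / total) * (1 - res / tot) for res, tot in stats)


# Illustrative data: the second output has far larger variance and a worse fit.
y_true = [[1.0, 10.0], [2.0, 20.0], [3.0, 60.0], [4.0, 30.0]]
y_pred = [[1.1, 12.0], [1.9, 25.0], [3.2, 40.0], [3.8, 33.0]]

stats = sums_of_squares(y_true, y_pred)
print(uniform_average(stats), variance_weighted(stats))
```

On this data the variance-weighted score is dominated by the second output, so the same model reports a visibly different score depending on which default is in effect.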

jnothman

Okay. I'm persuaded. I think this is an okay solution, albeit raising lots of warnings for the next while.

doc/whats_new/v0.21.rst

qinhanmin2014 · 2019-03-12T09:20:44Z

ready for review @jnothman

sklearn/base.py

qinhanmin2014 · 2019-03-13T09:10:24Z

ping related people here: @amueller @agramfort @ogrisel
and maybe @adrinjalali

amueller · 2019-03-18T18:27:05Z

Thanks! Maybe we should tell the user how to avoid this message? It's a bit ugly unfortunately. They can specify it directly when using cv or grid-search.
If they use the score method they would need to import r2 directly.
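A rough sketch of the two workarounds mentioned here: call `r2_score` directly with an explicit `multioutput`, or pass an explicit scorer to the cross-validation helpers. The dataset, estimator, and parameter values are arbitrary choices for illustration.

```python
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.metrics import make_scorer, r2_score
from sklearn.model_selection import cross_val_score

# Synthetic two-target regression problem (illustrative settings).
X, y = make_regression(
    n_samples=60, n_features=5, n_targets=2, noise=5.0, random_state=0
)
est = LinearRegression().fit(X, y)

# Option 1: bypass est.score and state the aggregation explicitly.
explicit = r2_score(y, est.predict(X), multioutput="variance_weighted")

# Option 2: build a scorer with the desired aggregation and hand it to
# cross-validation (the same object can be passed to GridSearchCV's scoring).
weighted_scorer = make_scorer(r2_score, multioutput="variance_weighted")
cv_scores = cross_val_score(
    LinearRegression(), X, y, cv=3, scoring=weighted_scorer
)

print(explicit, cv_scores)
```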

qinhanmin2014 · 2019-03-19T04:03:34Z

> Thanks! Maybe we should tell the user how to avoid this message? It's a bit ugly unfortunately. They can specify it directly when using cv or grid-search. If they use the score method they would need to import r2 directly.

I've opened #13477 @amueller

qinhanmin2014 added 2 commits February 13, 2019 20:52

API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score

946a056

flake8

621b719

qinhanmin2014 added this to the 0.21 milestone Feb 13, 2019

qinhanmin2014 added 3 commits February 13, 2019 21:51

test failures

e4c27c7

import pytest

760dd38

test failure

edebe5d

note in doc

40eab32

jnothman closed this Mar 12, 2019

jnothman reopened this Mar 12, 2019

jnothman reviewed Mar 12, 2019

doc/whats_new/v0.21.rst (outdated, resolved)

doc/whats_new/v0.21.rst (outdated, resolved)

qinhanmin2014 added 4 commits March 12, 2019 15:55

Merge remote-tracking branch 'upstream/master' into RegressorMixin

4b4dce9

Joel's review

cc9cc7d

irrelevant change

e1e6d9d

more notes

5a2d3a1

jnothman approved these changes Mar 12, 2019

sklearn/base.py (outdated, resolved)

sklearn/base.py (outdated, resolved)

sklearn/base.py (outdated, resolved)

Joel's comment

9842226

agramfort changed the title ~~API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score~~ [MRG+1] API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score Mar 15, 2019

agramfort approved these changes Mar 15, 2019

qinhanmin2014 merged commit 73e2ecf into scikit-learn:master Mar 15, 2019

qinhanmin2014 deleted the RegressorMixin branch March 17, 2019 03:22

qinhanmin2014 mentioned this pull request Mar 19, 2019

MNT Update deprecation message in RegressorMixin.score to tell users how to avoid the warning #13477

Merged

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score (scikit-learn#13157)

587fcf5

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score (scikit-learn#13157)"

a1b8c00

This reverts commit 587fcf5.

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score (scikit-learn#13157)"

ce23b4b

This reverts commit 587fcf5.

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

API Change default multioutput in RegressorMixin.score to keep consistent with metrics.r2_score (scikit-learn#13157)

cd86cf7


4 participants