Add requires_positive_y estimator tag by rth · Pull Request #14095 · scikit-learn/scikit-learn

rth · 2019-06-14T19:00:39Z

This adds a requires_positive_y estimator tag, for estimators that only work with a positive y for regression, such as Poisson regression in the GLM PR #9405.

This tag only makes sense for regressors, but this is not enforced. Estimator tag validation in general could be done in some other PR.

The GLM PR distinguishes estimators, that work with strictly positive y as well as positive or zero. Here I only consider the strictly positive case, as I think for common tests that is enough.

rth · 2019-06-14T19:01:52Z

doc/developers/contributing.rst

    whether the estimator requires positive X.

+requires_positive_y
+    whether the estimator requires a positive y (only applicable for regression).


Maybe we should call the one above requires_positive_X to be consistent, not sure. In any case, I am not so keen on requires_positive_target if we try to reach naming consistency in the other direction.

Why do you not like target? I'm fine either way.

Wanna actually add the one above and do the same thing you did for y? The tag is not used in the estimator or tags yet, I think.

Why do you not like target? I'm fine either way.

Mostly to be consistent with fit(X, y), but it's not too critical I agree.

Wanna actually add the one above and do the same thing you did for y? The tag is not used in the estimator or tags yet, I think.

Sure will make another PR.

rth · 2019-06-14T19:04:05Z

sklearn/utils/tests/test_estimator_checks.py

+
+        # doesn't error on actual estimator
+        LogisticRegression,
+        LogisticRegression(),


Previously this run on AdaBoostClassifier which took significant time (around 3 sec) for two checks. Since it looks like this only intends to check that initialized/non initialized estimators work, replaced it by a faster estimator.

AdaBoostClassifier is picked up by common tests anyway.

rth · 2019-06-15T21:41:36Z

(The test failure in one of the jobs is unrelated).

rth · 2019-06-20T16:34:57Z

In case you are able to have a quick look (should be easy to review) @thomasjpfan @glemaitre. Already has a +1 )

rth · 2019-06-21T16:16:52Z

I think I have addressed all comments. If there aren't new ones, could someone please merge this then, as this PR has a +2? Thanks!

jnothman

I think we should just get some consensus on naming: requires_positive_y vs requires_positive_target. @rth prefers the former, I think I prefer the latter for consistency with requires_positive_data. Other opinions? Does it matter if tags are still experimental?

thomasjpfan · 2019-06-24T02:22:55Z

We have X_types instead of data_types, while also using requires_positive_data. I am +0.5 on using X and y since it used in all our function signatures.

TLDR: +0.5 on requires_positive_y and changing requires_positive_data to requires_positive_X.

rth · 2019-06-24T06:12:42Z

OK, changed requires_positive_data to requires_positive_X for consistency with X_types and requires_positive_y. Though no strong objections on this, can change those to _data and _target if necessary.

jnothman

It's also more consistent with X_types.

doc/developers/contributing.rst

Co-Authored-By: Joel Nothman <joel.nothman@gmail.com>

rth · 2019-06-24T13:28:42Z

Thanks, @jnothman -- I added your suggestion! This should be good to merge then (CI is green)?

rth and others added 2 commits June 14, 2019 19:43

Add requires_positive_y estimator tag

5e57ca6

Add comment

4527cb9

rth commented Jun 14, 2019

View reviewed changes

Remove pytest dependency from test_estimator_checks.py

feb3d03

jnothman approved these changes Jun 19, 2019

View reviewed changes

amueller approved these changes Jun 20, 2019

View reviewed changes

jnothman reviewed Jun 23, 2019

View reviewed changes

Change requires_positive_data -> requires_positive_X

f2f04bc

jnothman reviewed Jun 24, 2019

View reviewed changes

doc/developers/contributing.rst Outdated Show resolved Hide resolved

Update doc/developers/contributing.rst

11b1e3d

Co-Authored-By: Joel Nothman <joel.nothman@gmail.com>

jnothman merged commit 78ac1ab into scikit-learn:master Jun 24, 2019

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

TST Add requires_positive_y estimator tag (scikit-learn#14095)

f7873b9

rth mentioned this pull request Jul 25, 2019

Minimal Generalized linear models implementation (L2 + lbfgs) #14300

Merged

7 tasks

wdevazelhes mentioned this pull request Aug 19, 2019

[MRG] Use tag requires_positive_X for NMF + ComplementNB #14680

Merged

Uh oh!

Conversation

rth commented Jun 14, 2019

Uh oh!

rth Jun 14, 2019

Choose a reason for hiding this comment

Uh oh!

amueller Jun 20, 2019

Choose a reason for hiding this comment

Uh oh!

amueller Jun 20, 2019

Choose a reason for hiding this comment

Uh oh!

rth Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

rth Jun 14, 2019

Choose a reason for hiding this comment

Uh oh!

rth commented Jun 15, 2019

Uh oh!

rth commented Jun 20, 2019

Uh oh!

rth commented Jun 21, 2019

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

thomasjpfan commented Jun 24, 2019

Uh oh!

rth commented Jun 24, 2019

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rth commented Jun 24, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants