MRG adding test of fit attributes by agramfort · Pull Request #16286 · scikit-learn/scikit-learn

agramfort · 2020-01-29T15:36:28Z

What does this implement/fix? Explain your changes.

aims to makes sure that all attributes that appear in the fit are documented and vice versa.

rth

Thanks, a few comments below! Should be move the status to MRG?

rth · 2020-01-29T16:38:54Z

sklearn/tests/test_docstring_parameters.py

+               'StackingRegressor', 'TfidfVectorizer', 'VotingClassifier',
+               'VotingRegressor']
+    if Estimator.__name__ in IGNORED or Estimator.__name__.startswith('_'):
+        pytest.xfail(


Maybe pytest.skip here since realistically we don't intend to make these work in the future.

rth · 2020-01-29T16:40:36Z

sklearn/tests/test_docstring_parameters.py

+    X_reg -= X_reg.min()
+
+    if is_classifier(est):
+        X, y = X_classif, y_classif


Nit: maybe call make_classification / regression here only when necessary, and then,

X -= X.min()

once.

rth · 2020-01-29T16:42:24Z

sklearn/tests/test_docstring_parameters.py

+        est.fit(X, y)
+
+    for attr in attributes:
+        desc = ' '.join(attr.desc).lower()


Maybe we could add a comment what this checks for since it's not very clear after reading the ode.

rth · 2020-01-29T16:46:37Z

sklearn/tests/test_docstring_parameters.py

+            continue
+        if attr.startswith('_'):
+            continue
+        assert attr in fit_attr_names


I think here it might be better to the filtered fit_attr (with removed private attributes and known exceptions), than do,

undocumented_attrs = set(fit_attr_names).difference(fit_attr) assert not undocumented_attrs, "Undocumented attributes: {}".format(undocumented_attrs)

that way all the undocumented attributes are printed at once, and the user doesn't have to iteratively run this test.

rth · 2020-01-29T16:48:21Z

sklearn/tests/test_docstring_parameters.py

+    fit_attr = [k for k in est.__dict__.keys() if k.endswith('_')]
+    fit_attr_names = [attr.name for attr in attributes]
+    for attr in fit_attr:
+        if attr in ['X_offset_', 'X_scale_', 'fit_', 'partial_fit_', 'x_mean_',


Maybe let's add a comment that these should be removed from the public API.

lesteve · 2020-01-29T17:11:54Z

Just for reference @amueller had some snippet in #14312 and an PR attempt at this in #13385. Maybe worth checking out to see how similar/different they are to this PR?

…ikit-learn#13385)

…uded in new test for doc of all attributes

…tion of tag requires_positive_X broke test of sklearn/tests/test_common.py

…into doc_check_attributes_fit

agramfort · 2020-02-15T15:03:41Z

thx @judithabk6 for taking over.

@rth I addressed your last comments. I think it's good enough from my end.

more reviews are welcome

cmarmo · 2020-03-03T14:43:26Z

@judithabk6 @agramfort the failing test is related to #16545 : could you please sync with upstream? This will hopefully solve the issue. Thanks!

rth

A few minor comments otherwise LGTM, thanks!

sklearn/cluster/_mean_shift.py

sklearn/cross_decomposition/_cca.py

sklearn/cross_decomposition/_pls.py

sklearn/discriminant_analysis.py

rth · 2020-03-04T10:34:41Z

sklearn/tests/test_docstring_parameters.py

+        est.k = 2
+
+    if Estimator.__name__ == 'DummyClassifier':
+        est.strategy = "stratified"


I think the following might work for a larger number of estimators,

from sklearn.utils.estimator_checks import _construct_instance, _set_checking_parameters est = _construct_instance(Estimator) _set_checking_parameters(est)

Not asking to do it now, I can change it in a follow up PR.

rth · 2020-03-04T10:38:38Z

sklearn/tests/test_docstring_parameters.py

+               'SkewedChi2Sampler'}
+    if Estimator.__name__ in IGNORED:
+        pytest.xfail(
+            reason="Classifier has too many undocumented attributes.")


FYI: we can now also put these in _xfail_test estimator tag for individual estimators (https://scikit-learn.org/dev/developers/develop.html#estimator-tags) but it's not critical. Not asking to do it.

rth · 2020-03-04T10:41:34Z

Now tests fails because n_features_in_ is not documented, we should probably skip it?

rth

LGTM, assuming CI passes. Thanks!

rth · 2020-03-04T12:51:47Z

Merging +1 as this is fairly low risk (extends an existing test). Thanks!

amueller · 2020-03-05T19:00:09Z

Yay!

WIP adding test of fit attributes

323d660

jeremiedbb added the Sprint label Jan 29, 2020

test the other way around ie that present attributes are documented

063c3f1

rth reviewed Jan 29, 2020

View reviewed changes

lesteve mentioned this pull request Jan 29, 2020

implement a test to enforce better documentation of attributes #16292

Closed

rth mentioned this pull request Jan 30, 2020

ENH Support for XFAIL/XPASS in common tests #16306

Closed

judithabk6 added 8 commits January 31, 2020 15:13

a bit of refactor of test_fit_docstring_attributes (inspiration PR sc…

6236e9b

…ikit-learn#13385)

merge with master + remove test for doc of classes_ attribute as incl…

7bccc4c

…uded in new test for doc of all attributes

fixing conflicts

b6e2b59

fix conflicts

ad42c75

change broken test for CategoricalNB (string was changed because addi…

ffecfe9

…tion of tag requires_positive_X broke test of sklearn/tests/test_common.py

set random state so that RANSAC fit works even with small dataset

d0a4763

fix xfail list

ea662d4

Merge branch 'master' of https://github.com/scikit-learn/scikit-learn …

06d0825

…into doc_check_attributes_fit

judithabk6 mentioned this pull request Jan 31, 2020

fix space missing in documentation #16351

Merged

judithabk6 and others added 3 commits January 31, 2020 19:20

Merge branch 'master' of https://github.com/scikit-learn/scikit-learn …

80cd5c1

…into doc_check_attributes_fit

escape warning that broke tests + performance improvement

b12ca8b

address comments from @rth

37048b9

agramfort changed the title ~~WIP adding test of fit attributes~~ MRG adding test of fit attributes Feb 15, 2020

agramfort marked this pull request as ready for review February 15, 2020 15:02

Merge branch 'master' into doc_check_attributes_fit

edc4600

rth reviewed Mar 4, 2020

View reviewed changes

agramfort and others added 4 commits March 4, 2020 11:52

skip n_features_in_

13abbf4

Update sklearn/discriminant_analysis.py

6e35ec7

Update sklearn/cross_decomposition/_pls.py

e79ae47

Update sklearn/cross_decomposition/_cca.py

5b63848

Update sklearn/cluster/_mean_shift.py

45b899e

rth approved these changes Mar 4, 2020

View reviewed changes

rth merged commit 60b8fb2 into scikit-learn:master Mar 4, 2020

agramfort mentioned this pull request Mar 4, 2020

[WIP] Enforce positiveness in tests using enforce_estimator_tags_X #14705

Closed

amueller mentioned this pull request Mar 5, 2020

Tests for attribute documentation #13385

Closed

ashutosh1919 pushed a commit to ashutosh1919/scikit-learn that referenced this pull request Mar 13, 2020

TST add test of fit attributes (scikit-learn#16286)

6578fb4

gio8tisu pushed a commit to gio8tisu/scikit-learn that referenced this pull request May 15, 2020

TST add test of fit attributes (scikit-learn#16286)

787f191

amueller mentioned this pull request May 29, 2020

Ensure all attributes are documented #14312

Closed

cmarmo mentioned this pull request Apr 30, 2021

Failed xtests on test_docstring_parameters.py (documentation issues) #19781

Closed

4 tasks

Uh oh!

Conversation

agramfort commented Jan 29, 2020

What does this implement/fix? Explain your changes.

Uh oh!

rth left a comment

Choose a reason for hiding this comment

Uh oh!

rth Jan 29, 2020

Choose a reason for hiding this comment

Uh oh!

rth Jan 29, 2020

Choose a reason for hiding this comment

Uh oh!

rth Jan 29, 2020

Choose a reason for hiding this comment

Uh oh!

rth Jan 29, 2020

Choose a reason for hiding this comment

Uh oh!

rth Jan 29, 2020

Choose a reason for hiding this comment

Uh oh!

lesteve commented Jan 29, 2020

Uh oh!

agramfort commented Feb 15, 2020

Uh oh!

cmarmo commented Mar 3, 2020

Uh oh!

rth left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rth Mar 4, 2020

Choose a reason for hiding this comment

Uh oh!

rth Mar 4, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rth commented Mar 4, 2020

Uh oh!

rth left a comment

Choose a reason for hiding this comment

Uh oh!

rth commented Mar 4, 2020

Uh oh!

amueller commented Mar 5, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

rth Mar 4, 2020 •

edited

Loading