[WIP] Adding tests for estimators implementing partial_fit and a few other related fixes / enhancements #3907
Conversation
|
That seems a nice list (although testing that models are the same is currently done by testing the output of the estimators). I'm not altogether certain about the partial_fit semantics. |
|
What are the current semantics? |
(force-pushed from 7284706 to 8bfbd75)
|
I am stuck on the 2nd alone... The case when … (assuming that the equality checks for the accuracy …). Kindly review my current implementation of the other tests. |
(force-pushed from c3089fd to ee13683)
|
You should probably use predict/transform directly rather than score, as the existing invariance tests do. I think … |
|
You're assuming all estimators are predictors. Something like this might be more general:

from numpy.testing import assert_array_almost_equal

def _assert_same_model_method(method, X, estimator1, estimator2, msg):
    # Only compare outputs when both estimators expose the method.
    if hasattr(estimator1, method):
        assert hasattr(estimator2, method), (
            '{!r} has {}, but {!r} does not'.format(estimator1, method, estimator2))
        assert_array_almost_equal(
            getattr(estimator1, method)(X), getattr(estimator2, method)(X),
            2, 'When testing {}: {}'.format(method, msg))

def assert_same_model(X, estimator1, estimator2, msg):
    _assert_same_model_method('predict', X, estimator1, estimator2, msg)
    _assert_same_model_method('transform', X, estimator1, estimator2, msg)
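For illustration, here is a hypothetical usage of such a helper (the estimator, data, and message below are my own example, not from the PR):

import numpy as np
from sklearn.naive_bayes import MultinomialNB

rng = np.random.RandomState(0)
X = np.abs(rng.randn(20, 5))   # MultinomialNB requires non-negative input
y = rng.randint(0, 2, 20)

# For a counts-based estimator, one fit on all the data and one
# partial_fit over the same single batch should give the same model.
est1 = MultinomialNB().fit(X, y)
est2 = MultinomialNB().partial_fit(X, y, classes=np.unique(y))
assert_same_model(X, est1, est2, 'fit vs single-batch partial_fit')
|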
sklearn/utils/estimator_checks.py
Even I thought so... but this seems to be the convention followed by the other checks...
Could I change all the others uniformly to Estimator too, or should I change this one alone?
Also, what does Alg mean btw?
Feel free to change everything to Estimator; I might have picked the name, but I don't have strong opinions about it.
|
@jnothman For your 2nd comment, thanks a lot for the review... I'll make the changes :) |
@amueller I'm not quite sure. I think it should either raise an error, or it should overwrite and do a new fit from the beginning, if one calls partial_fit after fit. If partial_fit starts from where fit left off, then it does not make a difference whether you fit or partial_fit, right? ;)
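To make the two candidate semantics concrete, here is a minimal sketch (SGDClassifier and the toy data are my own choice of illustration, not prescribed by the PR):

import numpy as np
from sklearn.linear_model import SGDClassifier

X = np.array([[0., 0.], [1., 1.], [0., 1.], [1., 0.]])
y = np.array([0, 1, 1, 0])

est = SGDClassifier(random_state=0)
est.fit(X, y)

# Semantics 1: the next line raises, because mixing fit and partial_fit
# is ambiguous.
# Semantics 2: it continues from the fitted state -- but then fit(X, y)
# followed by partial_fit(X, y) is indistinguishable from one longer
# fit, which is exactly the concern voiced above.
est.partial_fit(X, y)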
|
@MechCoder @amueller From what I observed, … But as you and others suggested, perhaps … |
|
We could do either.
I'm inclined towards the second one, but I'm not sure what the best thing to do is. |
|
I feel 2b is already implemented... I guess it's between 2a or 1... Anyway, thanks for your comments :) |
|
A code-revision-related question... When revisions are suggested to the code, am I expected to add a new commit reflecting those revisions, or do I (soft) reset my previous work, make the changes, and recommit? |
|
I think you can do it either way. But if those revisions are minor and you feel it is not worth adding a new commit for them (like pep8, cosmits and such), you can just amend your commit and force-push (at least, that's what I do). |
|
Thanks... I'll also follow the same :) |
(force-pushed from 562bed4 to 133778a)
|
The point is that the interaction between fit and partial_fit … |
|
@jnothman To your earlier comment, could you expand a little on what you meant? I understood the part where you said all those estimators should not be tested assuming they are predictors, and that we should separately test them based on whether they are a Transformer or a Predictor... but I fail to understand the two code snippets... [ @amueller perhaps such a functionality could be useful for … ] So for estimators that are not Transformers: … And for Transformers: … Excuse me if I understood your comment incorrectly...
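As an aside, a complementary helper in the spirit of assert_same_model might look like this rough sketch (the name matches the helper listed in this PR's description, but the body is only my guess at it):

def assert_not_same_model(X, estimator1, estimator2, msg):
    # "Not the same" here means at least one shared method disagrees,
    # i.e. the equality assertion fails.
    try:
        assert_same_model(X, estimator1, estimator2, msg)
    except AssertionError:
        return
    raise AssertionError(msg)
|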
(force-pushed from 133778a to bcb342d)
Tests for partial_fit and for the utils.testing.all_estimators function.
(force-pushed from 672b06a to 3baac44)
Noted. Thanks :)) |
sklearn/tests/test_common.py
why did you do this in its own test? This means they are not part of check_estimator, right?
refactored (haven't pushed yet) :)
|
I'm not entirely convinced of the usefulness of a general … |
sklearn/utils/testing.py
I don't understand that. Why are these skipped? What is the point of the test if we skip these?
Spectral embedding is still non-deterministic due to the zero-eigenvalue problem, which I am attempting to solve at #4299 ;)
I forgot which algorithm uses the weights_ attribute ;) will comment on why I had skipped it for checking shortly :)
In sklearn/utils/testing.py
#3907 (comment):

    any_method_differed = any((t1, t2, t3, t4))
    if not any_method_differed:
        # If all methods return similar results for both ests or if those
        # methods are not present
        assert_fitted_attributes_not_equal(estimator1, estimator2)

    def _attributes_equal(estimator1, estimator2):
        """Helper function to check if fitted model attributes are equal or not.

        Raises AssertionError if the attributes are not equal.
        """
        # A list of attributes which are known to be inconsistent.
        skip_attributes = ('embedding_', 'weights_')

You should not skip it for all estimators. Hard-coding this in this place is really bad!
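For reference, a skip-list-free way of comparing fitted attributes might look like this sketch (all names below are illustrative, not the PR's actual code):

import numpy as np

def fitted_attributes(estimator):
    # Scikit-learn convention: attributes learned during fit end with '_'.
    return {name: value for name, value in vars(estimator).items()
            if name.endswith('_') and not name.startswith('_')}

def assert_fitted_attributes_equal(estimator1, estimator2):
    attrs1 = fitted_attributes(estimator1)
    attrs2 = fitted_attributes(estimator2)
    assert set(attrs1) == set(attrs2), 'Different sets of fitted attributes'
    for name in attrs1:
        # Handles scalar and array attributes; estimators with nested or
        # non-numeric attributes would need special-casing, not skipping.
        np.testing.assert_array_almost_equal(
            np.asarray(attrs1[name]), np.asarray(attrs2[name]),
            err_msg='Attribute %s differs' % name)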
If you know your estimator is a predictor, then yes. … I'd be interested in seeing these helpers moved to a smaller PR, like #4162, so that they can be reviewed and merged without a large change to the tests. Then work on merging this PR, and on refactoring existing tests to use the helpers, can begin. |
|
Yes, that will be much better! I'll cherry-pick the helpers into a new branch and raise a PR!! |
FIX Return self in partial_fit of BernoulliRBM
* Check if no error is raised when fit is called with a different number of features
* Check if the estimator is reset when fit is called after a partial_fit
* Check if an error is raised when the number of features changes between different partial_fit calls
* Check if partial_fit does not overwrite the previous model
* Check if the classes argument is parsed and validated correctly
* Check if clone(estimator).partial_fit == estimator.partial_fit
* Check if partial_fit returns self (see the sketch after this list)
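A minimal sketch of the last check above, assuming a hypothetical check name and the usual (name, estimator) calling convention of sklearn.utils.estimator_checks:

import numpy as np
from sklearn.base import is_classifier

def check_partial_fit_returns_self(name, estimator, X, y):
    # Hypothetical check: only applies to estimators with partial_fit.
    if not hasattr(estimator, 'partial_fit'):
        return
    if is_classifier(estimator):
        # Classifiers need the full set of classes on the first call.
        result = estimator.partial_fit(X, y, classes=np.unique(y))
    else:
        result = estimator.partial_fit(X, y)
    assert result is estimator, '%s.partial_fit does not return self' % name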
|
Not sure what the state of this PR is at the moment, but please consider #5416. In the case of scalers, the issue came from the fact that we are now implementing partial_fit for them. Ping @lesteve |
|
Is this PR just waiting for review or is there still work to be done? |
|
This is gonna need a fresh start if we're going to do it. And we'd need to do significant work on the semantics of partial_fit. |
Fixes #3896
New partial_fit tests (the below tests are based off @arjoly's suggestion):

* partial_fit returns self. - Thanks @jnothman for the suggestion.
* clone(est).partial_fit gives the same model as est.partial_fit.
* fit after a set of fit/partial_fit calls restarts the estimator.
* partial_fit and then fit with a different number of features works without raising any Exceptions... - Thanks @amueller for the suggestion.
* partial_fit does not overwrite the previous model.
* Mismatch between the classes argument and np.unique(y_i) raises ValueError.
* Check when classes is not specified during the first partial_fit call.
* Check when classes is not specified during subsequent calls.
* _validate_y_against_classes - check label mismatch between y and the classes arg.

New helpers:

* assert_same_model(X, est1, est2) and assert_not_same_model(X, est1, est2) - use predict, transform, decision_function and predict_proba to check equality of models.
* assert_attributes_equal(est1, est2) and assert_attributes_not_equal(est1, est2)
* assert_array_not_equal (a sketch follows after the refs)
* Refactored _partial_fit and _fit in sklearn.utils.estimator_checks to use the appropriate parameters.

Refs:
Also see #406 - this PR fixes the 1st point under "not so easy" for pfit-able estimators, via 3a (of this PR).
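For completeness, a plausible implementation of the assert_array_not_equal helper listed above (the PR's actual code may differ):

from numpy.testing import assert_array_equal

def assert_array_not_equal(x, y, err_msg=''):
    # Passes iff assert_array_equal would fail, i.e. the arrays differ
    # in shape or content somewhere.
    try:
        assert_array_equal(x, y)
    except AssertionError:
        return
    raise AssertionError('Arrays are equal. %s' % err_msg)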