[MRG] Fix predict_proba not fitted check in SGDClassifier #10961
lesteve merged 6 commits into scikit-learn:master
Conversation
- Remove not fitted check from predict_proba method of SGDClassifier
- Check only while calling predict_proba
jnothman
left a comment
I think that is the right fix. Please add a non-leash regression test
I understand non-regression testing as described in the contributing guidelines, but how is a non-leash regression test different from that?
I think @jnothman meant a non-regression test (maybe autocorrect or something?). Great if you understand what it is from the contributing guidelines. Can you add a non-regression test then?
Suggestion for non-regression test (add a test function in

```python
from sklearn.linear_model import SGDClassifier

clf = SGDClassifier()
clf.predict_proba
clf.predict_log_proba
```
@lesteve There already exists a test case for the predict_proba method (test_sgd_proba); shouldn't it be modified instead of writing a separate test case?
Not really important, but I would rather put that in a separate test function. It is a bit of a special case to just test that you can access the attribute.
- Test if the predict_proba and predict_log_proba methods can be accessed before fitting
- Test if the not fitted check is performed before calling the proba methods
```python
# Checks if not fitted check is performed while calling
# the methods
assert_raises(NotFittedError, clf.predict_proba, [[3, 2]])
```

```python
# is accessible for referencing before fitting
# the SGD classifier
clf = SGDClassifier()
assert_false(hasattr(clf, "predict_proba"))
```
Please just use bare assert not ... rather than assert_false. Thanks
- Remove extra line
- Add space after comma
- Use assert instead of assert_false
```python
# Checks if not fitted check is performed while calling
# the methods
assert_raises(NotFittedError, clf.predict_proba, [[3, 2]])
```
I would expect the NotFittedError case to be already tested in test_common.py so I would remove this from the test.
```python
for loss in ["log", "modified_huber"]:
    clf = SGDClassifier(loss=loss)
    assert_true(hasattr(clf, "predict_proba"))
```

```python
# is accessible for referencing before fitting
# the SGD classifier
clf = SGDClassifier()
assert not hasattr(clf, "predict_proba")
```
Maybe check this for all the losses that do not support predict_proba; you can use SGD.loss_functions to get all the possible losses, I think.
Slight improvement would be to also check the error message:

```python
with pytest.raises(AttributeError,
                   match='probability estimates are not available for loss={!r}'.format(loss)):
```

- Remove test for NotFittedError
|
I pushed some minor tweaks; I think this can be merged when CIs are green.
|
Perhaps this needs a changelog entry.
I am a bit undecided, but I would say that this is rather an obscure problem (trying to only access predict_proba on an unfitted estimator).
Reference Issues/PRs
Fixes #10938
What does this implement/fix? Explain your changes.
Fixes the not fitted check in the predict_proba method so that it doesn't throw a not fitted error when the method is merely referenced. The check for whether the classifier is fitted is now performed when the method is called.
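The shape of the fix can be illustrated with a toy class (illustrative only, not scikit-learn's actual implementation; all names are hypothetical): exposing predict_proba as a property means attribute access only validates the loss, while the fitted check moves into the method that runs on an actual call.

```python
class NotFittedError(ValueError):
    """Raised when predict_proba is *called* before fit."""


class ToySGDClassifier:
    # Losses with probability support, mirroring "log" and "modified_huber".
    PROBA_LOSSES = ("log", "modified_huber")

    def __init__(self, loss="hinge"):
        self.loss = loss
        self._fitted = False

    def fit(self, X, y):
        self._fitted = True
        return self

    @property
    def predict_proba(self):
        # Attribute access only validates the loss, so
        # hasattr(clf, "predict_proba") works on an unfitted estimator.
        if self.loss not in self.PROBA_LOSSES:
            raise AttributeError(
                "probability estimates are not available for loss=%r" % self.loss)
        return self._predict_proba

    def _predict_proba(self, X):
        # The not fitted check now runs at call time, not at attribute access.
        if not self._fitted:
            raise NotFittedError("Call fit before predict_proba.")
        return [[0.5, 0.5] for _ in X]
```

With this shape, `hasattr(ToySGDClassifier(loss="hinge"), "predict_proba")` is False, while an instance with `loss="log"` can have predict_proba referenced before fitting and only raises NotFittedError when it is actually called.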
Any other comments?