[MRG] Raise descriptive ValueError if number of samples equals number of classes in Linear Discriminant Analysis by eddbrown · Pull Request #12391 · scikit-learn/scikit-learn

eddbrown · 2018-10-15T23:00:17Z

Fixes #12374

Before: Failed with division by zero error if no_classes == no_sample in Linear Discriminant Analysis.

After: This PR raises a more descriptive ValueError, explaining the problem.

jnothman · 2018-10-16T03:10:37Z

Please give your PR a more descriptive title

jnothman · 2018-10-16T03:11:06Z

The PR *description* should contain "Fixes #xxxx"

adrinjalali · 2018-10-16T09:10:28Z

Could you please also add a test? You also have a PEP8 issue, hence travis failing. flake8 /path/to/file.py would tell you where to fix as well.

adrinjalali · 2018-10-16T20:38:38Z

sklearn/tests/test_discriminant_analysis.py

+    X = np.zeros((no_classes, no_classes))
+    y = np.zeros((no_classes, 1))
+    clf = LinearDiscriminantAnalysis(solver="svd")
+    assert_raises(ValueError, clf.fit, X, y)


Suggested change

assert_raises(ValueError, clf.fit, X, y)

with pytest.raises(ValueError, match="The number of samples must be more"):

clf.fit(X, y)

ah nice, tests the message too to make sure that its actually THAT ValueError being raised

yes, and since the adoption of pytest, pytest.... are generally preferred to custom assert_... ones.

thanks for your advice!

jnothman · 2018-10-17T07:46:42Z

Tests are failing

…of samples

jnothman · 2018-10-17T09:47:53Z

Please append commits rather than amending and force-pushing. It makes it hard for us to review when we aren't notified that a commit has been added and we can't see the diff

jnothman

Since classes_ is set as unique values in y, it seems a little strange to have the message state the number of samples and classes when they must be equal

adrinjalali · 2018-10-17T09:54:59Z

sklearn/discriminant_analysis.py

+                             "Currently, you have {} samples and {} "
+                             "classes.".format(n_samples, n_classes))
+
+


please have only one empty new line. Why wouldn't flake8 not fail on this on travis?

jnothman · 2018-10-17T09:59:57Z

This case should also be handled for the other solvers. Currently eigen raises an error, while lsqr gives a warning.

The lsqr and eigen solvers also will fail if the number of samples equals the number of classes, so this commit moves the error raise to cover all cases.

eddbrown · 2018-10-17T22:27:56Z

@jnothman @adrinjalali thanks for your patience. How is it looking?

jnothman

Otherwise LGTM. I don't know if it's helpful to users to put this in our change log...?

jnothman · 2018-10-18T06:10:39Z

sklearn/tests/test_discriminant_analysis.py

    assert_almost_equal(c_s, c_s.T)
+
+
+def test_raises_value_error_on_same_number_of_classes_and_samples():


Use pytest.mark.parameterize to test this for each solver I suppose

adrinjalali

Thanks @eddbrown , LGTM.

eddbrown · 2018-10-21T16:13:22Z

@adrinjalali what is the next step with getting this merged then? I think I've misunderstood the process a bit...

adrinjalali · 2018-10-21T16:37:26Z

@eddbrown an unwritten part of the process is the casual (read usual, normal) patience required for a core developer to do the next action ;) let it be a review, response to a question, merge, etc.

Your PR now is in a state to be merged, and I guess it will be soon (I'm not somebody with permissions to do a merge). Now you can just sit back and relax until it happens, or go on and find another issue to investigate ;)

jnothman · 2018-10-23T07:05:11Z

an unwritten part of the process is the casual (read usual, normal) patience

I'm pretty sure I mentioned patience somewhere in the contributor docs in the last couple of years... ?

Thanks @eddbrown

…f classes in Linear Discriminant Analysis (scikit-learn#12391)

…number of classes in Linear Discriminant Analysis (scikit-learn#12391)" This reverts commit 6cae448.

…f classes in Linear Discriminant Analysis (scikit-learn#12391)

eddbrown changed the title ~~Fixes #12374~~ [MRG] Fixes #12374 Oct 15, 2018

eddbrown changed the title ~~[MRG] Fixes #12374~~ [MRG] Raise descriptive ValueError if number of sample equals number of classes in Linear Discriminant Analysis Oct 16, 2018

eddbrown changed the title ~~[MRG] Raise descriptive ValueError if number of sample equals number of classes in Linear Discriminant Analysis~~ [MRG] Raise descriptive ValueError if number of samples equals number of classes in Linear Discriminant Analysis Oct 16, 2018

eddbrown force-pushed the master branch 2 times, most recently from 6a09189 to 3c1f7a1 Compare October 16, 2018 20:32

adrinjalali reviewed Oct 16, 2018

View reviewed changes

eddbrown force-pushed the master branch 3 times, most recently from 706a3df to d0f62c8 Compare October 16, 2018 22:09

Raise descriptive value error if number of classes equals the number …

28cb37f

…of samples

eddbrown force-pushed the master branch from d0f62c8 to 28cb37f Compare October 17, 2018 08:48

jnothman reviewed Oct 17, 2018

View reviewed changes

adrinjalali reviewed Oct 17, 2018

View reviewed changes

Edward Brown added 2 commits October 17, 2018 23:23

Cover all linear discriminant analysis methods

22290b8

The lsqr and eigen solvers also will fail if the number of samples equals the number of classes, so this commit moves the error raise to cover all cases.

Add back missing variables

d89af6b

jnothman reviewed Oct 18, 2018

View reviewed changes

Edward Brown added 2 commits October 18, 2018 20:21

Iterate through solvers with pytest parameterise

0113876

Fix test

0de3220

jnothman approved these changes Oct 21, 2018

View reviewed changes

adrinjalali approved these changes Oct 21, 2018

View reviewed changes

agramfort approved these changes Oct 21, 2018

View reviewed changes

jnothman merged commit b498ac7 into scikit-learn:master Oct 23, 2018

thoo pushed a commit to thoo/scikit-learn that referenced this pull request Nov 14, 2018

ENH Raise descriptive ValueError if number of samples equals number o…

24274af

…f classes in Linear Discriminant Analysis (scikit-learn#12391)

amueller mentioned this pull request Nov 20, 2018

[MRG] Release 0.20.1 #12383

Merged

jnothman pushed a commit to jnothman/scikit-learn that referenced this pull request Nov 20, 2018

ENH Raise descriptive ValueError if number of samples equals number o…

0ec901f

…f classes in Linear Discriminant Analysis (scikit-learn#12391)

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

ENH Raise descriptive ValueError if number of samples equals number o…

6cae448

…f classes in Linear Discriminant Analysis (scikit-learn#12391)

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "ENH Raise descriptive ValueError if number of samples equals …

d08d3a6

…number of classes in Linear Discriminant Analysis (scikit-learn#12391)" This reverts commit 6cae448.

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "ENH Raise descriptive ValueError if number of samples equals …

a5a31f8

…number of classes in Linear Discriminant Analysis (scikit-learn#12391)" This reverts commit 6cae448.

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

ENH Raise descriptive ValueError if number of samples equals number o…

96d1e29

…f classes in Linear Discriminant Analysis (scikit-learn#12391)

	assert_raises(ValueError, clf.fit, X, y)
	with pytest.raises(ValueError, match="The number of samples must be more"):
	clf.fit(X, y)

		"Currently, you have {} samples and {} "
		"classes.".format(n_samples, n_classes))

		assert_almost_equal(c_s, c_s.T)


		def test_raises_value_error_on_same_number_of_classes_and_samples():

Uh oh!

Conversation

eddbrown commented Oct 15, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jnothman commented Oct 16, 2018 via email

Uh oh!

jnothman commented Oct 16, 2018 via email

Uh oh!

adrinjalali commented Oct 16, 2018

Uh oh!

adrinjalali Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

eddbrown Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

adrinjalali Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

eddbrown Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

jnothman commented Oct 17, 2018

Uh oh!

jnothman commented Oct 17, 2018

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

adrinjalali Oct 17, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jnothman commented Oct 17, 2018

Uh oh!

eddbrown commented Oct 17, 2018

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

jnothman Oct 18, 2018

Choose a reason for hiding this comment

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

eddbrown commented Oct 21, 2018

Uh oh!

adrinjalali commented Oct 21, 2018

Uh oh!

jnothman commented Oct 23, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

eddbrown commented Oct 15, 2018 •

edited

Loading

adrinjalali Oct 17, 2018 •

edited

Loading