[MRG+1] Adding a fit_predict method for the GMM by clorenz7 · Pull Request #4593 · scikit-learn/scikit-learn

clorenz7 · 2015-04-14T20:26:27Z

With low iterations, the prediction might not be 100% accurate due to
the final maximization step in the EM algorithm.

See issue:
#4579

ogrisel · 2015-04-14T20:36:35Z

sklearn/mixture/gmm.py

It's better to use the _check_fitted_model: do a git grep _check_fitted_model to see examples in the code base.

actually this comment is no longer relevant in light of the other comments.

ogrisel · 2015-04-14T22:52:09Z

sklearn/mixture/gmm.py

How could that ever happen?

I would return directly:

return self._fit(X, y).argmax(axis=1)

and make sure that _fit can never return None (make it raise a ValueError or similar with a meaningful error message otherwise).

It could happen in the case when n_iter == 0. You're right that never returning None is better, so I added a check to run score_samples to get the correct output value in that case (that apparently happens when running an HMM). My current idea is just to output zeros because the idea of n_iter=0 seems to be to quickly initialize a model.

ogrisel · 2015-04-14T23:15:06Z

I am not sure I understand the travis failures, it probably requires to launch a debugger.

clorenz7 · 2015-04-15T17:59:38Z

@ogrisel Besides addressing your comments, I changed the GMM subclass fit method to _fit, and added some additional test cases.

ogrisel · 2015-04-15T18:14:11Z

For the travis failure, a solution would be to not implement fit_predict for DPGMM and VBGMM, it is possible to introduce a new _BaseGMM abstract base class with most of the current methods of GMM in it and then make GMM, ``DPGMMandVBGMM`. Finally only implement `fit_predict` in the `GMM` class.

git grep ABCMeta to see how we create abstract base classes in sklearn that support both Python 2 and Python 3 in the same code base.

ogrisel · 2015-04-15T18:16:09Z

Ah alright, ignore my last comment, I had an internet connection pbm and could not post it when I first wrote it. Now I see that fixed the problems with the subclasses.

ogrisel · 2015-04-15T18:17:29Z

sklearn/mixture/tests/test_gmm.py

Great! Thanks for having updated that test.

ogrisel · 2015-04-15T18:22:44Z

@eyaler does that PR meet your requirements from #4579?

LGTM, +1 for merge on my side.

@clorenz7 could please just add a new entry in the section on the new features for 0.17.dev0 in the doc/whats_new.rst file?

clorenz7 · 2015-04-15T18:37:17Z

@ogrisel Added what's new. Thanks for all your help!

amueller · 2015-04-16T15:31:52Z

sklearn/mixture/dpgmm.py

It needs to document its return value.

Done, thanks.

clorenz7 mentioned this pull request Apr 14, 2015

add fit_predict to mixture.GMM #4579

Closed

ogrisel reviewed Apr 14, 2015
View reviewed changes

clorenz7 force-pushed the gmm_fit_predict branch from 9349c48 to 18d4df4 Compare April 14, 2015 21:14

ogrisel reviewed Apr 14, 2015
View reviewed changes

clorenz7 force-pushed the gmm_fit_predict branch from 18d4df4 to 986defe Compare April 15, 2015 17:27

ogrisel reviewed Apr 15, 2015
View reviewed changes

sklearn/mixture/tests/test_gmm.py

Copy link
Copy Markdown

Member

ogrisel Apr 15, 2015

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great! Thanks for having updated that test.

ogrisel changed the title ~~Adding a fit_predict method for the GMM~~ [MRG+1] Adding a fit_predict method for the GMM Apr 15, 2015

clorenz7 force-pushed the gmm_fit_predict branch from 986defe to 234bd07 Compare April 15, 2015 18:36

clorenz7 force-pushed the gmm_fit_predict branch from 234bd07 to ff4d8d2 Compare April 15, 2015 18:44

amueller reviewed Apr 16, 2015
View reviewed changes

Uh oh!

Conversation

clorenz7 commented Apr 14, 2015

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ogrisel commented Apr 14, 2015

Uh oh!

clorenz7 commented Apr 15, 2015

Uh oh!

ogrisel commented Apr 15, 2015

Uh oh!

ogrisel commented Apr 15, 2015

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ogrisel commented Apr 15, 2015

Uh oh!

clorenz7 commented Apr 15, 2015

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants