
[MRG+2] Modification of GaussianMixture class. #7123

Merged: agramfort merged 5 commits into scikit-learn:master from tguillemot:gmm-modif on Aug 10, 2016

Conversation

@tguillemot
Contributor

This PR is to simplify the integration of the BayesianGaussianMixture class (#6651).
I've simplified the GaussianMixture class by integrating a function which computes the determinant of the Cholesky decomposition of the precision matrix (which will be useful for BayesianGaussianMixture).
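
For context, the Cholesky shortcut can be sketched as follows. This is an illustrative helper, not the actual scikit-learn function: for a symmetric positive-definite precision matrix P with Cholesky factor L (P = L @ L.T), det(P) = prod(diag(L))**2, so the log-determinant can be read directly off the factor's diagonal.

```python
import numpy as np

def log_det_cholesky(precision):
    # Hypothetical helper (not the scikit-learn implementation):
    # for P = L @ L.T, log det(P) = 2 * sum(log(diag(L))).
    chol = np.linalg.cholesky(precision)
    return 2.0 * np.sum(np.log(np.diag(chol)))

P = np.array([[2.0, 0.5],
              [0.5, 1.0]])
assert np.isclose(log_det_cholesky(P), np.log(np.linalg.det(P)))
```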

I've also corrected a bug in the EM process: normally, the lower bound is computed after the M-step, not after the E-step. This is not a problem for GMM, but it creates problems for BayesianGaussianMixture.
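
An illustrative sketch of that ordering (not the actual scikit-learn code; `e_step`, `m_step` and `compute_lower_bound` are hypothetical callables standing in for the estimator's methods): the lower bound for an iteration is stored after the M-step has updated the parameters.

```python
import numpy as np

def em_fit(X, e_step, m_step, compute_lower_bound, max_iter=100, tol=1e-3):
    lower_bound = -np.inf  # the bound is maximized, so start at -inf
    for _ in range(max_iter):
        prev_lower_bound = lower_bound
        log_prob_norm, resp = e_step(X)   # E-step
        m_step(X, resp)                   # M-step
        # The bound for this iteration is recorded after the M-step.
        lower_bound = compute_lower_bound(resp, log_prob_norm)
        if abs(lower_bound - prev_lower_bound) < tol:
            break
    return lower_bound
```

With a trivial one-parameter model (fitting a mean), the loop converges in a few iterations and `prev_lower_bound` always refers to the previous iteration's post-M-step value.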

@agramfort @ogrisel @amueller Can you have a look?

The purpose here is to prepare the integration of BayesianGaussianMixture.
if do_init:
    self._initialize_parameters(X)
-    current_log_likelihood, resp = self._e_step(X)
+    self.lower_bound_ = np.infty
Member

shouldn't you document the new attribute?

Contributor Author

Indeed, sorry for that mistake.

Member

I didn't know np.infty also returns np.inf... Interesting...

Contributor

Shouldn't it be self.lower_bound_ = -np.infty ?

Contributor

It has been merged but I re-ask the question: shouldn't it be -np.infty ?

Contributor Author

@ngoix Sorry, I forgot that; indeed. I don't know why I missed your comment.
Sorry for that. I've fixed it in #7180.
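
To illustrate why the starting value matters for a quantity that is maximized (an illustrative sketch, not the scikit-learn code; the per-run values are made up): the running "best" across `n_init` runs must start at `-inf`, otherwise no run can ever beat the initial value.

```python
import numpy as np

# Keeping the best lower bound across several initializations:
# starting at +inf would make every comparison fail, so the
# "best run" bookkeeping only works with a -inf starting value.
best_lower_bound = -np.inf
for run_lower_bound in [-150.2, -120.7, -130.1]:  # made-up per-run values
    if run_lower_bound > best_lower_bound:
        best_lower_bound = run_lower_bound
assert best_lower_bound == -120.7
```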

@agramfort
Member

that's it for me

@tguillemot tguillemot changed the title Modification of GaussianMixture class. [MRG] Modification of GaussianMixture class. Aug 1, 2016
@tguillemot tguillemot changed the title [MRG] Modification of GaussianMixture class. [MRG+1] Modification of GaussianMixture class. Aug 2, 2016
@tguillemot
Contributor Author

@amueller @ogrisel @raghavrv If you have time to do another review :)

X : array-like, shape (n_samples, n_features)
"""
-    n_samples = X.shape[0]
+    n_samples, _ = X.shape
Member

Why this change?

Member

Sorry if this was suggested before...

Contributor Author
Contributor Author

tguillemot commented Aug 2, 2016


It's just that I prefer it like this, and it also checks that the shape of X has exactly two elements.
Sorry for these little modifications.

Member

Okay. Thx!

Member

It also checks implicitly that X.ndim == 2. I do the same in my code.
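
A quick illustration of why the tuple-unpacking spelling is stricter: `X.shape[0]` works for any array, while `n_samples, _ = X.shape` raises a `ValueError` unless `X` is exactly 2-D.

```python
import numpy as np

X2 = np.zeros((10, 3))
n_samples, _ = X2.shape        # fine: X2 is 2-D, n_samples == 10

X1 = np.zeros(10)
try:
    n_samples, _ = X1.shape    # shape is (10,): unpacking raises
except ValueError:
    pass                       # 1-D input is rejected implicitly
```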

n_iter_ : int
    Number of steps used by the best fit of EM to reach convergence.

lower_bound_ : float
Member

lower_bound_ -> best_log_likelihood_ ?

Contributor Author

In fact, for GMM this is the best log-likelihood, but for VBGMM it is a lower bound.
I chose lower_bound_ because it is the most understandable name.

Contributor Author

Done.

@tguillemot
Contributor Author

@amueller @ogrisel I really need this to be merged before you review #6651.

@tguillemot
Contributor Author

@ngoix This PR is related to the Bayesian Gaussian mixture; could you review it if you have some time?
Thx in advance :)

@agramfort
Member

one more +1 and we're good here...


for n_iter in range(self.max_iter):
-    prev_log_likelihood = current_log_likelihood
+    prev_log_likelihood = self.lower_bound_
Contributor

prev_log_likelihood -> prev_lower_bound ?

@ngoix
Contributor

ngoix commented Aug 8, 2016

By testing the code, I realized that there is a problem which existed before this PR:
in gaussian_mixture._set_parameters(), self.covariances_ is not updated. As a result, with n_init > 1, the covariances that are output do not correspond to the means.


def _compute_lower_bound(self, _, log_prob_norm):
    return log_prob_norm

Contributor

In _set_parameters below, compute self.covariances_ from self.precisions_.
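
A minimal sketch of that suggestion, assuming the 'full' covariance type (one symmetric positive-definite matrix per component). This is an illustrative helper, not the actual `_set_parameters` code, which also has to handle the 'tied', 'diag' and 'spherical' cases:

```python
import numpy as np

def covariances_from_precisions(precisions):
    # Hypothetical helper: each covariance matrix is the inverse of
    # the corresponding precision matrix.
    return np.array([np.linalg.inv(p) for p in precisions])

precisions = np.array([[[4.0, 0.0],
                        [0.0, 2.0]]])
covariances = covariances_from_precisions(precisions)
# Inverting a diagonal precision yields a diagonal covariance.
```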

@ngoix
Contributor

ngoix commented Aug 8, 2016

That's all from me!

@tguillemot
Contributor Author

@ngoix Indeed, you're right; I missed that. Thanks.
I've corrected it and added a test to check it.

@tguillemot
Contributor Author

@agramfort merge ?
