[MRG+1] FIX consensus score on non-square similarity matrices by untom · Pull Request #3640 · scikit-learn/scikit-learn

untom · 2014-09-05T09:04:42Z

This PR fixes #2445: when the similarity matrix is non-square (i.e. the two sets contain different numbers of biclusters), bicluster.consensus_score returns wrong results.
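For readers unfamiliar with the metric, here is a minimal pure-Python sketch of what a consensus score computes when the two sets have different sizes. It is an illustration under stated assumptions, not the scikit-learn implementation: the names `cells`, `jaccard` and `consensus_score_sketch` are made up for this example, biclusters are represented as sets of `(row, column)` cells, and the best matching is found by brute force rather than by the Kuhn-Munkres algorithm.

```python
from itertools import permutations


def cells(rows, cols):
    """Hypothetical helper: a bicluster as the set of (row, col) cells it covers."""
    return {(r, c) for r in rows for c in cols}


def jaccard(a, b):
    """Jaccard index of two biclusters given as sets of cells."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def consensus_score_sketch(found, reference):
    """Best one-to-one matching of biclusters, normalised by the larger set size."""
    if not found or not reference:
        return 0.0
    # sim[i][j] compares found[i] with reference[j]; the matrix is
    # len(found) x len(reference), so it is not square in general.
    sim = [[jaccard(f, r) for r in reference] for f in found]
    n_small, n_large = len(found), len(reference)
    if n_small > n_large:
        # Transpose so that the rows always belong to the smaller set.
        sim = [list(col) for col in zip(*sim)]
        n_small, n_large = n_large, n_small
    # Brute force (assumption: only a handful of biclusters): try every way
    # of giving each row a distinct column and keep the best total.
    best = max(
        sum(sim[i][j] for i, j in enumerate(cols))
        for cols in permutations(range(n_large), n_small)
    )
    return best / n_large


a = cells({0, 1}, {0, 1})
b = cells({2, 3}, {2, 3})
print(consensus_score_sketch([a], [a, b]))  # one of two reference biclusters recovered
```

In this sketch, recovering one of two reference biclusters exactly gives 0.5, because the matched similarity is divided by the size of the larger set.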

coveralls · 2014-09-05T09:17:51Z

Coverage remained the same when pulling 308c7ef on untom:fix_consensus_score into 3c92686 on scikit-learn:master.

jnothman · 2014-09-06T10:37:46Z

fine, but needs a test

untom · 2014-09-07T10:14:35Z

Added a test that shows the issue (it fails on current master and passes with this patch).

coveralls · 2014-09-07T10:17:16Z

Coverage increased (+0.0%) when pulling 171139f on untom:fix_consensus_score into 3c92686 on scikit-learn:master.

untom · 2014-11-12T12:40:01Z

Pinging potential reviewers (@arjoly, @jnothman?). I was bitten by bug #2445 again today, and I'd like to make sure this patch doesn't fall through the cracks.

jnothman · 2014-11-12T12:49:32Z

This LGTM.

Aside: I am altogether not very happy with the dense data format that the bicluster module currently produces (see #2484). And do you find, @untom, that consensus_score performs reasonably efficiently? In other work where I have used a similar metric (for coreference resolution, which is a special kind of clustering), there is a lot of sparsity in the similarity matrix passed to Kuhn-Munkres (i.e. most pairs of clusters have no overlap in a standard system output), so one gets a substantial speedup by only running that algorithm on the strongly connected components of the similarity graph. Do you think a similar strategy would be beneficial here?
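The decomposition described above can be sketched roughly as follows. This is only an illustration, not code from this PR or from scikit-learn: the function names are hypothetical, rows and columns of the similarity matrix are grouped with a small union-find, and a brute-force search stands in for Kuhn-Munkres inside each component. It relies on similarities being non-negative, so pairs from different components (similarity 0) never add to the total.

```python
from itertools import permutations


def components(sim):
    """Group rows and columns that are linked by a nonzero similarity."""
    n_rows = len(sim)
    n_cols = len(sim[0]) if sim else 0
    parent = list(range(n_rows + n_cols))  # column j is node n_rows + j

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i in range(n_rows):
        for j in range(n_cols):
            if sim[i][j] > 0:
                parent[find(i)] = find(n_rows + j)

    groups = {}
    for node in range(n_rows + n_cols):
        rows, cols = groups.setdefault(find(node), ([], []))
        if node < n_rows:
            rows.append(node)
        else:
            cols.append(node - n_rows)
    return list(groups.values())


def best_total(sim, rows, cols):
    """Optimal matching weight inside one component (brute-force stand-in)."""
    if len(rows) > len(cols):
        return max(
            sum(sim[i][j] for i, j in zip(perm, cols))
            for perm in permutations(rows, len(cols))
        )
    return max(
        sum(sim[i][j] for i, j in zip(rows, perm))
        for perm in permutations(cols, len(rows))
    )


def matched_similarity(sim):
    """Sum of per-component optima; equals the optimum over the full matrix."""
    return sum(best_total(sim, rows, cols) for rows, cols in components(sim))


# A 3 x 4 (non-square) similarity matrix that splits into small blocks.
sim = [
    [0.5, 0.2, 0.0, 0.0],
    [0.1, 0.4, 0.0, 0.0],
    [0.0, 0.0, 0.9, 0.0],
]
print(len(components(sim)), matched_similarity(sim))
```

Whether this pays off depends on how sparse the similarity matrix is; with only a few biclusters the full assignment problem is already small.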

jnothman · 2014-11-12T12:49:47Z

And thanks for the ping!

untom · 2014-11-12T13:02:33Z

I absolutely agree with the issue raised in #2484. I found the chosen data format odd as well, back when I started working on PR #2476. As far as performance goes, I've never seen consensus_score come up as a bottleneck in my code (though I've never worked with large numbers of biclusters, usually fewer than 10-20).

jnothman · 2014-11-12T13:17:11Z

Ah of course. Thanks.

untom · 2015-01-11T23:56:17Z

Pinging @arjoly: could this get an MRG+2?

untom · 2015-01-12T00:02:34Z

Or maybe @amueller is the right person to ping? (he added the issue to the 0.15.1 milestone)

amueller · 2015-01-13T23:44:35Z

The fix looks good. Thanks. Sorry the fix was lying around for so long.

FIX consensus score on non-square similarity matrices

308c7ef

ENH add testcase for issue 2445

171139f

untom changed the title ~~FIX consensus score on non-square similarity matrices~~ [MRG] FIX consensus score on non-square similarity matrices Oct 2, 2014

MechCoder force-pushed the master branch from 6deaea0 to 3f49cee November 3, 2014 12:36

jnothman changed the title ~~[MRG] FIX consensus score on non-square similarity matrices~~ [MRG+1] FIX consensus score on non-square similarity matrices Nov 12, 2014

amueller added a commit that referenced this pull request Jan 13, 2015

Merge pull request #3640 from untom/fix_consensus_score

c579244

amueller merged commit c579244 into scikit-learn:master Jan 13, 2015

amueller merged 2 commits into scikit-learn:master from untom:fix_consensus_score
