[WIP] FIX ndcg to work for arbitrarily many samples #9928
jnothman wants to merge 2 commits into scikit-learn:master
Conversation
This is intended to be a quick fix for 0.19.1. I am creating other issues to address shortfalls in the ndcg API and testing.
Sorry to disturb if I'm not qualified to post my opinion here.

```python
y_true = [0, 1, 0, 1]
y_score = [[0.15, 0.85], [0.7, 0.3], [0.06, 0.94], [0.7, 0.3]]
metrics.ndcg_score(y_true, y_score)
```

There seem to be two problems, and the second is not solved here. Also, if the current implementation is right, could we provide a reference for users (and for me :) )? I can't find any reference that is consistent with the current implementation, and the reference in the doc is a dead link. Personally, I still don't think the current implementation is right. It's a simple copy-paste from Kaggle, and I think ogrisel's implementation here is at least what I would use.
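For context, the usual textbook definition being argued about here (and roughly what ogrisel's gist computes) can be sketched as below. This is an illustrative sketch, not the scikit-learn implementation under discussion:

```python
import numpy as np

def dcg(relevances, k):
    """Discounted cumulative gain at rank k, with a log2 discount."""
    relevances = np.asarray(relevances, dtype=float)[:k]
    if relevances.size == 0:
        return 0.0
    # Positions 1..n are discounted by log2(position + 1).
    discounts = np.log2(np.arange(2, relevances.size + 2))
    return float(np.sum(relevances / discounts))

def ndcg(relevances, k):
    """DCG normalized by the best achievable DCG (ideal ordering)."""
    best = dcg(sorted(relevances, reverse=True), k)
    if best == 0:
        return 0.0
    return dcg(relevances, k) / best
```

With this definition, a perfectly ordered ranking scores 1.0 and any worse ordering scores strictly between 0 and 1, which is a useful sanity check against whatever the merged implementation returns.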
Yes, of course, I forgot about the binary case quirk. Thanks.

I'm not certain about what definitions are standard. I don't really think we should be implementing true learning-to-rank metrics in scikit-learn; it is not a task our estimators solve. But we can use multilabel and multiclass evaluations based on ndcg. The current implementation handles the multiclass case, and it should be easy to extend to the multilabel case. But atm I'm just trying to put out fires. The alternative way to do so is to retract the implementation: after all, clearly no one has used it.
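For readers unfamiliar with the "binary case quirk": by scikit-learn convention, binary problems often arrive with labels in a single vector rather than one column per class, so they need to be expanded before a per-class gain computation. A minimal sketch of that normalization, assuming integer class labels (the helper name is hypothetical, not scikit-learn's actual code):

```python
import numpy as np

def one_hot_relevance(y_true, n_classes):
    # Hypothetical helper: expand integer class labels into a 0/1
    # relevance matrix with one column per class, so binary y_true
    # gets two columns just like any other multiclass problem.
    y_true = np.asarray(y_true)
    rel = np.zeros((y_true.shape[0], n_classes))
    rel[np.arange(y_true.shape[0]), y_true] = 1.0
    return rel
```

After this step, each row of the relevance matrix can be scored against the corresponding row of `y_score` without special-casing the two-class problem.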
Fixes #9921
TODO: add test for binary case