[MRG] Add calibration loss metric in classification #12479

Closed
aishgrt1 wants to merge 14 commits into scikit-learn:master from aishgrt1:calibration-loss

Conversation

@aishgrt1
Contributor

Reference Issues/PRs

#10971

What does this implement/fix? Explain your changes.

Added calibration loss metric for classification

Any other comments?

y_prob : array, shape (n_samples,)
    Probabilities of the positive class.
bin_size : int
    Size of the bin (samples) analysed in one iteration
Contributor


In my opinion it is better to say: Size of the bin (samples) analysed on each iteration

Contributor Author


Done

"""
pos_loss = 0.0
neg_loss = 0.0
for bin_start in range(0, len(y_true) - bin_size + 1):
Contributor


Why not just use range(len(y_true) - bin_size + 1)?

Contributor Author


Done

pos_loss /= (len(y_true) - bin_size + 1)
neg_loss /= (len(y_true) - bin_size + 1)
loss = (0.5) * (pos_loss + neg_loss)

Contributor


IMO it would be better to save len(y_true) in a variable to avoid calling len() several times.

Contributor Author


Done

- actual_per_pos_class).sum()
pos_loss += bin_error_pos
actual_per_neg_class = (bin_size - y_true[bin_start:bin_end]
.sum()) / bin_size
Contributor


If bin_size is negative this will raise an exception. Maybe a check should be added beforehand.

Contributor Author


Done
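For context, the logic under review can be pieced together from the excerpts above: a window of bin_size samples slides over the data, the predicted and observed positive rates are compared within each window, and the positive- and negative-class errors are averaged. Below is a minimal, self-contained sketch along those lines, including the bin_size check the reviewer asked for. It is only an illustration of the idea: the PR's exact per-bin error and sample ordering are not fully visible in the excerpts, and this function was never merged into scikit-learn.

import numpy as np

def calibration_loss(y_true, y_prob, bin_size=2):
    # Illustrative sliding-bin calibration loss, not the PR's final code.
    y_true = np.asarray(y_true, dtype=float)
    y_prob = np.asarray(y_prob, dtype=float)
    n_samples = len(y_true)  # cache len(y_true), as suggested in the review
    if not 1 <= bin_size <= n_samples:
        raise ValueError("bin_size must be between 1 and n_samples")

    # Sort by predicted probability so each window groups similar scores
    # (a common convention; the ordering used in the PR is not shown above).
    order = np.argsort(y_prob)
    y_true, y_prob = y_true[order], y_prob[order]

    pos_loss = 0.0
    neg_loss = 0.0
    n_bins = n_samples - bin_size + 1
    for bin_start in range(n_bins):
        bin_end = bin_start + bin_size
        # Observed and predicted positive rates within the window.
        actual_per_pos_class = y_true[bin_start:bin_end].sum() / bin_size
        predicted_per_pos_class = y_prob[bin_start:bin_end].mean()
        pos_loss += abs(predicted_per_pos_class - actual_per_pos_class)
        # Same comparison for the negative class.
        actual_per_neg_class = (bin_size - y_true[bin_start:bin_end].sum()) / bin_size
        predicted_per_neg_class = 1.0 - predicted_per_pos_class
        neg_loss += abs(predicted_per_neg_class - actual_per_neg_class)

    pos_loss /= n_bins
    neg_loss /= n_bins
    return 0.5 * (pos_loss + neg_loss)

On a toy input such as calibration_loss([0, 0, 1, 1], [0.1, 0.2, 0.8, 0.9], bin_size=2), the windowed gaps between predicted and observed rates are small, so the returned loss is correspondingly small.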

Contributor Author


I am getting this error:
AssertionError: Docstring Error: sklearn.metrics.classification.calibration_loss arg mismatch:

    Probabilities of the positive class.
bin_size : int
    Size of the bin (samples) analysed in each iteration
Returns
Contributor

@eamanu eamanu Oct 30, 2018


Try to add a new line below Returns and make the ------- match the length of Returns.

Suggested change:
-Returns
+Returns
+-------

-------
score : float
    Calibration loss
Examples
Contributor

@eamanu eamanu Oct 30, 2018


The same

Suggested change:
-Examples
+Examples
+--------

----------
y_true : array, shape (n_samples,)
    True targets.
y_prob : array, shape (n_samples,)
Contributor


Suggested change:
-y_prob : array, shape (n_samples,)
+y_prob : array, shape (n_samples,)

    True targets.
y_prob : array, shape (n_samples,)
    Probabilities of the positive class.
bin_size : int
Contributor


Suggested change:
-bin_size : int
+bin_size : int

@eamanu
Contributor

eamanu commented Oct 30, 2018

If you apply my comments, that should solve the Travis error.
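For reference, the suggested changes above are just the standard numpydoc layout: each section title (Parameters, Returns, Examples) sits on its own line, is underlined with dashes of the same length as the title, and each parameter description is indented under its name. Assembled from the excerpts in this thread (the summary line, the signature default, and the doctest placeholder are illustrative, not from the PR), the docstring would look roughly like this:

def calibration_loss(y_true, y_prob, bin_size=2):
    """Compute the calibration loss.

    Parameters
    ----------
    y_true : array, shape (n_samples,)
        True targets.
    y_prob : array, shape (n_samples,)
        Probabilities of the positive class.
    bin_size : int
        Size of the bin (samples) analysed in each iteration.

    Returns
    -------
    score : float
        Calibration loss.

    Examples
    --------
    >>> # doctest examples for the metric would go here
    """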

@aishgrt1
Contributor Author

@eamanu Done!!

Contributor

@eamanu eamanu left a comment


LGTM. Please edit the PR to add the [MRG] tag.

@aishgrt1 aishgrt1 changed the title from "Add calibration loss metric in classification" to "[MRG] Add calibration loss metric in classification" on Oct 30, 2018
@aishgrt1
Contributor Author

I need to add my name to the contributors list right?

@jnothman
Member

> I need to add my name to the contributors list right?

The changelog entry is sufficient, but it should be in v0.21.rst, not v0.20.rst

Member

@jnothman jnothman left a comment


I'm not doing a full review yet. This needs to be in doc/modules/{model_evaluation,classes}.rst

@amueller
Member

This needs to add
https://www.math.ucdavis.edu/~saito/data/roc/ferri-class-perf-metrics.pdf
and maybe references therein.
Maybe we should also add that one to the metric docs somewhere in general?

@amueller
Member

There's an interesting discussion of debiasing the calibration error in https://arxiv.org/pdf/1909.10155.pdf

That's a current NeurIPS paper but the method they are discussing is actually already established, so it might be a good candidate. cc @thomasjpfan who has shown interest.
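As a rough illustration of the quantity that discussion starts from: the plain (non-debiased) binned expected calibration error compares average confidence with observed frequency in each probability bin. The sketch below is generic; it is neither this PR's metric nor the debiased estimator from the paper above, just the baseline those works refine.

import numpy as np

def expected_calibration_error(y_true, y_prob, n_bins=10):
    # Plain binned ECE (generic sketch, not from this PR or from scikit-learn).
    y_true = np.asarray(y_true, dtype=float)
    y_prob = np.asarray(y_prob, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for i in range(n_bins):
        lo, hi = edges[i], edges[i + 1]
        if i == n_bins - 1:
            mask = (y_prob >= lo) & (y_prob <= hi)  # last bin includes 1.0
        else:
            mask = (y_prob >= lo) & (y_prob < hi)
        if not mask.any():
            continue
        # Gap between mean confidence and observed positive rate in this bin,
        # weighted by the fraction of samples falling in the bin.
        gap = abs(y_prob[mask].mean() - y_true[mask].mean())
        ece += mask.mean() * gap
    return ece

The debiasing discussed in the paper addresses the finite-sample bias of this plug-in estimate.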

@ogrisel
Member

ogrisel commented Nov 9, 2020

I just realized that #11096 implements the same thing plus additional variations, and has more complete documentation.

Sorry @aishgrt1 for our failure to review your work properly on time. Thanks again for your contribution.

@ogrisel ogrisel closed this Nov 9, 2020
@ogrisel ogrisel added the Superseded (PR has been replaced by a newer PR) label and removed the Stalled label on Nov 9, 2020

Labels

module:metrics, Superseded (PR has been replaced by a newer PR)

5 participants