-
-
Notifications
You must be signed in to change notification settings - Fork 26.9k
Incorrect interpretation of Brier score loss in docstring #10883
Copy link
Copy link
Closed
Labels
Description
scikit-learn/sklearn/metrics/classification.py
Line 1852 in a24c8b4
| Therefore, the lower the Brier score is for a set of predictions, the |
Therefore, the lower the Brier score is for a set of predictions, the better the predictions are calibrated.
As far as I can tell, this claim is incorrect (also incorrect in the Wikipedia entry cited) and should be deleted or detailed (one component of the two-term decomposition of the Brier score does assess calibration).
Here's a counter-example on a dataset with 12 points and 2 estimators.
| model A | model B | y |
|---|---|---|
| 0.00 | 0.00 | 0 |
| 0.25 | 0.25 | 0 |
| 0.25 | 0.25 | 0 |
| 0.25 | 0.25 | 0 |
| 0.25 | 0.75 | 1 |
| 0.50 | 0.50 | 0 |
| 0.50 | 0.50 | 1 |
| 0.75 | 0.25 | 0 |
| 0.75 | 0.75 | 1 |
| 0.75 | 0.75 | 1 |
| 0.75 | 0.75 | 1 |
| 1.00 | 1.00 | 1 |
Brier scores:
| model A | model B |
|---|---|
| 0.166666666667 | 0.0833333333333 |
Calibration curves:
Reactions are currently unavailable
