FEA add confusion_matrix_at_thresholds #30134
Conversation
adrinjalali
left a comment
You'd also need to add this in api_reference.py under the right section to have it rendered in the docs properly.
@glemaitre you happy with the name?
I think I'm fine with the name. I was looking if we can have the word
Co-authored-by: Guillaume Lemaitre <guillaume@probabl.ai>
Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>
@adrinjalali There is an issue with numpydoc validation in the
When you look at the CI log, this is the error, not the return: [gw0] linux -- Python 3.12.7 /usr/share/miniconda/envs/testvenv/bin/python
810 Decreasing score values.
811
812 Examples
813 -------
814 >>> import numpy as np
815 >>> from sklearn.metrics import binary_classification_curve
816 >>> y_true = np.array([0, 0, 1, 1])
817 >>> y_scores = np.array([0.1, 0.4, 0.35, 0.8])
818 >>> fps, tps, thresholds = binary_classification_curve(y_true, y_scores)
819 >>> fps
Expected:
array([0, 1, 1, 2])
Got:
array([0., 1., 1., 2.])

You just need to fix the output to floats.
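For context, here is a minimal numpy-only sketch of the kind of computation the doctest above exercises (the cumulative-sum approach mirrors the doctest's expected values; the actual scikit-learn implementation also supports sample weights, which is why its counts come out as floats):

```python
import numpy as np

def binary_clf_curve_sketch(y_true, y_score):
    # Sketch only: illustrates per-threshold fp/tp counting, not the real sklearn code.
    # Sort samples by decreasing score.
    desc = np.argsort(y_score, kind="stable")[::-1]
    y_score = y_score[desc]
    y_true = y_true[desc]
    # Candidate thresholds sit at the last index of each run of equal scores.
    distinct = np.where(np.diff(y_score))[0]
    threshold_idxs = np.r_[distinct, y_true.size - 1]
    # Cumulative true positives at each threshold; false positives are the rest.
    tps = np.cumsum(y_true)[threshold_idxs]
    fps = 1 + threshold_idxs - tps
    return fps, tps, y_score[threshold_idxs]

fps, tps, thresholds = binary_clf_curve_sketch(
    np.array([0, 0, 1, 1]), np.array([0.1, 0.4, 0.35, 0.8])
)
print(fps)  # [0 1 1 2]
```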
jeremiedbb
left a comment
Thanks for the PR @SuccessMoses. I directly pushed the requested change to return negatives as well. I also added a small smoke test; we don't need more since it's heavily tested through the other curve functions.
LGTM.
I wonder if
After some discussion with @glemaitre and @ogrisel, we converged toward
I tested with both and I find the second option a bit too long since it makes all calls to this function multi-line, hence hurting the readability a bit. Since a confusion matrix per threshold doesn't make sense for multiclass (with a single threshold, as we used to do in sklearn), it didn't feel that necessary. So in the end I went for
# For binary problems, :func:`sklearn.metrics.confusion_matrix` has the ``ravel`` method
# we can use to get counts of true negatives, false positives, false negatives and
# true positives.
should we remove this paragraph? It's not clear to me here.
I've added a line, just to link the two paragraphs, but not sure if it is ideal either.
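For reference, a quick example of the ``ravel`` idiom the quoted paragraph describes (the data here is made up for illustration):

```python
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1]
y_pred = [0, 1, 0, 1]
# For binary problems the 2x2 matrix flattens row-major to (tn, fp, fn, tp).
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tn, fp, fn, tp)  # 1 1 1 1
```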
Docs are failing @jeremiedbb
Just need to update the import in
confusion_matrix_at_thresholds
lucyleeow
left a comment
Do we want to amend the tests names (e.g., test_binary_clf_curve_multiclass_error) to use the new name?
# we can use to get counts of true negatives, false positives, false negatives and
# true positives.
#
# :func:`sklearn.metrics.binary_classification_curve`
(2, 1, 2, 3)
With :func:`confusion_matrix_at_thresholds` we can get true negatives, false positives,
false negatives and true positives for different thresholds::
For the new user, should we expand on how we determine the thresholds used?
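As a rough answer to that question: in sklearn's curve functions, the candidate thresholds are the distinct predicted scores, taken in decreasing order. A numpy-only sketch of per-threshold counts (the function name and signature here are illustrative, not the actual PR API):

```python
import numpy as np

def confusion_matrix_at_thresholds_sketch(y_true, y_score):
    # Sketch only: one (tn, fp, fn, tp) row per distinct score value.
    y_true = np.asarray(y_true)
    y_score = np.asarray(y_score)
    # Candidate thresholds: distinct scores, in decreasing order.
    thresholds = np.unique(y_score)[::-1]
    rows = []
    for t in thresholds:
        y_pred = y_score >= t
        tp = np.sum(y_pred & (y_true == 1))
        fp = np.sum(y_pred & (y_true == 0))
        fn = np.sum(~y_pred & (y_true == 1))
        tn = np.sum(~y_pred & (y_true == 0))
        rows.append((tn, fp, fn, tp))
    return np.asarray(rows), thresholds

rows, thresholds = confusion_matrix_at_thresholds_sketch(
    [0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]
)
print(rows[0])  # counts at the highest threshold: [2 0 1 1]
```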
lucyleeow
left a comment
I've directly pushed my suggested changes, to help move this along, and fixed the doc failure.
Also checked the docs render okay.
Actually, still one last comment we may want to address here:
@adrinjalali or @jeremiedbb feel free to merge if you're happy!
Reference Issues/PRs
What does this implement/fix? Explain your changes.
Fixes #16470
Any other comments?
In `sklearn/metrics/_ranking.py`, I changed the name of the function `_binary_clf_curve` to `binary_classification_curve` without changing the body. I also renamed test functions like `test_binary_clf_curve_multiclass_error` without changing their bodies. `det_curve`, `roc_curve` and `precision_recall_curve` call this function, so I updated its name in their bodies.