TST Extend tests for `scipy.sparse.*array` in `sklearn/utils/tests/test_estimator_checks.py` by Tialo · Pull Request #27203 · scikit-learn/scikit-learn

Tialo · 2023-08-29T15:28:17Z

Reference Issues/PRs

Towards #27090.

What does this implement/fix? Explain your changes.

Any other comments?

Added an __init__ to SparseTransformer so that it can transform into either a sparse matrix or a sparse array.
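A minimal sketch of the idea, not the code from this PR: the class name `SparseTransformerSketch` and the parameter name `sparse_container` are assumptions for illustration. The output container is chosen at construction time, so the same test class can produce either a `scipy.sparse` matrix or a `scipy.sparse` array.

```python
import numpy as np
from scipy import sparse


class SparseTransformerSketch:
    """Hypothetical stand-in for the test transformer discussed above."""

    def __init__(self, sparse_container=sparse.csr_matrix):
        # Callable that converts a dense array into a sparse container,
        # e.g. sparse.csr_matrix or sparse.csr_array.
        self.sparse_container = sparse_container

    def fit(self, X, y=None):
        # Remember the number of features seen during fit.
        self.n_features_in_ = np.asarray(X).shape[1]
        return self

    def transform(self, X):
        X = np.asarray(X)
        if X.shape[1] != self.n_features_in_:
            raise ValueError("Bad number of features")
        # Wrap the dense data in whichever container was requested.
        return self.sparse_container(X)


X = np.array([[1.0, 0.0], [0.0, 2.0]])
as_matrix = SparseTransformerSketch(sparse.csr_matrix).fit(X).transform(X)
as_array = SparseTransformerSketch(sparse.csr_array).fit(X).transform(X)
```

Passing the container as a constructor argument keeps a single class and lets a test loop over both container types.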

github-actions · 2023-08-29T15:30:29Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 6b5dfbf.

Tialo · 2023-09-29T21:26:11Z

Also, I wonder why pytest can't be used here? @glemaitre

glemaitre · 2023-09-29T21:30:29Z

Off the top of my head, it comes down to not requiring pytest as a dependency for third-party libraries that want to use the checks available in estimator_checks.py. Because of this, we want to run some tests in test_estimator_checks.py that do not expect pytest to be installed, and it seems that in the past we needed to avoid any pytest import in this file.

This is the meaning of the comment at the top of the file:

# We can not use pytest here, because we run
# build_tools/azure/test_pytest_soft_dependency.sh on these
# tests to make sure estimator_checks works without pytest.
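As an illustration of what a pytest-free test can look like, here is a small sketch that uses only the standard library. The names `raises_without_pytest` and `check_positive` are hypothetical and are not scikit-learn helpers; the point is only that plain `assert` statements and a small context manager can stand in for `pytest.raises`.

```python
import contextlib
import importlib.util


@contextlib.contextmanager
def raises_without_pytest(expected_exception, match=None):
    """Hypothetical stand-in for pytest.raises using only the stdlib."""
    try:
        yield
    except expected_exception as exc:
        # The expected exception was raised; optionally check its message.
        if match is not None and match not in str(exc):
            raise AssertionError(f"{match!r} not found in {str(exc)!r}") from exc
    else:
        raise AssertionError(f"{expected_exception.__name__} was not raised")


def check_positive(value):
    """Toy check used only to have something to test."""
    if value <= 0:
        raise ValueError("value must be positive")
    return value


def test_check_positive():
    assert check_positive(3) == 3
    with raises_without_pytest(ValueError, match="must be positive"):
        check_positive(-1)


# Detect whether pytest is installed without importing it.
PYTEST_AVAILABLE = importlib.util.find_spec("pytest") is not None

test_check_positive()
```

A test module written this way can be run directly with the Python interpreter, whether or not pytest is installed.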

glemaitre · 2023-09-30T08:59:52Z

sklearn/utils/estimator_checks.py


    if sparse.issparse(result_full):
-        result_full = result_full.A
+        result_full = result_full.toarray()


This was the last error due to the deprecation.
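A short sketch of why `toarray()` is the safer spelling: it is available on both `scipy.sparse` matrices and `scipy.sparse` arrays and returns a plain NumPy array in both cases. The example data and the choice of CSR containers are assumptions for illustration.

```python
import numpy as np
from scipy import sparse

dense = np.array([[0.0, 1.0], [2.0, 0.0]])

converted = {}
for container in (sparse.csr_matrix, sparse.csr_array):
    result_full = container(dense)
    if sparse.issparse(result_full):
        # toarray() is the explicit method spelling and works for both
        # container types.
        result_full = result_full.toarray()
    converted[container.__name__] = result_full
```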

glemaitre

LGTM Thanks @Tialo

jjerphan · 2023-10-01T11:19:42Z

@StefanieSenger: I do not know if you have been reviewing pull requests, but I think you should be able to review PRs for #27090, starting with this one for instance. :)

This is just a suggestion, and you might already be focusing on something else: feel free to review those PRs or not.

sklearn/utils/tests/test_estimator_checks.py

StefanieSenger · 2023-10-02T13:54:29Z

Thank you @jjerphan for your trust. Though, I'm not sure if I can be of use here.

This is the first review that I'm trying. The changes in the code make sense to me, but I don't have anything to add except for a question. This is not really a review, I'm afraid.

Edit: I've just seen that I'm not done yet. I need some time to understand check_estimator_sparse_data. I cannot work on it for the next 1.5 days, but I will get back to it as soon as I can manage.

StefanieSenger · 2023-10-03T05:51:26Z

So, the idea behind check_estimator_sparse_data, if I understand correctly, is to check whether a badly configured estimator would trigger one of the error messages "doesn't seem to fail gracefully". When LargeSparseNotSupportedClassifier is tested in test_estimator_checks.py, this is supposed to happen (because it claims to accept large sparse data (accept_large_sparse=True), but doesn't).

But check_estimator_sparse_data is only run on sparse matrices, not sparse arrays, and so the test is limited to sparse matrices as well. I have no idea if this means something for this PR. (Why is #27090 only concerned with the tests and not with the rest of the codebase?) Should check_estimator_sparse_data be made to iterate through both types?

glemaitre · 2023-10-03T08:28:15Z

@StefanieSenger Thanks for noting this issue. I think we should address it in another PR because, as you mentioned, we are first trying to fix the test files.

However, we want the scikit-learn estimators to accept sparse arrays, and this should also be true for estimators that are compatible with scikit-learn. So we will need to create/duplicate the check_sparse_* functions from estimator_checks to test for sparse arrays. I assume that we want a mechanism for people to deactivate the test, so we would need a new tag. Up to now, I think that "sparse" was never used in the common tests. So we could have "sparse matrix" and "sparse array" for the X_types. I will open a new issue to track this problem.
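The proposal above could look roughly like the following sketch. Everything here is hypothetical: the tag values "sparse matrix" and "sparse array" are only suggested in the comment, and `ToyEstimator`, its `x_types` attribute, and `run_sparse_checks` are invented for illustration and do not exist in scikit-learn. The sketch runs one check per sparse container and skips containers the estimator does not declare.

```python
import numpy as np
from scipy import sparse

# Hypothetical mapping from proposed X_types values to containers.
SPARSE_CONTAINERS = {
    "sparse matrix": sparse.csr_matrix,
    "sparse array": sparse.csr_array,
}


class ToyEstimator:
    # Hypothetical tag declaration: this estimator opts out of sparse arrays.
    x_types = ["2darray", "sparse matrix"]

    def fit(self, X, y=None):
        if sparse.issparse(X):
            X = X.toarray()
        self.n_features_in_ = np.asarray(X).shape[1]
        return self


def run_sparse_checks(estimator, X_dense):
    """Fit on each declared sparse container; skip the undeclared ones."""
    results = {}
    for tag, container in SPARSE_CONTAINERS.items():
        if tag not in estimator.x_types:
            results[tag] = "skipped"
            continue
        estimator.fit(container(X_dense))
        results[tag] = "passed"
    return results


report = run_sparse_checks(ToyEstimator(), np.eye(3))
```

With separate tag values, a third-party estimator that does not yet support sparse arrays could opt out of only the sparse array checks while still being tested on sparse matrices.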

jjerphan

Thank you, @Tialo.

TST add sparse containers

c0fa849

github-actions bot added the module:utils label Aug 29, 2023

OmarManzoor added the No Changelog Needed label Aug 30, 2023

jjerphan mentioned this pull request Aug 30, 2023

TST Extend tests for scipy.sparse.*array #27090

Closed

glemaitre self-requested a review September 13, 2023 20:11

glemaitre added 3 commits September 13, 2023 22:12

Merge branch 'main' into tests/test_estimator_checks

0c9f6c4

we cannot use pytest

b72ff93

Merge branch 'main' into tests/test_estimator_checks

d541d94

handle deprecation warning

6b5dfbf

glemaitre reviewed Sep 30, 2023

glemaitre approved these changes Sep 30, 2023

StefanieSenger reviewed Oct 2, 2023

glemaitre added the Waiting for Second Reviewer label Oct 31, 2023

jjerphan approved these changes Nov 1, 2023

jjerphan merged commit 9621539 into scikit-learn:main Nov 1, 2023

Tialo deleted the tests/test_estimator_checks branch November 1, 2023 22:24

REDVM pushed a commit to REDVM/scikit-learn that referenced this pull request Nov 16, 2023

TST Extend tests for scipy.sparse.*array in `sklearn/utils/tests/te…

2244309

…st_estimator_checks.py` (scikit-learn#27203) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

jjerphan merged 5 commits into scikit-learn:main from Tialo:tests/test_estimator_checks
