DOC Ensures that sklearn.datasets._base.load_breast_cancer passes numpydoc validation by DennisOsei · Pull Request #22266 · scikit-learn/scikit-learn

DennisOsei · 2022-01-21T18:59:01Z

Reference Issues/PRs

Addresses #21350

What does this implement/fix? Explain your changes.

Removed sklearn.datasets._base.load_breast_cancer from FUNCTION_DOCSTRING_IGNORE_LIST.
Changed the return value in sklearn.datasets._base.load_diabetes back to the original version.
Added spaces before and after the colon of the return values in sklearn.datasets._base.load_breast_cancer.
Changed the return value description in sklearn.datasets._base.load_breast_cancer to:
A tuple of two ndarrays. The first contains a 2D array of shape (569, 30)
with each row representing one sample and each column representing the features.
The second array of shape (569,) contains the target samples.
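
For readers who want to check the shapes quoted in the description, the following is a minimal sketch, assuming scikit-learn is installed. It only calls `load_breast_cancer` with `return_X_y=True`; the variable names `X` and `y` are illustrative and not taken from the PR.

```python
from sklearn.datasets import load_breast_cancer

# With return_X_y=True the loader returns a (data, target) tuple
# instead of the Bunch object it returns by default.
X, y = load_breast_cancer(return_X_y=True)

# One row per sample, one column per feature.
print(type(X).__name__, X.shape)

# One target value per sample.
print(type(y).__name__, y.shape)
```

The printed shapes should match the (569, 30) and (569,) values stated in the proposed docstring.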

Any other comments?

I'm still getting the "return value has no description" error.

ArturoAmorQ · 2022-01-27T09:45:17Z

sklearn/datasets/_base.py

+        A tuple of two ndarrays. The first contains a 2D array of shape (569, 30) 
+        with each row representing one sample and each column representing the features. 
+        The second array of shape (569,) contains the target samples.


Suggested change

-        A tuple of two ndarrays. The first contains a 2D array of shape (569, 30)
-        with each row representing one sample and each column representing the features.
-        The second array of shape (569,) contains the target samples.
+        A tuple of two ndarrays by default. The first contains a 2D ndarray of
+        shape (569, 30) with each row representing one sample and each column
+        representing the features. The second ndarray of shape (569,) contains
+        the target samples. If `as_frame=True`, both arrays are pandas objects.

Maybe being more explicit here could make the description easier to understand.

Also try to follow the PEP 8 convention, i.e. limit all lines to a maximum of 79 characters and avoid blank spaces at the end of each line.
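
As an illustration of the two style points above, here is a small standard-library sketch that flags overlong lines and trailing whitespace. The helper name `style_problems` is hypothetical, and the 79-character limit is the one quoted in this comment, not a value read from the project's configuration. The sample text reproduces the three added lines from the diff, including their trailing spaces.

```python
def style_problems(text, max_length=79):
    """Return (line_number, message) pairs for style issues in text."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > max_length:
            problems.append((number, "line too long"))
        if line != line.rstrip():
            problems.append((number, "trailing whitespace"))
    return problems


# The three docstring lines as originally added, with 8 spaces of indentation.
original = (
    "        A tuple of two ndarrays. The first contains a 2D array"
    " of shape (569, 30) \n"
    "        with each row representing one sample and each column"
    " representing the features. \n"
    "        The second array of shape (569,) contains the target samples.\n"
)

for number, message in style_problems(original):
    print(f"line {number}: {message}")
```

In practice a linter such as flake8 reports the same kinds of issues; the sketch only shows what is being checked.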

> I'm still getting the "return value has no description" error.

The following lines of code are creating the error:

The copy of UCI ML Breast Cancer Wisconsin (Diagnostic) dataset is downloaded from: https://goo.gl/U2Uwz2

I propose moving them to the description in the header below the table.
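
To illustrate why a stray paragraph can trigger this kind of error, here is a simplified sketch that treats every unindented line in a Returns section as an entry header and reports the entries with no indented description. It is not numpydoc's implementation. The function name and both sample docstrings are hypothetical, and only the sentence about the dataset copy comes from the docstring under discussion.

```python
import inspect


def _is_underline(line):
    text = line.strip()
    return bool(text) and set(text) == {"-"}


def returns_entries_without_description(docstring):
    """List unindented Returns entries that have no indented description."""
    lines = inspect.cleandoc(docstring).splitlines()
    start = None
    for i in range(len(lines) - 1):
        if lines[i].strip() == "Returns" and _is_underline(lines[i + 1]):
            start = i + 2
            break
    if start is None:
        return []
    missing = []
    i = start
    while i < len(lines):
        line = lines[i]
        # Stop when the next section header (text + dashed underline) begins.
        if line.strip() and i + 1 < len(lines) and _is_underline(lines[i + 1]):
            break
        if not line.strip() or line.startswith(" "):
            i += 1
            continue
        # Unindented line: an entry header; collect its indented description.
        j = i + 1
        has_description = False
        while j < len(lines) and (
            not lines[j].strip() or lines[j].startswith(" ")
        ):
            has_description = has_description or bool(lines[j].strip())
            j += 1
        if not has_description:
            missing.append(line)
        i = j
    return missing


# Hypothetical docstring with the paragraph left inside the Returns section.
BROKEN = """
Load the dataset.

Returns
-------
(data, target) : tuple if ``return_X_y`` is True
    A tuple of two ndarrays.

The copy of UCI ML Breast Cancer Wisconsin (Diagnostic) dataset is
downloaded from: https://goo.gl/U2Uwz2
"""

# Same content with the paragraph moved up into the description.
FIXED = """
Load the dataset.

The copy of UCI ML Breast Cancer Wisconsin (Diagnostic) dataset is
downloaded from: https://goo.gl/U2Uwz2

Returns
-------
(data, target) : tuple if ``return_X_y`` is True
    A tuple of two ndarrays.
"""

print(returns_entries_without_description(BROKEN))
print(returns_entries_without_description(FIXED))
```

Under this simplified reading, the two paragraph lines in `BROKEN` look like return entries with nothing under them, while `FIXED` reports none.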

DennisOsei · Jan 31, 2022

Thanks for the help, @ArturoAmorQ, it worked.

glemaitre · 2022-01-27T12:52:04Z

@DennisOsei In addition to @ArturoAmorQ's comment, could you resolve the conflict by merging main into your branch?

DennisOsei added 2 commits January 20, 2022 23:48

added return description

2f1908b

added return description

5b31543

github-actions bot added module:datasets Documentation labels Jan 21, 2022

ArturoAmorQ reviewed Jan 27, 2022

thomasjpfan mentioned this pull request Jan 29, 2022

Ensure that functions's docstrings pass numpydoc validation #21350

Closed

DennisOsei closed this Jan 31, 2022

ArturoAmorQ mentioned this pull request Jan 31, 2022

DOC Ensures that sklearn.datasets._base.load_breast_cancer passes numpydoc validation #22346

Merged

DennisOsei wants to merge 2 commits into scikit-learn:main from DennisOsei:numpydoc-validation

3 participants
