Better documentation for `RFECV`

There is almost no description in the documentation of how `RFECV` actually works. The [user guide](https://scikit-learn.org/stable/modules/feature_selection.html#rfe) simply says

> [RFECV](https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.RFECV.html#sklearn.feature_selection.RFECV) performs RFE in a cross-validation loop to find the optimal number of features.

and the [API page](https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.RFECV.html) simply says

> Recursive feature elimination with cross-validation to select features. 

My best guess for what `RFECV` is actually doing is the following.

1. Start with all features.
2. Do the following (in either order):
    a) Fit the estimator on all rows of `X` (for the current subset of features). Use `coef_`, `feature_importances_`, or a callable (passed as `importance_getter`) to select the feature(s) that will be removed in the next round.
    b) Run cross-validation with the estimator on `X` to estimate the score (as defined by `scoring`) of the estimator trained on the current subset of features.
3. Remove the features chosen for removal in step 2a.
4. Repeat steps 2 and 3 until the minimum number of features has been reached.
5. Select the set of features with the highest mean CV score calculated in step 2b. (This set of features is recorded in the `support_` attribute.)
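
To make the guess concrete, the steps above can be sketched roughly as follows. This is only my reading of the procedure, not scikit-learn's actual implementation; the function name `guessed_rfecv`, its parameters, and the simplified ranking by absolute coefficient size are my own assumptions for illustration.

```python
import numpy as np
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score


def guessed_rfecv(estimator, X, y, min_features=1, step=1, cv=5):
    """Sketch of the procedure guessed above; NOT scikit-learn's code."""
    support = np.ones(X.shape[1], dtype=bool)  # step 1: start with all features
    history = []  # (mean CV score, feature mask) for every subset tried
    while True:
        cols = np.flatnonzero(support)
        # Step 2b: cross-validated score for the current subset of features.
        score = cross_val_score(clone(estimator), X[:, cols], y, cv=cv).mean()
        history.append((score, support.copy()))
        if cols.size <= min_features:  # step 4: stop at the minimum
            break
        # Step 2a: fit on all rows and rank the current features.
        fitted = clone(estimator).fit(X[:, cols], y)
        if hasattr(fitted, "coef_"):
            # Simplified ranking (assumption): sum of absolute coefficients.
            importances = np.abs(np.atleast_2d(fitted.coef_)).sum(axis=0)
        else:
            importances = fitted.feature_importances_
        # Step 3: drop the least important feature(s).
        n_drop = min(step, cols.size - min_features)
        support[cols[np.argsort(importances)[:n_drop]]] = False
    # Step 5: keep the subset with the best mean CV score.
    best_score, best_support = max(history, key=lambda item: item[0])
    return best_support, history


X, y = make_classification(
    n_samples=200, n_features=8, n_informative=3, random_state=0
)
support, history = guessed_rfecv(LogisticRegression(max_iter=1000), X, y)
print(support)
print([round(score, 3) for score, _ in history])
```

If `RFECV` does something different (for example, running the elimination separately inside each CV fold), it would help to have that spelled out in the documentation.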

Is that correct? Furthermore, can a detailed explanation of what `RFECV` is doing be added to the documentation?

Better documentation for `RFECV` #27193
