Skip to content

Description of bootstrapping for RandomForest estimators #16733

@johannfaouzi

Description

@johannfaouzi

Describe the issue linked to the documentation

The docstrings for sklearn.ensemble.RandomForestClassifier and sklearn.ensemble.RandomForestClassifier states that:

The sub-sample size is always the same as the original input sample size but the samples are drawn with replacement if bootstrap=True (default).

However, the new max_samples parameter has been added in version 0.22, so it needs updating (or removing).

Suggest a potential alternative/fix

A simple change would be to mention this new parameter and to borrow the documentation for the bootstrap parameter:

The sub-sample size is controlled with the max_samples parameter if bootstrap=True (default), otherwise the whole dataset is used to build each tree.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions