ColumnTransformer should always apply sparse_threshold

I hate to do this, but I think we should change the behavior of ``sparse_threshold`` for the final release.
I think it's unnatural to not apply this when all matrices are sparse. In the (not uncommon) case that all columns are categorical it's easy to get a sparse array otherwise because the default of ``OneHotEncoder`` is sparse.

This is an issue (similar to #12071) that pops up when building a general pipeline to be applied to several datasets. Right now the presence of a single continuous feature changes whether the output will be sparse or not, while that's not really relevant, I think.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ColumnTransformer should always apply sparse_threshold #12150

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

ColumnTransformer should always apply sparse_threshold #12150

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions