ColumnTransformer.transfomers_ should store indices rather than a function

When column is specified as a function, this should not be stored in `transformers_`. Rather, the calculated indices should be stored. The current approach risks getting different sets of indices returned when `fit` and `transform` are called.

```py
>>> from sklearn.preprocessing import StandardScaler
>>> from sklearn.compose import ColumnTransformer
>>> def get_all(X):
...     return np.arange(X.shape[1])
...
>>> trans = ColumnTransformer([('foobar', StandardScaler(), get_all)])
>>> trans.fit(np.array([[1., 2, 3]]))
```

Expected:

```py
>>> trans.transformers_
[('foobar', StandardScaler(copy=True, with_mean=True, with_std=True), array([0, 1, 2]))]
```

Actual:

```py
>>> trans.transformers_
[('foobar', StandardScaler(copy=True, with_mean=True, with_std=True), <function <lambda> at 0x1811fa3048>)]
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ColumnTransformer.transfomers_ should store indices rather than a function #12097

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

ColumnTransformer.transfomers_ should store indices rather than a function #12097

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions