Conversation
1.0
>>> tt.estimator_.coef_
array([ 2.])

estimator : object
    Estimator object to wrap.

func : function
Are there cases in which it is useful to have a transformer here, i.e. one of:
- The transformation involves statistics from the training portion of y
- The transformation is parameterised and those parameters should be searchable
The second one can be done, though it's ugly, by searching over functions? Do you have a use-case for 1?
I like the second bullet; the first doesn't seem pipeline-friendly but I guess could be useful in custom code
Well LabelBinarizer without specifying the set of labels is an example of learnt state... Centring or Box Cox for regression?
Yeah I completely agree, I misread the first bullet earlier re:training portion.
Come to think of it, centering / standardization in linear models is also a good example of something that could be expressed under this form, and also confirms @amueller's point that we care about the score after inverse-transforming.
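The centring/standardisation case above can be sketched as a minimal meta-estimator. This is a hypothetical illustration, not the PR's implementation; the class and parameter names are made up:

```python
import numpy as np
from sklearn.base import BaseEstimator, RegressorMixin, clone
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler

class TargetTransformingRegressor(BaseEstimator, RegressorMixin):
    """Fit a transformer on y, train the regressor on the transformed
    target, and inverse-transform predictions back to y's domain."""

    def __init__(self, transformer, regressor):
        self.transformer = transformer
        self.regressor = regressor

    def fit(self, X, y):
        # Most transformers expect 2d input, so reshape y to a column,
        # fit the transformer on the *training* y (the learnt state
        # discussed above), then ravel back for the regressor.
        y = np.asarray(y, dtype=float)
        self.transformer_ = clone(self.transformer).fit(y.reshape(-1, 1))
        y_trans = self.transformer_.transform(y.reshape(-1, 1)).ravel()
        self.regressor_ = clone(self.regressor).fit(X, y_trans)
        return self

    def predict(self, X):
        # Predict in the transformed domain, then map back.
        y_pred = self.regressor_.predict(X).reshape(-1, 1)
        return self.transformer_.inverse_transform(y_pred).ravel()
```

For example, `TargetTransformingRegressor(StandardScaler(), LinearRegression())` centres and scales y before fitting, and predictions come back in the original units.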
with #8022 I wouldn't need to add the estimator to donttest ^^
"TargetTransformer only implements a score method if "
"estimator is a classifier or regressor, but " + err)
So you are computing the score in the "outer" domain, not in the "inner" domain in which the estimator_ was fit.
An alternative would be: return self.estimator_.score(X, self.func(y)) which doesn't need error checking but returns scores in the "inner" domain, where the estimator is truly fit.
I would prefer this alternative. I guess it depends on the use cases this transformer is designed for?
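To make the inner/outer distinction concrete, here is a hedged sketch with a log-transformed target (synthetic data; r2_score stands in for the estimator's score method):

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score

rng = np.random.RandomState(0)
X = rng.uniform(1.0, 10.0, size=(100, 1))
# Target is exponential in X, so log(y) is linear in X (plus noise).
y = np.exp(0.5 * X.ravel() + rng.normal(scale=0.1, size=100))

# Fit in the "inner" (log) domain, as the wrapped estimator would.
reg = LinearRegression().fit(X, np.log(y))

# "Inner" score: evaluate against the transformed target, func(y).
inner_r2 = r2_score(np.log(y), reg.predict(X))

# "Outer" score: inverse-transform predictions, evaluate in y's domain.
outer_r2 = r2_score(y, np.exp(reg.predict(X)))
```

The two scores generally differ: the outer one reflects the quality users see after inverse-transforming, which is the argument made above for scoring in the outer domain.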
The name is confusing: this is called TargetTransformer but it's not a transformer: it's a metaclassifier. (Is it intended to have it be a metaestimator that could wrap transformers as well?)
Call it Retarget? not sure I'm serious
MovingTargetClassifier :P (definitely not serious) More seriously: FunctionRetargeter? (along the lines of FunctionTransformer)
I think BoxCox is a good example. How would we support that best? Do the simple case first or try to get it right? TargetFunction? TransformTarget?
Well, I could imagine

    TransformedY(BoxCox(), SVR())

Would we then want a shorthand for the common case you now handle, e.g.:

    TransformedY(FunctionTransformer(np.log, np.exp), SVR())

? 1d y would automatically be converted to and from 2d in fitting and transforming.
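The log/exp shorthand mentioned above already works standalone with scikit-learn's real FunctionTransformer, which pairs a forward and an inverse function (a small sketch; the wrapper class under discussion is omitted here):

```python
import numpy as np
from sklearn.preprocessing import FunctionTransformer

# A forward/inverse pair for a log-scale target.
ft = FunctionTransformer(np.log, np.exp, validate=False)

y = np.array([1.0, 2.0, 4.0, 8.0])
ft.fit(y)                              # stateless; also checks the inverse
y_log = ft.transform(y)                # elementwise natural log
y_back = ft.inverse_transform(y_log)   # round-trips to the original y
```

Setting validate=False skips check_array, which is what lets a 1d y pass through without the 2d reshaping issue raised later in this thread.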
I would have base_estimator as the first parameter, as I think it usually is. We can have it be a transformer or a tuple? Though I'd actually prefer kwargs to be more explicit, either specifying
I prefer the explicit API with kwargs to the tuple.
quoth @ogrisel on API discussion: If all are none, this just does an identity function.
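For reference, the API that eventually shipped in scikit-learn (sklearn.compose.TransformedTargetRegressor) behaves exactly this way: with transformer, func and inverse_func all left as None, the wrapper reduces to an identity around the underlying regressor.

```python
import numpy as np
from sklearn.compose import TransformedTargetRegressor
from sklearn.linear_model import LinearRegression

X = np.arange(6, dtype=float).reshape(-1, 1)
y = 2.0 * X.ravel() + 1.0

# transformer, func and inverse_func all default to None: identity.
identity = TransformedTargetRegressor(regressor=LinearRegression())
identity.fit(X, y)
```

Fitting and predicting through the identity wrapper gives the same result as fitting the regressor directly.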
FYI I'll make some changes and drop the PR on Andy's branch
@glemaitre ? you already have it open, right?
Here is what I have for tonight: https://github.com/amueller/scikit-learn/pull/30/files @amueller @jnothman I got trapped with the fit of the transformer. With the FunctionTransformer, we can enable or disable check_array, which is really useful. However, in the case of BoxCox, there is nothing that allows this, which means y would have to be a 2d array to pass through the fit. What did you have in mind to get around this problem?
If a transformer is provided, we'd need to assume that it requires a 2d array. We would need to reshape at fit, and ravel (and check n_samples is maintained) at transform. I can see problems with that, such as existing label transformers not working where they require 1d.
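The reshape-at-fit, ravel-at-transform approach described above can be sketched as a small helper. This is a hypothetical illustration (the function name and error message are made up), not code from the PR:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

def fit_transform_1d_target(transformer, y):
    """Adapt a transformer that requires 2d input to a 1d target.

    Reshapes y to (n_samples, 1) for fit/transform, checks that the
    number of samples is preserved, and ravels the result back to 1d.
    """
    y2d = np.asarray(y, dtype=float).reshape(-1, 1)
    y_trans = transformer.fit_transform(y2d)
    if y_trans.shape[0] != y2d.shape[0]:
        raise ValueError("transformer changed the number of samples")
    return np.ravel(y_trans)

# Example: StandardScaler requires 2d input, but the target is 1d.
scaled = fit_transform_1d_target(StandardScaler(), [1.0, 2.0, 3.0])
```

As noted above, this breaks for transformers that themselves require 1d input (e.g. the label transformers), which is the trade-off being weighed.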
Closed by #9041
Implements #8678.
No tests yet.