[MRG+1] FIX Isotonic Regression for duplicate minimal value by jmetzen · Pull Request #3995 · scikit-learn/scikit-learn

jmetzen · 2014-12-23T12:25:15Z

Fixes an issue of Isotonic Regression when the minimal value of fitting is duplicated, e.g.:

ir = IsotonicRegression(increasing=True, out_of_bounds="clip")
ir.fit([0, 0, 1], [0, 0, 1])
ir.predict([0])

would return nan (see test_isotonic_duplicate_min_entry). The deeper reason for this seems to be in interpolate.interp1d. This issue is fixed by clipping not to minimal value observed in fitting but to minimal value + np.finfo(float).resolution.

…s in IsotonicRegression

ogrisel · 2014-12-23T12:55:27Z

LGTM.

ogrisel · 2014-12-23T12:56:51Z

Actually it would be great to try to understand what is the issue with interp1d.

jmetzen · 2014-12-23T13:32:10Z

The following happens when the values are passed directly to interp1d:

In [7]: f = interpolate.interp1d([0, 0, 1], [0, 0, 1], kind='linear')
In [8]: f(0)
 /home/jmetzen/.anaconda/lib/python2.7/site-packages/scipy/interpolate/interpolate.py:445: RuntimeWarning: invalid value encountered in true_divide
 slope = (y_hi - y_lo) / (x_hi - x_lo)[:, None]
Out[8]: array(nan)

Everything works fine in the case without duplicates:

In [13]: f = interpolate.interp1d([0, 1], [0,  1], kind='linear')
In [14]: f(0)
Out[14]: array(0.0)

I will open a related bug report in scipy. For the moment, the workaround of this PR should fix the issue.

ogrisel · 2014-12-23T14:20:56Z

Ok once you have the issue reported to scipy, please add an inline comment with the scipy issue number next to the line with the workaround in the scikit-learn source code.

agramfort · 2014-12-24T11:28:36Z

LGTM too

ogrisel · 2014-12-24T11:59:15Z

@jmetzen have you opened the issue at scipy? I would like to reference it before we merge this workaround to sklearn master to ensure traceability.

jmetzen · 2014-12-24T12:08:58Z

The scipy issue is 4304 (scipy/scipy#4304) and I've added an inline comment in isotonic_regression.

coveralls · 2014-12-24T12:16:51Z

Coverage increased (+0.0%) when pulling d255866 on jmetzen:fix_isotonic into d5c57a7 on scikit-learn:master.

jnothman · 2014-12-24T12:33:58Z

Then this looks good to me too. Merging. Thanks!

[MRG] FIX Isotonic Regression for duplicate minimal value

MechCoder · 2014-12-24T13:34:40Z

sklearn/tests/test_isotonic.py

no newline :P

sorry, my bad. this PR is already merged but I will fix it in the related PR #1176

amueller · 2015-01-13T22:25:18Z

Should we not rather use one of the recommended interpolation routines such as
>>> fs = interp1d([0, 0, 1], [0, 0, 1], kind='slinear')?

jmetzen · 2015-01-14T18:35:34Z

I tried to avoid any changes that would cause differences in the learned model. But you are right, kind="linear" and kind="slinear" give identical results as far as I can see it (besides the nan issue, which does not occur in slinear). See also here: scipy/scipy#4304
We should use slinear in isotonic as it is a less hacky .

… slinear.

[MRG+1] MAINT Remove temporary fix #3995 in view of the change to slinear.

ev-br · 2016-10-22T09:21:53Z

Just a note that the workaround from 2014, scipy/scipy#4304, no longer works.

ogrisel · 2016-10-24T12:33:05Z

@ev-br thanks for the heads up but as of 0.18 we don't use kind='slinear' anymore.

Jan Hendrik Metzen added 2 commits December 23, 2014 12:44

TEST A duplicate minimum value should not yield non-finite prediction…

3056ec1

…s in IsotonicRegression

FIX Adding eps to minimum value in clipping in IsotonicRegression

983e574

jmetzen mentioned this pull request Dec 23, 2014

[MRG+1] Isotonic calibration #1176

Closed

5 tasks

ogrisel changed the title ~~FIX Isotonic Regression for duplicate minimal value~~ [MRG+1] FIX Isotonic Regression for duplicate minimal value Dec 23, 2014

DOC Add inline comment with reference to scipy issue

d255866

jnothman added a commit that referenced this pull request Dec 24, 2014

Merge pull request #3995 from jmetzen/fix_isotonic

636502f

[MRG] FIX Isotonic Regression for duplicate minimal value

jnothman merged commit 636502f into scikit-learn:master Dec 24, 2014

MechCoder reviewed Dec 24, 2014
View reviewed changes

jmetzen mentioned this pull request Dec 29, 2014

Duplicate minimal entries in interpolate.interp1d result in nan scipy/scipy#4304

Closed

amueller mentioned this pull request Jan 14, 2015

Use slinear interpolation in Isotonic #4101

Closed

raghavrv added a commit to raghavrv/scikit-learn that referenced this pull request Jan 17, 2015

MAINT Remove temporary fix scikit-learn#3995 in view of the change to…

5a3b9e9

… slinear.

agramfort added a commit that referenced this pull request Jan 17, 2015

Merge pull request #4113 from ragv/isotonic

7b36bf6

[MRG+1] MAINT Remove temporary fix #3995 in view of the change to slinear.

Uh oh!

Conversation

jmetzen commented Dec 23, 2014

Uh oh!

ogrisel commented Dec 23, 2014

Uh oh!

ogrisel commented Dec 23, 2014

Uh oh!

jmetzen commented Dec 23, 2014

Uh oh!

ogrisel commented Dec 23, 2014

Uh oh!

agramfort commented Dec 24, 2014

Uh oh!

ogrisel commented Dec 24, 2014

Uh oh!

jmetzen commented Dec 24, 2014

Uh oh!

coveralls commented Dec 24, 2014

Uh oh!

jnothman commented Dec 24, 2014

Uh oh!

MechCoder Dec 24, 2014

Choose a reason for hiding this comment

Uh oh!

jmetzen Dec 29, 2014

Choose a reason for hiding this comment

Uh oh!

amueller commented Jan 13, 2015

Uh oh!

jmetzen commented Jan 14, 2015

Uh oh!

ev-br commented Oct 22, 2016

Uh oh!

ogrisel commented Oct 24, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants