[MRG] Support for infinite values in GBDTs by NicolasHug · Pull Request #14406 · scikit-learn/scikit-learn

NicolasHug · 2019-07-18T17:21:38Z

I think we need this merged before the missing values support :)

adrinjalali · 2019-07-18T17:54:01Z

sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py

+        # This is not strictly True, but it's needed since
+        # force_all_finite=False means accept both nans and infinite values.
+        # Without the tag, common checks would fail.
+        # This comment must be removed once we merge PR 13911


Maybe add a "TODO"? We sometimes go through them, and it'll be easier to find then. But if you're going to fix it yourself, no big deal.

adrinjalali · 2019-07-18T17:54:32Z

ping when tests pass?

NicolasHug · 2019-07-18T18:46:02Z

ping @adrinjalali They pass ^^ it's a docker issue

ogrisel

LGTM. Just a quick comment to make the atol in a test easier to understand, but no big deal. Feel free to merge without addressing it if you don't like my suggestion :)

ogrisel · 2019-07-19T12:17:13Z

sklearn/ensemble/_hist_gradient_boosting/tests/test_gradient_boosting.py

+
+    gbdt = HistGradientBoostingRegressor(min_samples_leaf=1)
+    gbdt.fit(X, y)
+    np.testing.assert_allclose(gbdt.predict(X), y, atol=1e-4)


Why such a high value for atol? Maybe max_iter is too small for the default value of the learning rate? Maybe you could set the learning rate to 1.0, and a single split in a single tree (max_iter=1, max_leaf_nodes=2) would be enough to perfectly fit the data?
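A rough back-of-the-envelope check of the point above, not taken from the PR. It assumes squared-error loss, targets of 0 and 1 in equal numbers (so the initial residual is 0.5 around the mean), and that every tree fits the current residuals exactly, so each iteration removes a fraction `learning_rate` of what is left. The defaults `learning_rate=0.1` and `max_iter=100` are those of `HistGradientBoostingRegressor`; the variable names are illustrative only.

```python
# Assumption: |y - mean(y)| = 0.5 for every sample (targets 0 and 1, balanced).
initial_residual = 0.5


def remaining_residual(learning_rate, max_iter, start=initial_residual):
    """Residual left after max_iter boosting steps, assuming each tree
    fits the current residuals exactly (an idealisation)."""
    return start * (1.0 - learning_rate) ** max_iter


# Defaults: the residual shrinks by a factor 0.9 per iteration, 100 times.
with_defaults = remaining_residual(learning_rate=0.1, max_iter=100)

# Suggested settings: one tree with a full step removes the residual at once.
with_full_step = remaining_residual(learning_rate=1.0, max_iter=1)

print(f"defaults:  {with_defaults:.3e}")
print(f"full step: {with_full_step:.3e}")
```

Under these assumptions the default settings leave an error on the order of 1e-5, which would explain why a tolerance as loose as `atol=1e-4` was needed, while a single full-step tree leaves none.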

ogrisel · 2019-07-19T12:22:16Z

I launched a rebuild of azure and circle as the failures did not look related to this PR.

ogrisel · 2019-07-19T12:58:46Z

The tests pass. Let's merge, we can always improve the test later :)

NicolasHug added 2 commits July 18, 2019 13:14

Support for infinite values

f2f68d5

pep8

6471149

Added comment

5bc719d

NicolasHug mentioned this pull request Jul 18, 2019

[MRG] Native support for missing values in GBDTs #13911

Merged

adrinjalali reviewed Jul 18, 2019

ogrisel approved these changes Jul 19, 2019

ogrisel merged commit dd78658 into scikit-learn:master Jul 19, 2019

ogrisel merged 3 commits into scikit-learn:master from NicolasHug:gbdt_nan
