
TST Replace Boston dataset in test_tree #17290

Merged
thomasjpfan merged 7 commits into scikit-learn:master from lucyleeow:test_tree
Jun 1, 2020

Conversation

@lucyleeow
Member

@lucyleeow lucyleeow commented May 20, 2020

Reference Issues/PRs

Towards #16155

What does this implement/fix? Explain your changes.

Replace the Boston dataset with the diabetes dataset in sklearn/tree/tests/test_tree.py

Any other comments?

I noticed that in test_boston (now test_diabetes) the score was always 0 for all estimators and criteria, for both the boston and diabetes datasets. I confirmed that reg.predict(diabetes.data) returns exactly diabetes.target, possibly because reg is fitted and scored on the same dataset. I don't think this is what was intended? Happy to amend, here or in another PR, if this is not right.

@glemaitre
Member

If the tree is grown without fixing the depth, each leaf in the tree will be a sample. So you clearly overfit but you will do a perfect classification if you train and test on the same data. Basically, this is a behaviour that we want to check (I don't know if it was intended here).
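A quick sketch of this behaviour (illustrative, not code from this PR): a depth-unrestricted tree memorises its training set, so scoring it on the same data it was fitted on gives a perfect R^2, which matches the reg.predict(diabetes.data) == diabetes.target observation above.

```python
# Sketch, not the PR's test code: an unrestricted tree overfits to a
# perfect fit when trained and scored on the same data.
from sklearn.datasets import load_diabetes
from sklearn.tree import DecisionTreeRegressor

X, y = load_diabetes(return_X_y=True)

# No max_depth: the tree keeps splitting until every leaf holds a
# single sample (assuming no duplicate rows with conflicting targets).
reg = DecisionTreeRegressor(random_state=0).fit(X, y)
print(reg.score(X, y))  # R^2 of 1.0 on the training data itself
```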

@glemaitre
Member

it would be the same behaviour for regression.

Member

@glemaitre glemaitre left a comment


Maybe we can check the score, but the changes LGTM.

@lucyleeow
Member Author

lucyleeow commented May 21, 2020

> If the tree is grown without fixing the depth, each leaf in the tree will be a sample.

Thanks, that makes sense.

I think the test was copied from (or at least is the same as) this one in test_forest:

```python
def check_regression_criterion(name, criterion):
    # Check consistency on regression dataset.
    ForestRegressor = FOREST_REGRESSORS[name]

    reg = ForestRegressor(n_estimators=5, criterion=criterion,
                          random_state=1)
    reg.fit(X_reg, y_reg)
    score = reg.score(X_reg, y_reg)
    assert score > 0.93, ("Failed with max_features=None, criterion %s "
                          "and score = %f" % (criterion, score))

    reg = ForestRegressor(n_estimators=5, criterion=criterion,
                          max_features=6, random_state=1)
    reg.fit(X_reg, y_reg)
    score = reg.score(X_reg, y_reg)
    assert score > 0.92, ("Failed with max_features=6, criterion %s "
                          "and score = %f" % (criterion, score))
```

In test_forest the regressors are not completely overfit as each tree uses only a subset of the data and the scores are not perfect.

I'm not sure what this test is checking for, but if it is checking that using fewer features reduces the learning ability (as suggested by the comment and the lower score threshold in the 2nd assert), then it isn't doing a good job. We should restrict depth.
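The idea of restricting depth can be sketched as follows (an illustration under my own assumptions, not the merged test): capping max_depth keeps a single tree from memorising the diabetes data, so the training score drops below 1.0 and a threshold assertion becomes meaningful.

```python
# Sketch (not the merged test): a depth cap prevents full memorisation,
# so a score threshold on the training data actually tests something.
from sklearn.datasets import load_diabetes
from sklearn.tree import DecisionTreeRegressor

X, y = load_diabetes(return_X_y=True)

shallow = DecisionTreeRegressor(max_depth=5, random_state=0).fit(X, y)
full = DecisionTreeRegressor(random_state=0).fit(X, y)

print(shallow.score(X, y))  # below 1.0: the capped tree cannot memorise
print(full.score(X, y))     # 1.0: the unrestricted tree overfits
```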

@lucyleeow
Member Author

ping @glemaitre

@glemaitre
Member

It is kind of funny that we only have a single build failing here. The splits are probably different on 32 bits. The max_depth is probably not sufficient since the splits are completely random. You can increase the depth of the trees, then.

@lucyleeow
Member Author

lucyleeow commented May 27, 2020

But we set the random state? For me (64-bit), overfitting occurs quickly. For max_depth=20, 4/6 scores are 0:

DecisionTreeRegressor mse score 0.0
DecisionTreeRegressor mae score 5.786199095022624
DecisionTreeRegressor friedman_mse score 0.0
ExtraTreeRegressor mse score 0.0
ExtraTreeRegressor mae score 3.6911764705882355
ExtraTreeRegressor friedman_mse score 0.0

@glemaitre glemaitre self-assigned this May 27, 2020
@lucyleeow
Member Author

lucyleeow commented May 27, 2020

The score is larger than 60 with 32-bit:

Tree       = <class 'sklearn.tree._classes.ExtraTreeRegressor'>
criterion  = 'mse'
max_depth  = 15
name       = 'ExtraTreeRegressor'
reg        = ExtraTreeRegressor(max_depth=15, max_features=6, random_state=0)
score      = 281.43585091379208

The large difference between 32 and 64 is odd.

@glemaitre
Member

> The large difference between 32 and 64 is odd.

Basically, I recall that the trees built on a 32-bit architecture are different from the ones built on 64 bits. So either we increase the threshold, or we skip the test on 32-bit architectures with the decorator @skip_if_32bit from sklearn.utils._testing. Let's do the latter and see what people think about it in a second review.
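The decorator usage described here could look like this (the test body and assertion below are illustrative assumptions, not the code merged in this PR):

```python
# Sketch of the suggested approach: guard an architecture-sensitive
# assertion with skip_if_32bit so 32-bit builds skip it rather than
# fail, instead of loosening the threshold for everyone.
from sklearn.datasets import load_diabetes
from sklearn.metrics import mean_squared_error
from sklearn.tree import DecisionTreeRegressor
from sklearn.utils._testing import skip_if_32bit


@skip_if_32bit
def test_diabetes_overfit_sketch():
    # Hypothetical test body: trees grown on 32-bit architectures can
    # choose different splits, so this check only runs on 64 bits.
    X, y = load_diabetes(return_X_y=True)
    reg = DecisionTreeRegressor(random_state=0).fit(X, y)
    # Unrestricted depth: the tree memorises the training set exactly.
    assert mean_squared_error(y, reg.predict(X)) == 0.0
```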

Member

@thomasjpfan thomasjpfan left a comment


LGTM

@thomasjpfan thomasjpfan merged commit f27adc5 into scikit-learn:master Jun 1, 2020
@thomasjpfan
Member

Thank you @lucyleeow !

@lucyleeow lucyleeow deleted the test_tree branch June 2, 2020 08:57
viclafargue pushed a commit to viclafargue/scikit-learn that referenced this pull request Jun 26, 2020
jayzed82 pushed a commit to jayzed82/scikit-learn that referenced this pull request Oct 22, 2020