-
-
Notifications
You must be signed in to change notification settings - Fork 27k
[RFC] Tree module improvements #5212
Copy link
Copy link
Closed
Labels
EnhancementModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practiceshelp wantedmodule:tree
Metadata
Metadata
Assignees
Labels
EnhancementModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practiceshelp wantedmodule:tree
I am planning on submitting several PRs in an attempt to merge #5041 in slowly, with the ultimate goal being a clean implementation of multithreaded decision tree building so that Gradient Boosting can be faster. With one of the main concepts merged (#5203), here is a list of separate PRs which I'd like to merge in the near future.
Longer range goals which I'd like to work towards (but have no clear plan as of right now) are the following:
Issue [RFC] Missing values in RandomForest #5870, PR ENH Adds missing value support for trees #23595, FEA Add support for missing values in tree estimators with
criterion="absolute_error"by greatly simplifying the logic #32119At this point, it will be clearer to me what specific changes to Splitter, Criteria, and TreeBuilder need to be added to make multithreading a possibility. @glouppe @arjoly @GaelVaroquaux @pprett if you have any comments, I'd love to hear them.