-
-
Notifications
You must be signed in to change notification settings - Fork 26.9k
API for getting DBSCAN-like clusterings out of OPTICS with fit_predict #12044
Copy link
Copy link
Closed
Labels
EnhancementModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practiceshelp wanted
Description
Currently we have an interface for OPTICS with custom method extract_dbscan. This is good for usability and visibility of the functionality, but means that a generic parameter search tool (like GridSearchCV) can't use OPTICS to perform DBSCAN at various eps.
This would involve adding an eps parameter which, when None, would use the default OPTICS clustering; when not None would use extract_dbscan. But we would also need to retain the model across multiple fits...
Here are two alternative interfaces:
- Add a
warm_startparameter (like many classifiers, regressors, but uncharted territory for clusterers). When True, andfitorfit_predictis called, the currentreachability_,ordering_andcore_distances_would be kept, but a different final clustering step would be used to output / storelabels_. - Add a
memoryparameter, like in hierarchical clustering. This would cache the mapping from parameters toreachability_,ordering_andcore_distances_using ajoblib.Memory.
I think the first option sounds more appropriate.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
EnhancementModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practiceshelp wanted