[ENH] ensure that all estimators have two test parameter sets

We should ensure that all estimators (that have parameters) possess at least two test parameter sets.

The two (or more) parameter sets should:

* be fast to run together - `fit` is the bottleneck (so we should not overdo it with too many parameter sets)
* cover substantially different settings for all the important parameters, i.e., substantially different typical cases and/or important edge cases

Recipe:

1. search for estimators which have parameters but only a single test parameter set. These are estimators with no `get_test_params` implemented, or `get_test_params` returning only a single dictionary instead of a list of two (or more) dictionaries.
2. post here in this issue which estimator you picked (to avoid duplication of work)
3. come up with a parameter set satisfying the above constraints and add it to the return (should be list of two or more dictionaries)
4. make a PR, this should add the parameter set in `get_test_params`, and remove the estimator from `EXCLUDED_TESTS_BY_TEST` in `sktime.tests._config`
    * if the estimator has a `"results_comparison"` parameter set, ignore that - add new parameters only to the unconditional parameter set

An example PR that adds second parameter sets for some estimators can be found here: https://github.com/alan-turing-institute/sktime/pull/3428

Finding some estimators that have only one parameter set can be done quickly by checking the variable `EXCLUDED_TESTS_BY_TEST` in `sktime.tests._config`, estimators which do not comply with the "number of test parameter sets" requirements are skipping `"test_get_test_params_coverage"`.

Alternatively, locally running code which does this:
```python
from sktime.registry import all_estimators

all_ests = all_estimators()
ests = [x for x in all_ests if (len(x[1].get_test_params())<2 or isinstance(x[1].get_test_params(), dict)) and len(x[1].get_param_names())>0]
[x[0] for x in ests if len(x[1].get_param_names()) > len(x[1].get_class_tag("reserved_params", []))]
```

Current output:
- [ ] `ClaSPTransformer`
- [ ] `CNTCNetwork`,
- [ ] `CanonicalIntervalForest`,
- [ ] `CircularBinarySegmentation`,
- [ ] `ColumnEnsembleClassifier`
- [ ] `ColumnTransformer`
- [ ] `ColumnwiseTransformer`
- [ ] `DOBIN`
- [ ] `DWTTransformer`
- [ ] `DistFromAligner`
- [ ] `DistanceFeatures`
- [ ] `DontUpdate`
- [ ] `DummyRegressor`
- [ ] `FeatureSelection`
- [ ] `GreedyGaussianSegmentation`
- [ ] `InceptionTimeNetwork`
- [ ] `IndividualBOSS`
- [ ] `IndividualTDE`
- [ ] `LogTransformer`
- [ ] `MCDCNNClassifier`
- [ ] `MCDCNNNetwork`
- [ ] `MCDCNNRegressor`
- [ ] `MLPNetwork`
- [ ] `MatrixProfile`
- [ ] `OnlineEnsembleForecaster`
- [ ] `PAA`
- [ ] `PCATransformer`
- [ ] `PlateauFinder`
- [ ] `RandomIntervalFeatureExtractor`
- [ ] `RandomIntervalSegmenter`
- [ ] `RandomIntervals`
- [ ] `RandomSamplesAugmenter`
- [ ] `ReducerTransform`
- [ ] `ShapeletTransform`
- [ ] `SlidingWindowSegmenter`
- [ ] `SlopeTransformer`
- [ ] `StackingForecaster`
- [ ] `SupervisedTimeSeriesForest`
- [ ] `TapNetNetwork`
- [ ] `TimeBinner`
- [ ] `TruncationTransformer`
- [ ] `UnobservedComponents`
- [ ] `WhiteNoiseAugmenter`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ENH] ensure that all estimators have two test parameter sets #3429

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[ENH] ensure that all estimators have two test parameter sets #3429

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions