Closes DAS-1146: Sample weight metric calculations #3
Conversation
ryan-deak-zefr
left a comment
This works to fix some of the issues but there are presumably much deeper methodological issues that might make this PR moot.
return train_sample_weight, test_sample_weight

def _apply_scorer(estimator, X, y, scorer, sample_weight):
Same as in the sklearn PR for DAS-1145.
y_train,
scorer,
error_score,
test_sample_weight=None,
Ordering is consistent with the test-then-train parameter order established above.
scorer,
return_train_score,
):
if "sample_weight" in fit_params:
At the top since this is used in both branches.
#
# Each value in the fit_params dict is a 2-tuple where the
# data representation is in the second dimension (dim 1).
sample_weight = fit_params["sample_weight"][1]
Not sure about getting this from some cache or something.
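An illustrative sketch of the 2-tuple convention the comment above describes (the token name is made up; this is not dask_ml's actual graph internals):

```python
import numpy as np

# Each fit_params value is assumed to be a (token, data) 2-tuple,
# so the weights array itself sits at index 1.
fit_params = {
    "sample_weight": ("sample_weight-token-0", np.array([1.0, 2.0, 0.5]))
}

if "sample_weight" in fit_params:
    sample_weight = fit_params["sample_weight"][1]
else:
    sample_weight = None
```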
params=None,
fit_params=None,
return_train_score=True,
sample_weight=None,
Place at the end and make it None by default to make it "backward compatible".
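The backward-compatibility point can be shown with a toy function (hypothetical name, not the PR's code): adding the new argument last with a `None` default leaves existing call sites unchanged while enabling weighted behavior for new ones.

```python
import numpy as np

def weighted_accuracy(y_true, y_pred, sample_weight=None):
    # New trailing argument defaults to None, so existing
    # two-argument call sites keep their old unweighted behavior.
    correct = (np.asarray(y_true) == np.asarray(y_pred)).astype(float)
    if sample_weight is None:
        return correct.mean()
    w = np.asarray(sample_weight, dtype=float)
    return (correct * w).sum() / w.sum()

old_style = weighted_accuracy([0, 1, 1], [0, 1, 0])             # 2/3
new_style = weighted_accuracy([0, 1, 1], [0, 1, 0], [1, 1, 4])  # 2/6
```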
clf.fit(X, y)

@pytest.mark.parametrize(
This test kind of subsumes the test in the sklearn PR because it tests the situation with one fold metric calculation and multiple fold metric calculations that need to be combined. It does this by testing at the dcv.GridSearchCV level rather than at the level of functions in dask_ml/model_selection/methods.py.
),
],
)
def test_sample_weight_in_metrics(cv_ind, exp_acc):
NOTE: If this test were run against the current master of dask_ml, it would fail: the metric returned in every case is 0.5, which is definitely incorrect since it disregards sample_weight entirely.
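A minimal illustration (data made up) of how ignoring weights can pin the metric at 0.5 while the weighted value differs:

```python
import numpy as np
from sklearn.metrics import accuracy_score

y_true = np.array([0, 1])
y_pred = np.array([0, 0])  # one right, one wrong
w = np.array([3.0, 1.0])

unweighted = accuracy_score(y_true, y_pred)                 # 0.5
weighted = accuracy_score(y_true, y_pred, sample_weight=w)  # 3/4 = 0.75
```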
----------
sample_weight : array-like
    Sample weights. Should contain all sample weights needed for
    training and testing. May be None.
Why is it sensical for sample_weight to be None in a function called _get_fold_sample_weights?
train_sample_weight = None
test_sample_weight = None
else:
    # "0" is the train split, "1" is the test split.
Is this a convention somewhere else?
from itertools import product
from multiprocessing import cpu_count

# sklearn.metrics.make_scorer
What is the point of these comments?
vecchp
left a comment
+1. Let's just make sure we issue the sklearn PR first before this goes out.
This PR has the same tests as the scikit-learn PR: ZEFR-INC/scikit-learn#2, so it should be working very similarly.
Summary
This PR addresses the fact that dask_ml (like sklearn) doesn't use importance weights when calculating metrics while scoring folds in cross validation. This affects all CV methods.
Changes
This PR only addresses the use of sample_weight in the metric calculations in cross fold validation. It doesn't address the issue of combining the metrics across folds, which is currently done by simple averaging rather than by weighted averaging using the sum of the sample weights for a fold as the weight.
Tests
Adds test_sample_weight_in_metrics in tests/model_selection/dask_searchcv/test_model_selection.py. This test is written to pass given the current way of combining metrics across folds (which is likely not the best / correct way to do it).
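The fold-combination issue left open here can be shown with two hypothetical folds: simple averaging and averaging weighted by each fold's total sample weight generally disagree (all numbers below are made up for illustration).

```python
import numpy as np

# Per-fold test metrics and each fold's total test sample weight
fold_scores = np.array([0.5, 1.0])
fold_weights = np.array([1.0, 3.0])

simple_mean = fold_scores.mean()  # (0.5 + 1.0) / 2 = 0.75
weighted_mean = (fold_scores * fold_weights).sum() / fold_weights.sum()
# (0.5 * 1 + 1.0 * 3) / 4 = 0.875
```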