Skip to content

Make GridSearchCV reuse scattered data.#622

Merged
TomAugspurger merged 2 commits intodask:masterfrom
lesteve:grid-search-cv-scattered-data
Mar 5, 2020
Merged

Make GridSearchCV reuse scattered data.#622
TomAugspurger merged 2 commits intodask:masterfrom
lesteve:grid-search-cv-scattered-data

Conversation

@lesteve
Copy link
Copy Markdown
Member

@lesteve lesteve commented Mar 4, 2020

Fix #516.

Maybe there is a better test? Also could use type(x).__name__ (rather than hardcoded ndarray) as in client.scatter: https://github.com/dask/distributed/blob/9b5bf448af478a166069aefc9c0c1354a29ae482/distributed/client.py#L1930

@TomAugspurger
Copy link
Copy Markdown
Member

TomAugspurger commented Mar 4, 2020

Thanks! I think you're missing a .utils in your test.

I think what you have (directly testing to_keys against client.scatter) is about as good as we could do. I never came up with an easy way to test it using GridSearchCV itself.

w.r.t. using type(x).__name__, that might be a bit better. We might be able to get a pandas DataFrame here.

@lesteve
Copy link
Copy Markdown
Member Author

lesteve commented Mar 5, 2020

Done. I also test pandas Dataframe now.

@TomAugspurger TomAugspurger merged commit 69680f7 into dask:master Mar 5, 2020
@TomAugspurger
Copy link
Copy Markdown
Member

Thanks!

@lesteve lesteve deleted the grid-search-cv-scattered-data branch March 17, 2020 16:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

GridsearchCV with prescattered data

2 participants