[WIP] Blockwise from array by ian-r-rose · Pull Request #6984 · dask/dask

ian-r-rose · 2020-12-17T00:19:15Z

Follow-up to #6931. Switches from_array and its ilk (zarr, hdf5) to use BlockwiseIO.

Tests added / passed
Passes black dask / flake8 dask

function.

mrocklin · 2020-12-17T17:37:25Z

dask/array/core.py

    else:
        # Common case, drop extra parameters
-        values = [(getitem, arr, x) for x in slices]
+        getter = partial(getitem, arr)


In general we prefer to avoid including partial, lambdas, closures, or other dynamically generated functions in task graphs. They're harder to serialize.

mrocklin · 2020-12-17T17:41:53Z

dask/array/core.py

    return list(product(*slices))


-def getem(


We should check with Xarray tests to make sure that we don't break anything there. They may use this (I hope not though).

mrocklin · 2020-12-17T17:42:08Z

@rjzamora if you have a moment can you take a look at this?

ian-r-rose · 2020-12-17T18:46:26Z

I think the thing which will be trickiest here is rewriting slicing to propagate down to the array access level. The blockwise IO layer is opaque, so any subsequent slicing can't get past it. If I understand the corresponding parquet code on the dataframe side, there is rewriting of the blockwise IO in the optimization phase to get around this. Is that right @rjzamora?

ian-r-rose · 2021-03-19T14:37:14Z

Superseded by #7417

Ian Rose added 5 commits December 16, 2020 16:16

Refactor BlockwiseCreateArray to pass richer block_info to the user

65915c9

function.

Use block_info with ones/zeros/full.

45809a7

WIP refactor from_array to use high level graph array creation.

ef97829

Update lock test.

628efdd

Test array embedding in graph_from_arraylike

0d42562

ian-r-rose changed the title ~~]WIP] Blockwise from array~~ [WIP] Blockwise from array Dec 17, 2020

mrocklin reviewed Dec 17, 2020

View reviewed changes

Base automatically changed from master to main March 8, 2021 20:19

ian-r-rose mentioned this pull request Mar 19, 2021

Blockwise array creation redux #7417

Merged

2 tasks

ian-r-rose closed this Mar 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Blockwise from array#6984

[WIP] Blockwise from array#6984
ian-r-rose wants to merge 5 commits intodask:mainfrom
ian-r-rose:blockwise-from-array

ian-r-rose commented Dec 17, 2020 •

edited

Loading

Uh oh!

mrocklin Dec 17, 2020

Uh oh!

mrocklin Dec 17, 2020

Uh oh!

mrocklin commented Dec 17, 2020

Uh oh!

ian-r-rose commented Dec 17, 2020

Uh oh!

ian-r-rose commented Mar 19, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ian-r-rose commented Dec 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mrocklin Dec 17, 2020

Choose a reason for hiding this comment

Uh oh!

mrocklin Dec 17, 2020

Choose a reason for hiding this comment

Uh oh!

mrocklin commented Dec 17, 2020

Uh oh!

ian-r-rose commented Dec 17, 2020

Uh oh!

ian-r-rose commented Mar 19, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ian-r-rose commented Dec 17, 2020 •

edited

Loading