Consider reactivating low-level DataFrame optimization when not all layers are Blockwise

Since https://github.com/dask/dask/pull/7620, we've seen a few instances where users have gotten burned by root-task overproduction (see https://github.com/dask/distributed/issues/5555, https://github.com/dask/distributed/issues/5223 for background) because certain DataFrame optimizations still use low-level graphs, and therefore aren't getting fused anymore. Examples:
* https://github.com/dask/dask/issues/8445
* https://github.com/dask/dask/issues/8309
* https://github.com/dask/dask/issues/8306

We do want to get everything to Blockwise eventually, but our bandwidth to track these down and fix them is limited. In the interim, I propose that, by default, we still do low-level fusion whenever any layer in the graph is materialized.
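To make the proposed default concrete, here is a minimal sketch of the decision rule only. `Layer`, its `materialized` flag, and `needs_low_level_fusion` are hypothetical stand-ins for illustration, not dask's actual classes or API; the real check would have to inspect the layers of the high-level graph.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Layer:
    """Hypothetical stand-in for a graph layer (not dask's Layer class)."""
    name: str
    materialized: bool  # True if the layer is a plain low-level task dict


def needs_low_level_fusion(layers: Iterable[Layer]) -> bool:
    """Return True if any layer is materialized, i.e. not all are Blockwise."""
    return any(layer.materialized for layer in layers)


# Every layer is Blockwise: high-level fusion is enough.
all_blockwise = [Layer("read_parquet", False), Layer("assign", False)]

# One materialized layer: fall back to low-level fusion by default.
mixed = [Layer("read_parquet", False), Layer("low_level_op", True)]

print(needs_low_level_fusion(all_blockwise))
print(needs_low_level_fusion(mixed))
```

Under this rule, graphs made up entirely of Blockwise layers keep the current behavior, and only graphs containing a materialized layer pay the cost of low-level fusion.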

cc @rjzamora @ian-r-rose @jrbourbeau 

Consider reactivating low-level DataFrame optimization when not all layers are Blockwise #8447
