Use Blockwise/`map_partitions` in various DataFrame join methods

I noticed that some join methods have things like
```python
        dsk = {
            (name, i): (apply, merge_chunk, [left_key, right_key], kwargs)
            for i, right_key in enumerate(right.__dask_keys__())
        }
```
where we're generating a low-level graph that could just be done with `map_partitions`. Using `map_partitions` in these scenarios would both speed up graph transmission and allow for blockwise fusion across the operations. Refactoring this simple sorts of graphs should be straightforward.

- [x] `single_partition_join`
- [ ] `hash_join`'s `merge_chunk`
- [ ] `stack_partitions` should use `HighLevelGraph.from_collections` instead of merging all of the input graphs

cc @rjzamora @ncclementi @jrbourbeau 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use Blockwise/`map_partitions` in various DataFrame join methods #8306

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Use Blockwise/map_partitions in various DataFrame join methods #8306

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Use Blockwise/`map_partitions` in various DataFrame join methods #8306