Satisfy broadcast argument in DataFrame.merge #9852
This fix is relatively minor. I plan to merge it by the end of the day.
jrbourbeau left a comment
Thanks for moving forward with this, @rjzamora. Question: based on this comment in the original issue:
"The dask.dataframe.merge API should use a broadcast-based merge when broadcast=True. The only exception should be when a broadcast-based algorithm is prohibited for the specified how and/or shuffle arguments."
Could we just raise an error if the user specifies values for how / shuffle that are inconsistent with broadcast=True? That seems like better UX than silently not using a broadcast merge.
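For illustration, the suggested behavior could be sketched roughly as follows. This is a standalone sketch, not the code in this PR: `validate_broadcast` is a hypothetical helper, and the sets of `how` and `shuffle` values that permit a broadcast merge are assumptions made for the example rather than values taken from dask.

```python
# Hypothetical helper for illustration only; it is not part of dask's API.
# ASSUMPTION: the values below are placeholders for "combinations that
# allow a broadcast-based merge". The real rules live in dask itself.
BROADCAST_HOWS = frozenset({"inner", "left", "right"})
BROADCAST_SHUFFLES = frozenset({None, "tasks"})


def validate_broadcast(how="inner", shuffle=None, broadcast=None):
    """Raise ValueError if broadcast=True conflicts with how/shuffle.

    Any other value of ``broadcast`` (None, False, or a numeric
    threshold) is returned unchanged, since only an explicit True is a
    hard request for the broadcast algorithm.
    """
    if broadcast is not True:
        return broadcast
    if how not in BROADCAST_HOWS:
        raise ValueError(
            f"broadcast=True is not supported for how={how!r}; "
            f"expected one of {sorted(BROADCAST_HOWS)}"
        )
    if shuffle not in BROADCAST_SHUFFLES:
        raise ValueError(
            f"broadcast=True is not supported for shuffle={shuffle!r}"
        )
    return True
```

With a check like this, an inconsistent request fails loudly at call time instead of silently falling back to a non-broadcast merge.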
Yes, it definitely makes sense to raise an error any time
Minimal change needed to ensure broadcast=True will be satisfied when the broadcast algorithm is not prohibited by the how or shuffle arguments.
pre-commit run --all-files