Test broadcast merge in local_cudf_merge benchmark #507
rapids-bot[bot] merged 4 commits into rapidsai:branch-0.19
|
run tests |
|
Now that dask#7143 has been merged, it is safe to consider this PR for integration. |
|
Retargeting seems to have no conflicts, let's see what the tests say. 🙂 |
|
Could you please run |
They are still running, so we can wait another hour or so to see the results, but feel free to restyle it anyway in the meantime, Rick. |
|
Ah ok. If they are still running, great, let's wait. I know that previously the changelog check (of course no longer included) and the style check were required to pass before tests would run. Given this, I couldn't tell whether the jobs here were actually queued to run or whether we were hitting some edge case in gpuCI where they only appear queued (but don't actually run). |
Codecov Report
@@             Coverage Diff              @@
##           branch-0.19    #507    +/-   ##
==========================================
+ Coverage      62.37%   63.77%   +1.39%
==========================================
  Files             22       22
  Lines           2517     2523      +6
==========================================
+ Hits            1570     1609     +39
+ Misses           947      914     -33
|
Co-authored-by: Peter Andreas Entschev <peter@entschev.com>
|
Looks like we are seeing some failures in |
Hmm - It doesn't seem like that test is running the benchmark modified in this PR, is it? |
|
Was this from a change in Dask or Distributed then? Both of those are scheduled for release tomorrow, so if they have issues, we should try to identify them quickly. |
|
Just to update this thread: Rick found a recent Dask change that is causing the failures here, as mentioned in this comment (dask/dask#7305 (comment)) in the PR with the change. We have surfaced this in the Dask release issue (dask/community#129 (comment)). We are now trying to figure out how best to address this before the Dask + Distributed release tomorrow. |
|
I'm attempting to fix the issue mentioned above in dask/dask#7325. |
|
rerun tests
Edit: Rerunning because Peter's fix has landed. Thanks Peter! 😄 |
|
@gpucibot merge |
Includes the necessary changes to test dask#7143. More specifically, this adds the following options:
- --base-chunks: Number of base-DataFrame partitions (default: n_workers)
- --other-chunks: Number of other-DataFrame partitions (default: n_workers)
- --broadcast-join: Use broadcast join when possible
- --shuffle-join: Use shuffle join (takes precedence over '--broadcast-join')
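
For readers who want to see how these four options could fit together, here is a minimal sketch using only `argparse`. It is not the benchmark's actual code: the helper names `parse_args` and `resolve_options` are hypothetical, and mapping the two join flags onto a single three-state `broadcast` value (`True`, `False`, or `None` for "let the library decide") is an assumption made for illustration.

```python
import argparse


def parse_args(argv=None):
    """Parse the four options listed above (hypothetical helper)."""
    parser = argparse.ArgumentParser(description="Sketch of merge-benchmark options")
    parser.add_argument("--base-chunks", type=int, default=None,
                        help="Number of base-DataFrame partitions (default: n_workers)")
    parser.add_argument("--other-chunks", type=int, default=None,
                        help="Number of other-DataFrame partitions (default: n_workers)")
    parser.add_argument("--broadcast-join", action="store_true",
                        help="Use broadcast join when possible")
    parser.add_argument("--shuffle-join", action="store_true",
                        help="Use shuffle join (takes precedence over '--broadcast-join')")
    return parser.parse_args(argv)


def resolve_options(args, n_workers):
    """Apply the documented defaults and flag precedence (hypothetical helper).

    Returns (base_chunks, other_chunks, broadcast), where broadcast is
    False for a forced shuffle join, True for a requested broadcast join,
    and None when neither flag was given (assumed to mean "no preference").
    """
    base_chunks = n_workers if args.base_chunks is None else args.base_chunks
    other_chunks = n_workers if args.other_chunks is None else args.other_chunks

    if args.shuffle_join:
        # --shuffle-join wins even if --broadcast-join was also passed.
        broadcast = False
    elif args.broadcast_join:
        broadcast = True
    else:
        broadcast = None
    return base_chunks, other_chunks, broadcast


if __name__ == "__main__":
    # Example: 8 workers is an arbitrary value chosen for the demonstration.
    print(resolve_options(parse_args(), n_workers=8))
```

Keeping the precedence rule in one small function makes it easy to check that passing both flags always falls back to the shuffle join, as the option description states.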