Add HighLevelGraph.validate call to assert_eq in tests#6509
mrocklin wants to merge 2 commits into dask:master
Follows on from dask#6508
|
FYI: I have been debugging using |
|
Good to know, and thank you for organizing things as we proceed down this path. HLGs are relatively new in the codebase and not as well worn as other parts. I hope that this PR helps to expose additional errors. My experience is that placing checks into |
|
I'm not sure what RAPIDS folks are up to these days, but this seems like a task that @rjzamora could probably knock out pretty well. |
When <dask#6509> passes, we can remove this fix, which introduces a significant overhead
|
It looks like #6507 fixes all failures besides |
My guess is that the smallest delta of a change would be to keep Delayed using HighLevelGraphs. Naively I would expect that using the key name as the layer name would be fine. I'm curious why it isn't working out well. It also looks like we didn't wire everything up properly in the svd code, which is not surprising: it's one of the more complicated multi-layered functions we have. |
|
Is this still needed now that PR #6588 is in? |
|
Nope, we're good. Thanks for flagging this @jakirkham |
This applies a validation function made by @madsbk in #6508 to all of the dask array and dask dataframe test suites. It exposes a few cases where our bookkeeping around HLG dependencies is incorrect.
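
For readers unfamiliar with what "bookkeeping around HLG dependencies" means, here is a toy sketch of that kind of check. It is not dask's `HighLevelGraph.validate` implementation: the names `referenced_keys` and `validate_layers` are hypothetical, the graph is represented as plain dicts, and keys are assumed to be strings only. The idea is that each layer declares which other layers it depends on, and the check recomputes those dependencies from the tasks and complains when the two disagree.

```python
from operator import add


def referenced_keys(task, all_keys):
    """Collect every known graph key that appears anywhere inside a task."""
    if isinstance(task, str):
        return {task} if task in all_keys else set()
    if isinstance(task, (tuple, list)):
        found = set()
        for item in task:
            found |= referenced_keys(item, all_keys)
        return found
    # Callables, numbers, etc. reference no keys.
    return set()


def validate_layers(layers, dependencies):
    """Raise ValueError if declared layer dependencies do not match the tasks.

    layers:       {layer_name: {key: task}}
    dependencies: {layer_name: set of layer names it depends on}
    """
    if set(layers) != set(dependencies):
        raise ValueError("layers and dependencies must have the same names")

    # Map each key back to the layer that owns it.
    key_to_layer = {key: name for name, layer in layers.items() for key in layer}

    for name, layer in layers.items():
        actual = set()
        for task in layer.values():
            for key in referenced_keys(task, key_to_layer):
                owner = key_to_layer[key]
                if owner != name:
                    actual.add(owner)
        if actual != set(dependencies[name]):
            raise ValueError(
                f"layer {name!r} declares {set(dependencies[name])!r} "
                f"but its tasks depend on {actual!r}"
            )


# Layer "b" uses a key from layer "a", so it must declare "a" as a dependency.
layers = {
    "a": {"a-0": 1},
    "b": {"b-0": (add, "a-0", 1)},
}
good_dependencies = {"a": set(), "b": {"a"}}
bad_dependencies = {"a": set(), "b": set()}  # forgot to record b -> a

validate_layers(layers, good_dependencies)  # passes silently
```

Calling `validate_layers(layers, bad_dependencies)` raises `ValueError`, which is the sort of failure a shared test helper would surface if it ran such a check on every graph it compares.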