Skip to content

Add __dask_tokenize__ definitions to cudf classes#13695

Merged
rapids-bot[bot] merged 21 commits intorapidsai:branch-23.08from
rjzamora:normalize-token-cudf-2
Jul 18, 2023
Merged

Add __dask_tokenize__ definitions to cudf classes#13695
rapids-bot[bot] merged 21 commits intorapidsai:branch-23.08from
rjzamora:normalize-token-cudf-2

Conversation

@rjzamora
Copy link
Member

@rjzamora rjzamora commented Jul 13, 2023

Description

Possible alternative to #13692. See this comment

General motivation: We need deterministic tokenization in dask-expr. These changes provide what we need. We also need dask-cudf to be imported whenever we are using a cudf backend in dask.dataframe or dask_expr. These changes ensure that dask_cudf is imported (when available).

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@rjzamora rjzamora added 2 - In Progress Currently a work in progress Python Affects Python cuDF API. dask Dask issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change helps: Dask labels Jul 13, 2023
@rjzamora rjzamora self-assigned this Jul 13, 2023
@rjzamora rjzamora marked this pull request as ready for review July 17, 2023 13:07
@rjzamora rjzamora requested review from a team as code owners July 17, 2023 13:07
@rjzamora rjzamora requested review from galipremsagar and wence- July 17, 2023 13:07
@rjzamora rjzamora added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currently a work in progress labels Jul 17, 2023
Copy link
Contributor

@wence- wence- left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure the specialisation for genericindex is necessary.

Copy link
Contributor

@wence- wence- left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor change if you like, but looks good now I think.

@rjzamora rjzamora added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Jul 18, 2023
@rjzamora
Copy link
Member Author

/merge

@rapids-bot rapids-bot bot merged commit 2aa0bab into rapidsai:branch-23.08 Jul 18, 2023
@rjzamora rjzamora deleted the normalize-token-cudf-2 branch July 18, 2023 13:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

5 - Ready to Merge Testing and reviews complete, ready to merge dask Dask issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change Python Affects Python cuDF API.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants