Add pre-deprecation warnings for chunksize and aggregate_files to read_parquet#9052

Merged
jsignell merged 1 commit into dask:main from rjzamora:pre-deprecate-chunksize-aggregate_files
May 10, 2022

Conversation

@rjzamora
Member

@rjzamora rjzamora commented May 6, 2022

As discussed in #9043 (for chunksize) and #9051 (for aggregate_files), I propose that we deprecate two complex and rarely-utilized arguments from read_parquet: chunksize and aggregate_files.

This PR simply adds "pre-deprecation" warnings for the targeted arguments (including links to the relevant issues discussing their deprecation). My goal is to find (and inform) any users who may be depending on these obscure options.
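For readers unfamiliar with the pattern, a pre-deprecation warning can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code added in this PR: the function name `read_parquet_sketch`, the message wording, and the choice of `FutureWarning` are all hypothetical.

```python
import warnings


def read_parquet_sketch(path, chunksize=None, aggregate_files=None):
    """Hypothetical stand-in for read_parquet; only the warning logic is shown."""
    if chunksize is not None:
        # Warn only when the caller actually passes the targeted argument.
        warnings.warn(
            "The `chunksize` argument may be deprecated in the future. "
            "See the discussion in issue #9043.",
            FutureWarning,
            stacklevel=2,  # attribute the warning to the caller's line
        )
    if aggregate_files is not None:
        warnings.warn(
            "The `aggregate_files` argument may be deprecated in the future. "
            "See the discussion in issue #9051.",
            FutureWarning,
            stacklevel=2,
        )
    # Placeholder return value; a real reader would build a DataFrame here.
    return {"path": path, "chunksize": chunksize, "aggregate_files": aggregate_files}
```

Because the warning fires only when an argument is explicitly set, users who never touch these options see no change in behavior.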

@jsignell
Member

Thanks Rick!

@jsignell jsignell merged commit 2115574 into dask:main May 10, 2022
@rjzamora rjzamora deleted the pre-deprecate-chunksize-aggregate_files branch May 10, 2022 21:00
erayaslan pushed a commit to erayaslan/dask that referenced this pull request May 12, 2022
…` and ``aggregate_files`` (dask#9052)

As discussed in dask#9043 (for `chunksize`) and dask#9051 (for `aggregate_files`), I propose that we deprecate two complex and rarely-utilized arguments from `read_parquet`: `chunksize` and `aggregate_files`.

This PR simply adds "pre-deprecation" warnings for the targeted arguments (including links to the relevant issues discussing their deprecation). My goal is to find (and inform) any users who may be depending on these obscure options.

  "metadata_task_size": metadata_task_size,
  "split_row_groups": split_row_groups,
- "chunksize=": chunksize,
+ "chunksize": chunksize,

Just saw this change - this looks like there was a bug in the chunksize implementation that made it not actually work before, is that correct? If so, that may be further evidence that this option can be removed.
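As a generic illustration of how a stray character in a dictionary key could leave an option silently inert, consider the sketch below. It does not establish what the Dask code actually did with this dictionary; the helper `lookup_chunksize` and the sample values are hypothetical.

```python
def lookup_chunksize(options):
    """Hypothetical consumer that reads the option by its expected key."""
    # A missing key falls back to None without raising an error.
    return options.get("chunksize", None)


# Key with a trailing "=" (as in the removed line) versus the corrected key.
misspelled = {"split_row_groups": True, "chunksize=": "128MiB"}
corrected = {"split_row_groups": True, "chunksize": "128MiB"}

value_from_misspelled = lookup_chunksize(misspelled)
value_from_corrected = lookup_chunksize(corrected)
```

With the misspelled key the lookup falls back to its default, so a consumer written this way would never see the user's value.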
