@wiedld commented Dec 27, 2024

Which issue does this PR close?

Addresses a need noted in a DataFusion issue by making two methods public.

Rationale for this change

When writing Parquet with the ArrowWriter, the default behavior is to encode the Arrow schema into the Parquet kv_metadata.

Other libraries may have their own Parquet writers, such as DataFusion's ParquetSink, and these need to perform the same operation in order to produce equivalent Parquet files (see the DataFusion issue).
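
For readers unfamiliar with the operation, here is a rough, std-only sketch of what "encode the schema in the kv_metadata" amounts to for a custom writer. It is not the parquet crate's code: `KeyValue` and `add_encoded_schema` are hypothetical stand-ins, the key name `ARROW:schema` is assumed to match the one the crate uses, and the encoded schema is treated as an opaque string (in the real writer it is, as far as I understand, a base64-encoded IPC serialization of the Arrow schema).

```rust
// Hypothetical stand-in for a Parquet key-value metadata entry.
#[derive(Debug, Clone, PartialEq)]
struct KeyValue {
    key: String,
    value: Option<String>,
}

// Assumed key under which the encoded Arrow schema is stored.
const ARROW_SCHEMA_META_KEY: &str = "ARROW:schema";

// Insert the encoded schema into the metadata list, overwriting any
// existing entry for the same key and leaving other entries untouched.
fn add_encoded_schema(metadata: &mut Vec<KeyValue>, encoded: String) {
    let existing = metadata
        .iter()
        .position(|kv| kv.key == ARROW_SCHEMA_META_KEY);
    match existing {
        // An entry already exists: replace its value in place.
        Some(index) => metadata[index].value = Some(encoded),
        // No entry yet: append a new one.
        None => metadata.push(KeyValue {
            key: ARROW_SCHEMA_META_KEY.to_string(),
            value: Some(encoded),
        }),
    }
}
```

A writer that skips this step still produces readable Parquet, but a reader can no longer recover the original Arrow schema from the file metadata, which is why an alternative writer needs access to the same encoding logic.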

What changes are included in this PR?

Make two existing methods public.

Are there any user-facing changes?

Exposes 2 existing methods.

@github-actions bot added the parquet label Dec 27, 2024
@wiedld marked this pull request as ready for review December 27, 2024 23:14

@alamb left a comment

Thank you @wiedld

@alamb merged commit 1d2696d into apache:main Dec 30, 2024
16 checks passed
CurtHagenlocher pushed a commit to CurtHagenlocher/arrow-rs that referenced this pull request Jan 13, 2025
svencowart pushed a commit to elastiflow/arrow-rs that referenced this pull request Jan 14, 2025