Skip to content

fix(parquet): Reading UUID columns#173

Merged
zeroshade merged 3 commits intoapache:mainfrom
zeroshade:uuid-parquet
Oct 26, 2024
Merged

fix(parquet): Reading UUID columns#173
zeroshade merged 3 commits intoapache:mainfrom
zeroshade:uuid-parquet

Conversation

@zeroshade
Copy link
Copy Markdown
Member

Split from #171 to be a more focused PR.

Currently we will properly write arrow data with the canonical UUID extension type as a parquet UUID column via pqarrow. This PR enables us to read back that data using the extensions.UUID data type correctly even when we don't have a stored schema.

Added a test to the ArrowExtensionTypeRoundTrip to ensure proper round trip without a stored schema.

Copy link
Copy Markdown
Member

@kou kou left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

Co-authored-by: Sutou Kouhei <kou@cozmixng.org>
@zeroshade zeroshade merged commit fe4bd93 into apache:main Oct 26, 2024
@zeroshade zeroshade deleted the uuid-parquet branch October 26, 2024 15:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants