-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Open
apache/arrow-site
#741Labels
enhancementAny new improvement worthy of a entry in the changelogAny new improvement worthy of a entry in the changelog
Description
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
-
Part of [Epic] Parquet Reader Improvement Plan / Proposal - July 2025 #8000
-
Related to Decouple IO and CPU operations in the Parquet Reader (push decoder) #7983
Describe the solution you'd like
I would like to explain the benefits of a push decoder in a more digestable format than a PR (even if the code is well documented
)
I think this will also help reviewers
Describe alternatives you've considered
Idea is to write a blog post about the push parquet decoder, and how it can be used to offer more fine grained control over IO and CPU work in the parquet reader.
The blog post would cover:
- Motivation: why do we need a push decoder?
- Design: how does the push decoder work?
- Examples: how to use the push decoder in practice
- Performance: how does the push decoder perform compared to the existing parquet reader?
- Future work: what are the next steps for the push decoder?
Additional context
Metadata
Metadata
Assignees
Labels
enhancementAny new improvement worthy of a entry in the changelogAny new improvement worthy of a entry in the changelog