Skip to content

Conversation

@thinkharderdev
Copy link
Contributor

Which issue does this PR close?

Closes #.

Rationale for this change

Add filter pushdown for parquet async reader. This should enable push down of row-level filter predicates to the parquet decoder allowing us to avoid decoding overhead as well as (with some other tweaks) IO overhead when we have page index.

This will need quite a bit more spit and polish but wanted to get feedback on the basic API

What changes are included in this PR?

Are there any user-facing changes?

@github-actions github-actions bot added the parquet Changes to the parquet crate label Aug 3, 2022
@thinkharderdev
Copy link
Contributor Author

@tustvold @Ted-Jiang

@thinkharderdev thinkharderdev mentioned this pull request Aug 3, 2022
@tustvold
Copy link
Contributor

tustvold commented Aug 4, 2022

Thank you for this, I intend to explore the design space for this a bit this afternoon and report back. My main concern is being able to handle this in the sync part of the codebase, as this will both likely be simpler, but also benefit both APIs.

@tustvold
Copy link
Contributor

tustvold commented Aug 4, 2022

I intend to explore the design space for this a bit this afternoon and report back

Made reasonable progress on this, I intend to polish this up and push up a draft tomorrow

@tustvold
Copy link
Contributor

I think this can be closed now?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

parquet Changes to the parquet crate

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants