-
Notifications
You must be signed in to change notification settings - Fork 3.7k
[External](parquet) pass non predicates column's offset index to RowGroupReader #55795
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
|
run buildall |
TPC-H: Total hot run time: 34644 ms |
TPC-DS: Total hot run time: 189549 ms |
ClickBench: Total hot run time: 29.71 s |
f9b4486 to
960e2ca
Compare
|
run buildall |
TPC-H: Total hot run time: 34512 ms |
TPC-DS: Total hot run time: 188475 ms |
960e2ca to
2d47b96
Compare
|
run buildall |
ClickBench: Total hot run time: 30.54 s |
TPC-H: Total hot run time: 34669 ms |
TPC-DS: Total hot run time: 188775 ms |
ClickBench: Total hot run time: 30.38 s |
2d47b96 to
2e39ffd
Compare
|
run buildall |
TPC-H: Total hot run time: 34756 ms |
TPC-DS: Total hot run time: 189533 ms |
ClickBench: Total hot run time: 30.5 s |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
|
PR approved by at least one committer and no changes requested. |
|
PR approved by anyone and no changes requested. |
2e39ffd to
c5f6b09
Compare
|
run buildall |
kaka11chen
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
morningman
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
…roupReader (#55795) ### What problem does this PR solve? Now, Parquet reader don't pass non predicate coloumn's offset index to `RowGroupReader`. `create_page_reader` will create `PageReader` rather than `PageReaderWithOffsetIndex`. The `PageReader` will parse each page header when `skip_page` and `next_page_header`. In the case of merging io, read and enlarge severely. So, in this PR, we pass the offset index to `RowGroupReader`. Co-authored-by: liutang123 <liulijia@gmail.com>
…roupReader (apache#55795) ### What problem does this PR solve? Now, Parquet reader don't pass non predicate coloumn's offset index to `RowGroupReader`. `create_page_reader` will create `PageReader` rather than `PageReaderWithOffsetIndex`. The `PageReader` will parse each page header when `skip_page` and `next_page_header`. In the case of merging io, read and enlarge severely. So, in this PR, we pass the offset index to `RowGroupReader`. Co-authored-by: liutang123 <liulijia@gmail.com>
…dex to RowGroupReader (apache#55795)
What problem does this PR solve?
Now, Parquet reader don't pass non predicate coloumn's offset index to
RowGroupReader.create_page_readerwill createPageReaderrather thanPageReaderWithOffsetIndex.The
PageReaderwill parse each page header whenskip_pageandnext_page_header. In the case of merging io, read and enlarge severely.So, in this PR, we pass the offset index to
RowGroupReader.Issue Number: close #xxx
Related PR: #xxx
Problem Summary:
Release note
None
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)