-
Notifications
You must be signed in to change notification settings - Fork 3.7k
[fix](count) fix wrong count push down logic #56182
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
|
run buildall |
TPC-H: Total hot run time: 1498 ms |
TPC-DS: Total hot run time: 2763 ms |
ClickBench: Total hot run time: 0.03 s |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
|
run buildall |
FE Regression Coverage ReportIncrement line coverage |
TPC-H: Total hot run time: 1489 ms |
TPC-DS: Total hot run time: 2764 ms |
ClickBench: Total hot run time: 0.05 s |
FE UT Coverage ReportIncrement line coverage |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
FE Regression Coverage ReportIncrement line coverage |
|
run buildall |
Cloud UT Coverage ReportIncrement line coverage Increment coverage report
|
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
FE Regression Coverage ReportIncrement line coverage |
793faa2 to
2e290c0
Compare
|
run buildall |
Cloud UT Coverage ReportIncrement line coverage Increment coverage report
|
FE UT Coverage ReportIncrement line coverage |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
TPC-H: Total hot run time: 1483 ms |
TPC-DS: Total hot run time: 2745 ms |
ClickBench: Total hot run time: 0.04 s |
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
FE Regression Coverage ReportIncrement line coverage |
|
PR approved by at least one committer and no changes requested. |
|
PR approved by anyone and no changes requested. |
### What problem does this PR solve? Introduced from topn optimization. When executing query like `select count(*) from tbl`, it will trigger "count push down optimization". which means it will send some "dummy" split to BE, each with a part of row count number. But due to the bug, BE will use the range offset info in these dummy split to do the row group filter logic, which is incorrect and will result in empty result because all row group will be filtered. This PR fix it, to not filter the row group if it is a dummy split. How to reproduce: 1. find an iceberg table with file size at least 16MB 2. set file_split_size=4MB 3. select count(*) from table, it will return empty result
Introduced from topn optimization. When executing query like `select count(*) from tbl`, it will trigger "count push down optimization". which means it will send some "dummy" split to BE, each with a part of row count number. But due to the bug, BE will use the range offset info in these dummy split to do the row group filter logic, which is incorrect and will result in empty result because all row group will be filtered. This PR fix it, to not filter the row group if it is a dummy split. How to reproduce: 1. find an iceberg table with file size at least 16MB 2. set file_split_size=4MB 3. select count(*) from table, it will return empty result
What problem does this PR solve?
Introduced from topn optimization.
When executing query like
select count(*) from tbl, it will trigger "count push down optimization".which means it will send some "dummy" split to BE, each with a part of row count number.
But due to the bug, BE will use the range offset info in these dummy split to do the row group filter logic,
which is incorrect and will result in empty result because all row group will be filtered.
This PR fix it, to not filter the row group if it is a dummy split.
How to reproduce:
Release note
None
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)