Tests for runtime field queries with fbf aggs by nik9000 · Pull Request #71503 · elastic/elasticsearch

nik9000 · 2021-04-08T20:56:15Z

This adds a few tests for runtime field queries applied to
"filter-by-filter" style aggregations. We expect to still be able to
use filter-by-filter aggregations to speed up collection when the top
level query is a runtime field. You'd think that filter-by-filter would
be slow when the top level query is slow, like it is with runtime
fields, but we only run filter-by-filter when we can translate each
aggregation bucket into a quick query. So long as the results of those
queries don't "overlap" we shouldn't end up running the slower top level
query more times than we would during regular collection.

This also adds some javadoc to that effect to the two places where we
chose between filter-by-filter and a "native" aggregation
implementation.

This adds a few tests for runtime field queries applied to "filter-by-filter" style aggregations. We expect to still be able to use filter-by-filter aggregations to speed up collection when the top level query is a runtime field. You'd think that filter-by-filter would be slow when the top level query is slow, like it is with runtime fields, but we only run filter-by-filter when we can translate each aggregation bucket into a quick query. So long as the results of those queries don't "overlap" we shouldn't end up running the slower top level query more times than we would during regular collection. This also adds some javadoc to that effect to the two places where we chose between filter-by-filter and a "native" aggregation implementation.

elasticmachine · 2021-04-08T20:56:18Z

Pinging @elastic/es-analytics-geo (Team:Analytics)

nik9000 · 2021-04-08T20:56:37Z

server/src/main/java/org/elasticsearch/search/aggregations/bucket/filter/FiltersAggregator.java

+                segmentsCollected++;
                collectCount(ctx, live);
            } else {
+                segmentsCounted++;


These were missing! Ooops.

not-napoleon

LGTM.

not-napoleon · 2021-04-12T13:40:48Z

...r/src/test/java/org/elasticsearch/search/aggregations/bucket/range/RangeAggregatorTests.java

+            }
+        };
+        Query query = new StringScriptFieldTermQuery(new Script("dummy"), scriptFactory, "dummy", "cat", false);
+        debugTestCase(new RangeAggregationBuilder("r").field(NUMBER_FIELD_NAME).addRange(0, 1).addRange(1, 2).addRange(2, 3), query, iw -> {


Nit - I think the formatting here isn't following the new standard; Can you just run the auto-formatter on this, please?

Surprisingly, this is what the standard looks like. If I had to guess the lambdas are letting this bunch up somehow.

I stand corrected. Thanks for checking!

not-napoleon · 2021-04-12T13:49:44Z

test/framework/src/main/java/org/elasticsearch/search/aggregations/AggregatorTestCase.java

+     * {@link Aggregator} per leaf and perform partial reductions. It always
+     * creates a single {@link Aggregator} so we can get consistent debug info.
+     */
+    protected <R extends InternalAggregation> void debugTestCase(


Thanks for adding this!

This adds a few tests for runtime field queries applied to "filter-by-filter" style aggregations. We expect to still be able to use filter-by-filter aggregations to speed up collection when the top level query is a runtime field. You'd think that filter-by-filter would be slow when the top level query is slow, like it is with runtime fields, but we only run filter-by-filter when we can translate each aggregation bucket into a quick query. So long as the results of those queries don't "overlap" we shouldn't end up running the slower top level query more times than we would during regular collection. This also adds some javadoc to that effect to the two places where we chose between filter-by-filter and a "native" aggregation implementation.

…71585) This adds a few tests for runtime field queries applied to "filter-by-filter" style aggregations. We expect to still be able to use filter-by-filter aggregations to speed up collection when the top level query is a runtime field. You'd think that filter-by-filter would be slow when the top level query is slow, like it is with runtime fields, but we only run filter-by-filter when we can translate each aggregation bucket into a quick query. So long as the results of those queries don't "overlap" we shouldn't end up running the slower top level query more times than we would during regular collection. This also adds some javadoc to that effect to the two places where we chose between filter-by-filter and a "native" aggregation implementation.

nik9000 added 3 commits April 8, 2021 13:59

Drop method we don't need

a6c8cc1

fix bug

229c46d

nik9000 added >test Issues or PRs that are addressing/adding tests :Analytics/Aggregations Aggregations v8.0.0 v7.13.0 labels Apr 8, 2021

nik9000 requested a review from not-napoleon April 8, 2021 20:56

elasticmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Apr 8, 2021

nik9000 commented Apr 8, 2021

View reviewed changes

Backwards

5ce7e4a

not-napoleon approved these changes Apr 12, 2021

View reviewed changes

nik9000 added 2 commits April 12, 2021 13:14

Merge branch 'master' into terms_agg_de_optimize_runtime

7d9c6fb

Fixup

00a4285

nik9000 mentioned this pull request Apr 12, 2021

Speed up terms agg when not force merged #71241

Merged

nik9000 merged commit 3583ba0 into elastic:master Apr 12, 2021

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tests for runtime field queries with fbf aggs#71503

Tests for runtime field queries with fbf aggs#71503
nik9000 merged 6 commits intoelastic:masterfrom
nik9000:terms_agg_de_optimize_runtime

nik9000 commented Apr 8, 2021

Uh oh!

elasticmachine commented Apr 8, 2021

Uh oh!

nik9000 Apr 8, 2021

Uh oh!

not-napoleon left a comment

Uh oh!

not-napoleon Apr 12, 2021

Uh oh!

nik9000 Apr 12, 2021

Uh oh!

not-napoleon Apr 12, 2021

Uh oh!

not-napoleon Apr 12, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

nik9000 commented Apr 8, 2021

Uh oh!

elasticmachine commented Apr 8, 2021

Uh oh!

nik9000 Apr 8, 2021

Choose a reason for hiding this comment

Uh oh!

not-napoleon left a comment

Choose a reason for hiding this comment

Uh oh!

not-napoleon Apr 12, 2021

Choose a reason for hiding this comment

Uh oh!

nik9000 Apr 12, 2021

Choose a reason for hiding this comment

Uh oh!

not-napoleon Apr 12, 2021

Choose a reason for hiding this comment

Uh oh!

not-napoleon Apr 12, 2021

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants