ES|QL: Push down TopN into Fork branches by ioanatia · Pull Request #139605 · elastic/elasticsearch

ioanatia · 2025-12-16T14:01:01Z

Pushes OrderBy + Limit into FORK branches, when the FORK branch queries an index (EsRelation, not LocalRelation is present) and does not contain a pipeline breaker (and its output is unbounded).
This optimization will help when we remove the implicit LIMIT for FORK.

As a conceptual example:

FROM my-index
| FORK
   (WHERE x > 1000)
   (WHERE y > 1000)
| SORT z
| LIMIT 10

becomes

FROM my-index
| FORK
          (WHERE x > 1000 | SORT z | LIMIT 10)
          (WHERE y > 1000 | SORT z | LIMIT 10)
| SORT z
| LIMIT 10

Because right now an implicit limit is always added, this optimization does not have an effect.
In order to properly test it, we introduce a query pragma that tells the analyzer to not add the implicit limit.
Query pragmas are not user facing and are mostly used for internal testing.

Additional work is needed, in the following examples, TopN is not being pushed down:

FROM my-index
| FORK
  (WHERE x > 1000)
  (WHERE y > 1000)
| MV_EXPAND w
| SORT z
| LIMIT 10    


FROM my-index
| FORK
  (WHERE x > 1000)
  (WHERE y > 1000)
| EVAL a = a + 1
| SORT z
| LIMIT 10

elasticsearchmachine · 2026-01-15T16:07:10Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

carlosdelest

Looks great!

A couple of questions, but nothing that prevents merging 👍

...g/elasticsearch/xpack/esql/optimizer/rules/logical/PushDownLimitAndOrderByIntoForkTests.java

carlosdelest · 2026-01-16T08:39:45Z

...va/org/elasticsearch/xpack/esql/optimizer/rules/logical/PushDownLimitAndOrderByIntoFork.java

+        return outputMap;
+    }
+
+    private boolean shouldPushDownIntoForkBranch(LogicalPlan plan) {


Should we push down if we SORT on a column that the fork branch does not produce?

we don't push down in this case, because right now we are only pushing down when SORT + LIMIT immediately follow a FORK. We should only be in this case when we sort on attributes produced by FORK.
What I can do for good measure is too add an assertion to explicitly check that when we push down in maybePushDownLimitAndOrderByToForkBranch

...g/elasticsearch/xpack/esql/optimizer/rules/logical/PushDownLimitAndOrderByIntoForkTests.java

fang-xing-esql

LGTM from functional perspective, thanks @ioanatia !

There is one performance related uncertainty comes to my mind. sort can be expensive, and pushing down sort into each branch implies we could potentially execute an expensive sort in parallel across multiple branches, and increase the total cost. On the other hand, limit is pushed down as well, each branch may return fewer rows potentially, which could help performance potentially, and offset the extra sorting cost. We will likely need performance benchmarks to validate the trade offs. This is behind a pragma, I think it is good for now.

ioanatia · 2026-01-20T14:59:09Z

On the other hand, limit is pushed down as well, each branch may return fewer rows potentially, which could help performance potentially, and offset the extra sorting cost. We will likely need performance benchmarks to validate the trade offs. This is behind a pragma, I think it is good for now.

We only push down if there's no pipeline breaker in the FORK branch - we will need some performance benchmarks for sure.
There's some follow up improvements we could do here:

if we push down to all FORK branches, we know the pages outputted from FORK should always be sorted - so we could use a similar optimization to ES|QL: Sort faster (Optimize TopNOperator in final plans) #131221
when we push down to a FORK branch, we could skip the intermediary TopN that gets executed on the coordinator as part of the FORK branch, and just output the pages that are coming from the data nodes from the FORK branch. The main TopN that's present after FORK should already take in sorting those.

again - we would need some benchmarks to see the real benefits

Push down TopN into Fork branches

2e8b858

ioanatia added >non-issue Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch :Search Relevance/ES|QL Search functionality in ES|QL labels Dec 16, 2025

elasticsearchmachine added v9.3.0 v9.4.0 and removed v9.3.0 labels Dec 16, 2025

Merge branch 'main' into fork_order_by_limit_pushdown

1473dd8

ioanatia mentioned this pull request Jan 12, 2026

ES|QL: Optimizations for when we remove implicit LIMIT for FORK #136820

Open

3 tasks

ioanatia and others added 7 commits January 13, 2026 10:37

Merge branch 'main' into fork_order_by_limit_pushdown

6c8a135

Checkstyle fix

3bd3b54

Add query pragma for removing fork limit

5d957af

[CI] Auto commit changes from spotless

1d261b0

Merge branch 'main' into fork_order_by_limit_pushdown

bdaa398

Ignore branches using LocalRelation for now

eda6622

Merge branch 'main' into fork_order_by_limit_pushdown

0e173a3

ioanatia mentioned this pull request Jan 14, 2026

ES|QL: Prune fork branches with empty results #140593

Merged

ioanatia and others added 3 commits January 15, 2026 13:57

Merge branch 'main' into fork_order_by_limit_pushdown

a82fccd

Remove hack now that we have PruneEmptyForkBranches

f09ca21

Add comment

f872727

ioanatia marked this pull request as ready for review January 15, 2026 16:06

ioanatia requested a review from carlosdelest January 15, 2026 16:07

ioanatia requested a review from fang-xing-esql January 15, 2026 16:07

carlosdelest approved these changes Jan 16, 2026

View reviewed changes

fang-xing-esql approved these changes Jan 16, 2026

View reviewed changes

ioanatia and others added 4 commits January 19, 2026 14:02

Add a test for subqueries and assertion

fd2aa6d

Merge branch 'main' into fork_order_by_limit_pushdown

f88dc1d

Make sure we don't break release tests

7fd180b

Merge branch 'main' into fork_order_by_limit_pushdown

4c684f8

ioanatia requested a review from carlosdelest January 20, 2026 15:00

carlosdelest approved these changes Jan 20, 2026

View reviewed changes

ioanatia merged commit e91962d into elastic:main Jan 20, 2026
35 checks passed

ioanatia deleted the fork_order_by_limit_pushdown branch January 20, 2026 15:37

spinscale pushed a commit to spinscale/elasticsearch that referenced this pull request Jan 21, 2026

ES|QL: Push down TopN into Fork branches (elastic#139605)

981d307

carlosdelest mentioned this pull request Feb 11, 2026

add hybrid queries to msmarco-v2-vector in ESQL and DSL mode (#1010) elastic/rally-tracks#1010

Merged

kkharbas mentioned this pull request Feb 13, 2026

hybrid search fix: add back top-n into fork branches elastic/rally-tracks#1050

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ES|QL: Push down TopN into Fork branches#139605

ES|QL: Push down TopN into Fork branches#139605
ioanatia merged 16 commits intoelastic:mainfrom
ioanatia:fork_order_by_limit_pushdown

ioanatia commented Dec 16, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Jan 15, 2026

Uh oh!

carlosdelest left a comment

Uh oh!

Uh oh!

carlosdelest Jan 16, 2026

Uh oh!

ioanatia Jan 19, 2026

Uh oh!

Uh oh!

fang-xing-esql left a comment

Uh oh!

ioanatia commented Jan 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ioanatia commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Jan 15, 2026

Uh oh!

carlosdelest left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

carlosdelest Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

ioanatia Jan 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fang-xing-esql left a comment

Choose a reason for hiding this comment

Uh oh!

ioanatia commented Jan 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ioanatia commented Dec 16, 2025 •

edited

Loading