Fix field resolution for FORK by ioanatia · Pull Request #128193 · elastic/elasticsearch

ioanatia · 2025-05-20T12:24:00Z

Should close #127208

Part of #121950

AFAICS we don't need to ask for all fields when FORK is used and the current field resolution should work since FORK is a N-ary plan.

elasticsearchmachine · 2025-05-20T18:25:08Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

ChrisHegarty

LGTM

astefan

From code and tests in csv-spec files I noticed that FORK is "generating" a _fork (sort of hidden) field. Can you, please, add tests with queries that make use of this _fork field?

astefan

The query below (from rerank.csv-spec) is using as a field name _fork which is strange (many other queries using fork and rrf do this). This is the full list of field names it's asking for: book_no, author, title, _fork, title.*, _fork.*, author.*, book_no.*.

FROM books METADATA _id, _index, _score
            | FORK ( WHERE title:"Tolkien" | SORT _score, _id DESC | LIMIT 3 )
            ( WHERE author:"Tolkien" | SORT _score, _id DESC | LIMIT 3 )
            | RRF
            | RERANK "Tolkien" ON title WITH test_reranker
            | LIMIT 2
            | KEEP book_no, title, author

EsqlSession.fieldNames automatically ignores MetadataAttributes (like _id, _score, _index and others). I'm sure you've discussed about this: what was the reason for not considering _fork as a MetadataAttribute?

ioanatia · 2025-05-21T09:55:09Z

...in/esql/src/test/java/org/elasticsearch/xpack/esql/session/IndexResolverFieldNamesTests.java

+            | WHERE a > 2000
+            | EVAL b = a + 100
+            | FORK (WHERE c > 1 AND a < 10000 | EVAL d = a + 500)
+                   (STATS x = count(*), y=min(z))


hmmm - now that I am looking at this more - I think we should return all fields, because one branch is not bounded by any command like KEEP or STATS that resets the output to a known list of fields. will fix 🤔

There is one more query where I am confused, in part because of probably not knowing what is the expectation for fork.

FROM employees | FORK ( WHERE true | stats min(salary) by gender) ( WHERE true | LIMIT 3 )

This shows these columns:

min(salary) | gender | _fork | salary

Should these be a "union" kind of set of columns and fieldNames is only limiting it to what it "sees" in the fork's first branch? If that's true, this means that fieldNames should consider a union kind of field names from all the branches of fork. As a shortcut, the first branch that it finds that's the "widest" it should stop checking the rest.

…tion or project

ioanatia · 2025-05-21T11:08:08Z

EsqlSession.fieldNames automatically ignores MetadataAttributes (like _id, _score, _index and others). I'm sure you've discussed about this: what was the reason for not considering _fork as a MetadataAttribute?

We could consider _fork a metadata attribute - but unlike _id, _index, you could actually have a _fork mapped field.
Promise to take this as a separate question that we can address, but for now we end up asking field_caps for the _fork field when we see it referenced.

tteofili

LGTM

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/session/EsqlSession.java

astefan

LGTM.

Please, also, add that query I mentioned in one of my comments: FROM employees | FORK ( WHERE true | stats min(salary) by gender) ( WHERE true | LIMIT 3 ) to IndexResolverFieldNamesTests. Thank you.

Fix field resolution for FORK

5a64c9b

ioanatia added >non-issue :Analytics/ES|QL AKA ESQL Team:SearchOrg Meta label for the Search Org (Enterprise Search) v9.1.0 labels May 20, 2025

ioanatia added 2 commits May 20, 2025 18:36

Merge branch 'main' into fork_field_names

c4a25d3

Merge branch 'main' into fork_field_names

af3b955

ioanatia marked this pull request as ready for review May 20, 2025 18:24

ioanatia requested a review from ChrisHegarty May 20, 2025 18:24

elasticsearchmachine added Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) and removed Team:SearchOrg Meta label for the Search Org (Enterprise Search) labels May 20, 2025

ioanatia requested review from afoucret, astefan, carlosdelest and tteofili May 20, 2025 18:25

ChrisHegarty approved these changes May 21, 2025

View reviewed changes

astefan reviewed May 21, 2025

View reviewed changes

ioanatia commented May 21, 2025

View reviewed changes

Fix field resolution when one FORK branch is not bounded by a aggrega…

4d23bc0

…tion or project

tteofili approved these changes May 21, 2025

View reviewed changes

astefan reviewed May 21, 2025

View reviewed changes

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/session/EsqlSession.java Show resolved Hide resolved

astefan approved these changes May 21, 2025

View reviewed changes

Add test and apply refactor suggestion

34c883e

ioanatia mentioned this pull request May 21, 2025

[CI] CrossClusterQueryWithFiltersIT testTimestampFilterFromQuery failing #127332

Closed

ioanatia merged commit c8581b0 into elastic:main May 21, 2025
17 of 18 checks passed

ioanatia deleted the fork_field_names branch May 21, 2025 17:34

ioanatia mentioned this pull request May 22, 2025

[CI] IndexResolverFieldNamesTests testForkWithStatsInAllBranches failing #128272

Closed

ioanatia mentioned this pull request Jul 24, 2025

ES|QL: Fix Fork field reference tracking #131723

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix field resolution for FORK#128193

Fix field resolution for FORK#128193
ioanatia merged 5 commits intoelastic:mainfrom
ioanatia:fork_field_names

ioanatia commented May 20, 2025

Uh oh!

elasticsearchmachine commented May 20, 2025

Uh oh!

ChrisHegarty left a comment

Uh oh!

astefan left a comment

Uh oh!

astefan left a comment

Uh oh!

ioanatia May 21, 2025

Uh oh!

astefan May 21, 2025

Uh oh!

ioanatia May 21, 2025

Uh oh!

ioanatia commented May 21, 2025 •

edited

Loading

Uh oh!

tteofili left a comment

Uh oh!

Uh oh!

astefan left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

ioanatia commented May 20, 2025

Uh oh!

elasticsearchmachine commented May 20, 2025

Uh oh!

ChrisHegarty left a comment

Choose a reason for hiding this comment

Uh oh!

astefan left a comment

Choose a reason for hiding this comment

Uh oh!

astefan left a comment

Choose a reason for hiding this comment

Uh oh!

ioanatia May 21, 2025

Choose a reason for hiding this comment

Uh oh!

astefan May 21, 2025

Choose a reason for hiding this comment

Uh oh!

ioanatia May 21, 2025

Choose a reason for hiding this comment

Uh oh!

ioanatia commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tteofili left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

astefan left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ioanatia commented May 21, 2025 •

edited

Loading

astefan left a comment •

edited

Loading