ES|QL: Fix KQL/QSTR with unmapped fields in NULLIFY mode by luigidellaquila · Pull Request #143399 · elastic/elasticsearch

luigidellaquila · 2026-03-02T16:12:27Z

Fixing full text functions (KQL, QSTR) usage in queries with unmapped_fields=NULLIFY.

When using SET unmapped_fields="nullify", KQL and QSTR functions would fail validation due toEval nodes added right after EsRelation.

Here we propose a different approach: instead of adding an EVAL, we add unmapped fields directly to EsRelation's output with DataType.NULL.
ReplaceFieldWithConstantOrNull (local planner) rule will replace them with null literals anyway.

This should be fine, this is the same thing we do with non-existing fields locally (with index patterns).
And it's conceptually the same thing we do with LOAD strategy.

For non-FROM queries (eg. ROW) we still fall back to adding an EVAL.

I also evaluated two other solutions, but they don't seem to work:

make the fulltext functions validation more fine-grained, eg. allowing EVALs with all NULL values before KQL. It's fragile, users would be able to add an EVAL non_existing = null manually, and it would be hard to distinguish.
add the EVAL right before the unsupported fields are used (rather than right after FROM): it doesn't work in all cases, eg. we'd have to consider KEEP/DROP and other commands that mask the non-existing fields. It's fragile and complicated.

Fixes: #142968
Fixes: #142959

elasticsearchmachine · 2026-03-02T16:12:52Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

elasticsearchmachine · 2026-03-02T16:13:15Z

Hi @luigidellaquila, I've created a changelog YAML for you.

alex-spies · 2026-03-03T14:18:04Z

Without having actually reviewed, I'm not sure about the approach, as it works around full-text functions' limitations. I think that conceptually, NULL is a bottom type and each function that can work on a field or column should also work when passed the null literal.

~~I explored the rule "every function should take NULL" in another PR, and there are only few exceptions where functions don't behave this way, full-text functions being the most prominent ones.~~

Update: please disregard, this is not about type compatibility, but about outright rejecting EVALs before KQL and QSTR.

alex-spies · 2026-03-03T14:50:06Z

Had a look at the approach. Still a little unsure. It's a pretty neat idea, though, and I think it's growing on me.

Injecting an EVAL after the EsRelation is a pretty standard thing we do, it just messes with KQL/QSTR validation because those don't want user-made EVALs before being called. Updating this validation may have less potential side effects, resp. may be a conceptually more isolated change compared to this PR, which drastically changes the effect of nullify on query plans.

Creating NULL-typed field attributes, like this PR proposes, means that wherever we process a field attribute, we may get bugs because we don't expect that attribute to correspond to a missing field. For instance, this approach requires testing to prevent wrong filter and topn pushdowns to Lucene, which isn't a problem we had to worry about before. For this reason, this PR should wait until #143460 is merged to avoid this problem.

That said, load uses PotentiallyUnmappedKeywordEsFields and does place them into the EsRelation, so this PR is an interesting opportunity to make nullify and load behave more similarly. But I think that we shouldn't just re-use EsField, but introduce an analogous MissingEsField (or so) so that we can more easily pattern match on. Golden tests should also clearly show that this isn't a standard field attribute #f but a missing field; for PotentiallyUnmappedKeywordEsField, this will be implemented in another PR (see here).

@luigidellaquila , can we leave this open until the following are merged?

ESQL: Prevent pushdown of unmapped fields in filters and sorts #143460
DRAFT: LOAD/NULLIFY golden tests #141821
Unless we find anything that this approach would somehow break, I think I'm in favor of refactoring nullify this way, therefore aligning it with load and simplifying the coordinator plan.

luigidellaquila · 2026-03-03T14:57:01Z

Thanks for the feedback @alex-spies

Creating NULL-typed field attributes, like this PR proposes, means that wherever we process a field attribute, we may get bugs because we don't expect that attribute to correspond to a missing field. For instance, this approach requires testing to prevent wrong filter and topn pushdowns to Lucene, which isn't a problem we had to worry about before

Actually, this is exactly what we do today with index patterns, a node/cluster could receive attributes that don't exist in the local indices, and the local planner just ignores them and emits a null. That's the main reason that makes me consider this approach (probably) safe

But I think that we shouldn't just re-use EsField, but introduce an analogous MissingEsField (or so) so that we can more easily pattern match on

I thought about this and I don't have a very strong opinion. It could be useful for testing, but in practice this is just a null value, so I'm not sure.

can we leave this open until the following are merged?

Yes, absolutely! More tests = catching more potential corner cases

alex-spies · 2026-03-03T16:53:46Z

Actually, this is exactly what we do today with index patterns, a node/cluster could receive attributes that don't exist in the local indices, and the local planner just ignores them and emits a null.

What's important to check is if FROM idx | WHERE missing == "foobar" still leads to an optimization to a LocalRelation. With this approach, I don't think it does. That might be a reason not to go with it unless we also ensure our optimizations correctly pick up the NULL field attribute.

A simpler approach might be to ensure the added NULL aliases in the EVAL (added by ResolveUnmapped) are synthetic, and skip the validation of KQL/QSTR that forbids EVALs if the EVAL only adds synthetic attributes.

luigidellaquila · 2026-03-03T17:05:01Z

What's important to check is if FROM idx | WHERE missing == "foobar" still leads to an optimization to a LocalRelation. With this approach, I don't think it does

I tend to put correctness on top of performance, but I think in this case we are lucky.
I did a quick test:

[2026-03-03T17:59:06,981][TRACE][o.e.x.e.o.L.changes      ] [runTask-0] Rule logical.FoldNull applied with change
Limit[1000[INTEGER],false,false]                                           = Limit[1000[INTEGER],false,false]
\_Filter[missing{f}#49 == foobar[KEYWORD]]                                 ! \_Filter[null[BOOLEAN]]
  \_EsRelation[idx][123foo{f}#44, @timestamp{f}#45, foo{f}#46, keyword{..] =   \_EsRelation[idx][123foo{f}#44, @timestamp{f}#45, foo{f}#46, keyword{..]

[2026-03-03T17:59:06,982][TRACE][o.e.x.e.o.L.changes      ] [runTask-0] Rule logical.PruneFilters applied with change
Limit[1000[INTEGER],false,false]                                           = Limit[1000[INTEGER],false,false]
\_Filter[null[BOOLEAN]]                                                    ! \_LocalRelation[[123foo{f}#44, @timestamp{f}#45, foo{f}#46, keyword{f}#48, text{f}#47, missing{f}#49],EMPTY]
  \_EsRelation[idx][123foo{f}#44, @timestamp{f}#45, foo{f}#46, keyword{..] !

It works because FoldNull checks isGuaranteedNull(), that also checks for NULL field type.

A simpler approach might be to ensure the added NULL aliases in the EVAL (added by ResolveUnmapped) are synthetic

I also tried this path, but synthetic attributes are not supposed to be returned as a result, so I got other failures.

alex-spies · 2026-03-04T08:50:11Z

It works because FoldNull checks isGuaranteedNull(), that also checks for NULL field type.

Oh, that's a nice surprise. Thanks for checking!

In this case, this looks like a nice improvement over the original approach and I think we should go for it, provided that we are a little more paranoid about testing things that could break.

coderabbitai · 2026-03-06T09:58:26Z

Important

Review skipped

Auto reviews are limited based on label configuration.

🏷️ Required labels (at least one) (2)

Team:Delivery
Team:Search - Inference

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

Run ID: 2559112b-a859-4f95-a373-970f610fd7fc

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Tip

Try Coding Plans. Let us write the prompt for your AI agent so you can ship faster (with fewer bugs).
Share your feedback on Discord.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

bpintea

LGTM.
Left two nits, but not worth going through CI if these will be the only left things.
I think I kind of like this solution better. 👍

bpintea · 2026-03-06T12:18:49Z

...k/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/analysis/rules/ResolveUnmapped.java

+        // For non-EsRelation sources (Row, LocalRelation): insert Eval nodes with null assignments
+        // This handles cases like: ROW x = 1 | EVAL y = unmapped_field
+        transformed = transformed.transformUp(
+            n -> n instanceof UnaryPlan unary && unary.child() instanceof LeafPlan leaf && (leaf instanceof EsRelation == false),


Nit: no need for ( )

bpintea · 2026-03-06T12:25:25Z

...k/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/analysis/rules/ResolveUnmapped.java

+        List<LogicalPlan> newChildren = new ArrayList<>(nAry.children().size());
+        boolean changed = false;
+        for (var child : nAry.children()) {
+            if (child instanceof LeafPlan source && (source instanceof EsRelation == false)) {


Also here, ( ) are redundant.

luigidellaquila · 2026-03-06T16:36:42Z

Thanks @bpintea!

luigidellaquila · 2026-03-06T17:21:08Z

~~Running the golden tests I noticed a strange side effect. A query like~~

SET unmapped_fields="nullify";
FROM idx 
| KEEP field, does_not_exist 
| WHERE field == "foo" OR does_not_exist::KEYWORD == "foo"

returns does_not_exist as a keyword type (and not as a null type).
If I add another condition casting to another type. eg does_not_exist::KEYWORD == "foo" OR does_not_exist::INTEGER == 3, I get null again.

~~I think it's an interaction with union types~~
~~I need to investigate it a bit before moving forward, but @bpintea @alex-spies if you have a clue please let me know.~~

Sorry, ignore all the above, I don't know what I was looking at.
It's all fine, it always returns a null type; it's just that it's Friday afternoon and I probably need a break.

)

ES|QL: Fix KQL/QSTR with unmapped fields in NULLIFY mode

ca28232

luigidellaquila requested review from alex-spies and astefan March 2, 2026 16:12

luigidellaquila added >bug :Analytics/ES|QL AKA ESQL labels Mar 2, 2026

elasticsearchmachine added v9.4.0 Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) labels Mar 2, 2026

Update docs/changelog/143399.yaml

1fc4dfa

luigidellaquila mentioned this pull request Mar 3, 2026

Fix full text functions with nullify #143231

Closed

Merge branch 'main' into esql/fix_kql_nullify

c8a28dc

Merge branch 'main' into esql/fix_kql_nullify

f76b8bb

alex-spies requested review from bpintea and removed request for alex-spies March 4, 2026 10:31

luigidellaquila and others added 6 commits March 4, 2026 12:09

Merge branch 'main' into esql/fix_kql_nullify

7b2294e

Merge branch 'main' into esql/fix_kql_nullify

77c0744

Merge branch 'main' into esql/fix_kql_nullify

9e2df80

Merge branch 'main' into esql/fix_kql_nullify

26704c3

Add MissingEsField

dbb3699

Merge branch 'main' into esql/fix_kql_nullify

adce0b1

bpintea approved these changes Mar 6, 2026

View reviewed changes

Merge branch 'main' into esql/fix_kql_nullify

f521f1b

luigidellaquila enabled auto-merge (squash) March 6, 2026 16:37

luigidellaquila disabled auto-merge March 6, 2026 17:17

Fix golden tests

748a64a

luigidellaquila enabled auto-merge (squash) March 6, 2026 17:29

Merge branch 'main' into esql/fix_kql_nullify

3677225

luigidellaquila merged commit d22c50a into elastic:main Mar 9, 2026
34 of 36 checks passed

prwhelan mentioned this pull request Mar 9, 2026

[Transform] Disable PIT for CPS #143876

Closed

idegtiarenko mentioned this pull request Mar 9, 2026

ESQL: unmapped_field nullify after KQL commands results in an incorrect error message #142705

Closed

tfcmarques mentioned this pull request Mar 10, 2026

[Discover][Traces] Update RED metrics queries to work with KQL WHERE clause elastic/kibana#252259

Closed

This was referenced Mar 11, 2026

ESQL: Small nullify fix cleanups #143998

Merged

ESQL: Fix incorrectly optimized fork with nullify unmapped_fields #143030

Merged

luigidellaquila added a commit to luigidellaquila/elasticsearch that referenced this pull request Mar 13, 2026

ES|QL: transport version to backport elastic#143399

4bb55a7

luigidellaquila mentioned this pull request Mar 13, 2026

ES|QL: transport version to backport #143399 #144232

Merged

luigidellaquila added a commit to luigidellaquila/elasticsearch that referenced this pull request Mar 13, 2026

ES|QL: Fix KQL/QSTR with unmapped fields in NULLIFY mode (elastic#143399

b69c225

)

luigidellaquila added a commit to luigidellaquila/elasticsearch that referenced this pull request Mar 13, 2026

ES|QL: transport version to backport elastic#143399

5076ea5

luigidellaquila mentioned this pull request Mar 13, 2026

Backport 143399 #144235

Merged

luigidellaquila added a commit that referenced this pull request Mar 14, 2026

ES|QL: transport version to backport #143399 (#144232)

7a4e493

alex-spies mentioned this pull request Mar 16, 2026

[9.3] ESQL: Fix incorrectly optimized fork with nullify unmapped_fields (#143030) #144205

Closed

ncordon pushed a commit to ncordon/elasticsearch that referenced this pull request Mar 16, 2026

ES|QL: transport version to backport elastic#143399 (elastic#144232)

507f31d

michalborek pushed a commit to michalborek/elasticsearch that referenced this pull request Mar 23, 2026

ES|QL: transport version to backport elastic#143399 (elastic#144232)

66e8d21

Conversation

luigidellaquila commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Mar 2, 2026

Uh oh!

elasticsearchmachine commented Mar 2, 2026

Uh oh!

alex-spies commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alex-spies commented Mar 3, 2026

Uh oh!

luigidellaquila commented Mar 3, 2026

Uh oh!

alex-spies commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

luigidellaquila commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alex-spies commented Mar 4, 2026

Uh oh!

coderabbitai bot commented Mar 6, 2026

Review skipped

Uh oh!

bpintea left a comment

Choose a reason for hiding this comment

Uh oh!

bpintea Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

bpintea Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

luigidellaquila commented Mar 6, 2026

Uh oh!

luigidellaquila commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

luigidellaquila commented Mar 2, 2026 •

edited

Loading

alex-spies commented Mar 3, 2026 •

edited

Loading

alex-spies commented Mar 3, 2026 •

edited

Loading

luigidellaquila commented Mar 3, 2026 •

edited

Loading

luigidellaquila commented Mar 6, 2026 •

edited

Loading