Update semantic search CCS to support generic query vector builders by Mikep86 · Pull Request #142254 · elastic/elasticsearch

Mikep86 · 2026-02-10T21:11:44Z

The current semantic search CCS implementation assumes that the only QueryVectorBuilder implementation is TextEmbeddingQueryVectorBuilder. However, this will soon not be the case. There is already a PR to add a new "lookup" query vector builder, and more variants may be added later.

This PR updates the semantic search CCS implementation to support generic query vector builders. The high-level approach is:

- Unless handling a special case, assume any given QueryVectorBuilder can be used to build a query vector on the coordinator node. If it cannot, the query vector builder's buildVector method is responsible for throwing an exception.
- Add logic to intercepted knn queries to rewrite a query vector builder to a query vector on the coordinator node. The resulting query vector is stored in the original query that was intercepted.
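
The two points above can be illustrated with a minimal sketch. Everything in it is hypothetical: `VectorBuilder`, `KnnQuery`, and `rewrite` are simplified stand-ins invented for illustration, not the Elasticsearch classes, and the callbacks run synchronously whereas the real rewrite is asynchronous. It only shows the contract described here: the builder either yields a vector or reports why it cannot, and the resolved vector is stored in the query.

```java
import java.util.function.Consumer;

final class CoordinatorRewriteSketch {

    /** Hypothetical stand-in for a query vector builder; not the Elasticsearch interface. */
    interface VectorBuilder {
        // Reports the vector, or the reason no vector can be built.
        void buildVector(Consumer<float[]> onVector, Consumer<Exception> onFailure);
    }

    /** Hypothetical knn query that holds either a builder or a resolved vector. */
    static final class KnnQuery {
        final VectorBuilder builder;
        final float[] queryVector;

        KnnQuery(VectorBuilder builder, float[] queryVector) {
            this.builder = builder;
            this.queryVector = queryVector;
        }
    }

    /** Resolves the builder and returns a copy of the query that carries the vector. */
    static KnnQuery rewrite(KnnQuery original) {
        if (original.builder == null) {
            return original; // already holds a vector, nothing to resolve
        }
        float[][] vector = new float[1][];
        Exception[] failure = new Exception[1];
        // The callbacks run synchronously here; the real rewrite is asynchronous.
        original.builder.buildVector(v -> vector[0] = v, e -> failure[0] = e);
        if (failure[0] != null) {
            throw new IllegalStateException("query vector builder could not build a vector", failure[0]);
        }
        return new KnnQuery(null, vector[0]);
    }
}
```

In this sketch no builder type needs to be known in advance: a builder that cannot produce a vector surfaces its own error instead of being validated up front.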

TextEmbeddingQueryVectorBuilders still need special handling, because they can be used for semantic search even when incomplete. When a TextEmbeddingQueryVectorBuilder does not specify an inference ID, we extract the model text from it and use that to generate the inference results necessary to query the specified semantic_text field(s).
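
As a rough illustration of the incomplete-builder case, the sketch below decides which inference IDs the model text would have to be run against. The class, the method name `inferenceIdsToInfer`, and the field-to-inference-ID map are assumptions made up for this example; they do not mirror InferenceQueryUtils or any real Elasticsearch API.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

final class InferenceIdResolutionSketch {

    /**
     * Returns the inference IDs that must be inferred from the queried fields.
     * Hypothetical helper: when the builder names its own model, it is complete
     * and builds its own vector, so nothing is inferred from the fields.
     */
    static Set<String> inferenceIdsToInfer(
        String builderModelId,
        List<String> queriedFields,
        Map<String, String> fieldToInferenceId
    ) {
        Set<String> inferenceIds = new LinkedHashSet<>();
        if (builderModelId != null) {
            return inferenceIds; // complete builder, no field-based inference
        }
        for (String field : queriedFields) {
            String inferenceId = fieldToInferenceId.get(field);
            if (inferenceId != null) {
                inferenceIds.add(inferenceId); // the set removes duplicates across fields
            }
        }
        return inferenceIds;
    }
}
```

Fields that share an inference ID contribute it once, so the model text would be embedded once per distinct ID rather than once per field.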

This approach also allows us to simplify the intercepted query logic overall. We can remove the concept of a general "inference ID override" and replace it with custom coordinator node rewrite actions that generate query vectors as necessary.

Mikep86 · 2026-02-10T21:27:08Z

@elasticmachine update branch

server/src/main/java/org/elasticsearch/search/vectors/QueryVectorBuilder.java

benwtrent · 2026-02-11T20:38:14Z

...ava/org/elasticsearch/xpack/inference/queries/InterceptedInferenceKnnVectorQueryBuilder.java

+            // Other query vector builder types always require validation
+            queryVectorBuilder.validate();


We don't need this new interface. You still do the instanceof check before it, so it isn't really providing anything.

Mikep86 · Feb 11, 2026

Hmmm, I think I see how to do that. Some special handling would be required for TextEmbeddingQueryVectorBuilder, but all other query vector builders would be assumed to be complete and able to build vectors.

benwtrent · Feb 11, 2026

The only special handling is that we inject the model into TextEmbeddingQueryVectorBuilder, right? If that's the case, that is the only thing that ever requires special handling. Everything else should be treated as "it should just work and if you don't give us a vector, shame on you".

Mikep86 · Feb 11, 2026

Essentially yes. The special handling for TextEmbeddingQueryVectorBuilder is that if a model ID isn't specified, we pull the model text from it and then infer the inference ID(s) from the semantic text fields queried.

All other query vector builders should "just work" though, and if they don't, they should throw an error when buildVector is called.

benwtrent · Feb 12, 2026

++ yes, I think this is good. I think the logic is that "rewrite as normal if NOT TextEmbeddingQueryVectorBuilder and if you are, let's handle our special case"

Mikep86 · Feb 13, 2026

Done in c3da5be

elasticsearchmachine · 2026-02-13T20:20:56Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

benwtrent

@Mikep86 it's difficult to parse what is "refactoring" vs. actually fixing the bug. Could you extract the refactoring pieces that don't adjust behavior at all into their own PR? This SEEMS like the new "QueryRewriteAction" things.

Mikep86 · 2026-02-23T16:15:06Z

@benwtrent Sure, but the refactoring pieces would just be moving the QueryRewriteAsyncAction extensions into dedicated files

benwtrent · 2026-02-24T14:48:02Z

...ava/org/elasticsearch/xpack/inference/queries/InterceptedInferenceKnnVectorQueryBuilder.java

+        QueryVectorBuilder queryVectorBuilder = originalQuery.queryVectorBuilder();
+        if (queryVectorBuilder != null) {
+            boolean registerAction = false;
+            if (queryVectorBuilder instanceof TextEmbeddingQueryVectorBuilder tevb) {
+                // TextEmbeddingQueryVectorBuilder is a special case. If a model ID is set, we register an action to generate
+                // the query vector. If not, the model text will be returned via getQuery() so that InferenceQueryUtils can
+                // generate the appropriate inference results for the inferred inference ID(s).
+                if (tevb.getModelId() != null) {
+                    registerAction = true;
+                }
+            } else {
+                // We register an action to generate the query vector for all other query vector builders. If they cannot, buildVector()
+                // should throw an error indicating why.
+                registerAction = true;
+            }
+
+            if (registerAction) {
+                SetOnce<float[]> newQueryVectorSupplier = new SetOnce<>();
+                queryRewriteContext.registerUniqueAsyncAction(
+                    new QueryVectorBuilderAsyncAction(queryVectorBuilder),
+                    newQueryVectorSupplier::set
+                );
+                return new InterceptedInferenceKnnVectorQueryBuilder(queryBuilder, originalQuery, newQueryVectorSupplier);
+            }
+        }
+
+        return queryBuilder;
+    }
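
The hunk above hands a write-once supplier to the rewritten query and lets a registered async action fill it in. The following is a minimal sketch of that pattern under stated assumptions: `WriteOnce` is a simplified stand-in for the `SetOnce` used in the diff, `RewriteContext` and `registerUniqueAction` are hypothetical, and the reading of "unique" as "equal actions run once and every registered consumer receives the result" is an assumption, not a statement about how `registerUniqueAsyncAction` behaves.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;
import java.util.function.Supplier;

final class AsyncActionSketch {

    /** Write-once holder; a simplified stand-in for the SetOnce used in the diff. */
    static final class WriteOnce<T> {
        private T value;
        private boolean assigned;

        void set(T newValue) {
            if (assigned) {
                throw new IllegalStateException("value already set");
            }
            value = newValue;
            assigned = true;
        }

        T get() {
            return value; // null until the action has completed
        }
    }

    /** Hypothetical rewrite context: actions with the same key run once (assumed semantics). */
    static final class RewriteContext {
        private final Map<String, Supplier<float[]>> actions = new LinkedHashMap<>();
        private final Map<String, List<Consumer<float[]>>> consumers = new LinkedHashMap<>();
        int executions = 0;

        void registerUniqueAction(String key, Supplier<float[]> action, Consumer<float[]> consumer) {
            actions.putIfAbsent(key, action); // keep the first action registered for this key
            consumers.computeIfAbsent(key, k -> new ArrayList<>()).add(consumer);
        }

        /** Runs each distinct action once and passes its result to every consumer. */
        void executeAll() {
            for (Map.Entry<String, Supplier<float[]>> entry : actions.entrySet()) {
                float[] result = entry.getValue().get();
                executions++;
                for (Consumer<float[]> consumer : consumers.get(entry.getKey())) {
                    consumer.accept(result);
                }
            }
            actions.clear();
            consumers.clear();
        }
    }
}
```

A rewritten query would keep its `WriteOnce` holder, return itself unchanged while `get()` is still null, and switch to the concrete vector on the next rewrite round once the action has run.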


benwtrent

Looking at the knn-specific things, this looks good.

I don't know anything about the changes to the InferenceQueryUtils class, but they seem to fall out from getInferenceIdOverride. I am not sure why we ever needed that interface. Where does the logic reside now? Was it ever needed for the fqdn inference id?

Mikep86 · 2026-02-24T15:36:10Z

@benwtrent

> I don't know anything about the changes to the InferenceQueryUtils class, but they seem to fall out from getInferenceIdOverride. I am not sure why we ever needed that interface. Where does the logic reside now? Was it ever needed for the fqdn inference id?

Inference ID override was how we used to handle the case where the user provided an inference ID to the query vector builder or sparse_vector query. Basically, InferenceQueryUtils would always perform query-time inference for intercepted queries, even for "complete" query vector builders. If the user didn't provide an inference ID, we inferred it from the semantic text field(s) queried. If they did, we set the override.

This has been simplified with this updated implementation. Now, InferenceQueryUtils is only used for query-time inference when inferring the inference ID(s) from semantic_text fields. In cases where the user explicitly provides an inference ID, we handle query-time inference directly in the intercepted query. Thus, we can remove the concept of an inference ID override in InferenceQueryUtils.

Mikep86 · 2026-02-25T19:07:21Z

@elasticmachine update branch

Mikep86 added 15 commits February 9, 2026 11:08

Added validate method to QueryVectorBuilder

6fbfcd1

Validate query vector builder on query text extraction

c74768c

Updated intercepted knn query to rewrite query vector builders

b753500

Spotless

8ee350a

Updated intercepted sparse_vector query to get query vectors

da4abb1

Added a query vector supplier to InterceptedInferenceKnnVectorQueryBuilder

045b4ea

Added a query vector supplier to InterceptedInferenceSparseVectorQueryBuilder

d289a55

Updated InterceptedInferenceSparseVectorQueryBuilder pre and post inference validation logic

0b15c6f

Updated InterceptedInferenceKnnVectorQueryBuilder pre and post inference validation logic

f5fca6e

Remove inference ID override from InferenceQueryUtils

fde692e

Keep query vector builder while using query vector supplier

37568ae

Fixed InterceptedInferenceKnnVectorQueryBuilderTests

c15b3fe

Fixed InterceptedInferenceSparseVectorQueryBuilderTests

fcf4b7d

Simplify test

02eedb7

Remove inference ID override logic

31bb0c8

Mikep86 requested review from a team and benwtrent February 10, 2026 21:11

Mikep86 added the >non-issue, :Search Relevance/Vectors (Vector search), and v9.4.0 labels Feb 10, 2026

This was referenced Feb 10, 2026

Add a new "lookup" query vector builder #141488

Merged

Update intercepted queries to support generic query vector builders #142141

Closed

Merge branch 'main' into semantic-search_support-generic-query-vector-builders

13aa7a1

Mikep86 commented Feb 11, 2026

server/src/main/java/org/elasticsearch/search/vectors/QueryVectorBuilder.java (outdated)

benwtrent reviewed Feb 11, 2026

Mikep86 added 4 commits February 13, 2026 11:12

Remove validate method from QueryVectorBuilder interface

c3da5be

Merge branch 'main' into semantic-search_support-generic-query-vector-builders

726c985

Update comment

5b20526

Added GenericQueryVectorBuilder

3c9690d

Mikep86 added 5 commits February 13, 2026 12:47

Added test cases to KnnVectorQueryBuilderCrossClusterSearchIT

c965152

Moved GenericQueryVectorBuilder

bb66639

Added GenericQueryVectorBuilder tests to InterceptedInferenceKnnVectorQueryBuilderTests

2f350ca

Revert changes to buildVector

9c24f19

Remove TODO

51a4021

Mikep86 marked this pull request as ready for review February 13, 2026 20:20

elasticsearchmachine added the Team:Search Relevance label (Meta label for the Search Relevance team in Elasticsearch) Feb 13, 2026

benwtrent reviewed Feb 23, 2026

Mikep86 mentioned this pull request Feb 23, 2026

Refactor query rewrite async actions for knn and sparse_vector queries #142889

Merged

Mikep86 added 2 commits February 23, 2026 16:20

Merge branch 'main' into semantic-search_support-generic-query-vector-builders

39c26fe

Fix build error

9d9da9c

Mikep86 requested a review from benwtrent February 24, 2026 12:53

benwtrent reviewed Feb 24, 2026

Merge branch 'main' into semantic-search_support-generic-query-vector-builders

4450f2e

benwtrent approved these changes Feb 25, 2026

Mikep86 merged commit 798eb10 into elastic:main Feb 25, 2026
36 checks passed

Mikep86 mentioned this pull request Apr 1, 2026

Intercepted knn & sparse_vector queries sometimes register multiple async action consumers #145444

Closed

4 participants

Mikep86 commented Feb 10, 2026 •

edited

Loading

Mikep86 Feb 11, 2026 •

edited

Loading