Update text_similarity_rank_retriever to support inference ID as an argument when reranking on chunks by mridula-s109 · Pull Request #137397 · elastic/elasticsearch

mridula-s109 · 2025-10-30T17:54:10Z

Adding support to automatically use the best chunking size based on the input 'inference_id' provided.

Tested these below scenarios

# Full chunking control
POST my-index/_search
{
  "retriever": {
    "text_similarity_reranker": {
      "retriever": { "standard": { "query": { "match": { "text": "query" } } } },
      "field": "text",
      "inference_id": ".rerank-v1-elasticsearch",
      "inference_text": "search query",
      "chunk_rescorer": {
        "size": 3,
        "chunking_settings": {
          "strategy": "sentence",
          "max_chunk_size": 50,
          "sentence_overlap": 0
        }
      },
      "rank_window_size": 10
    }
  }
}

# Partial chunking (only max_chunk_size)
POST my-index/_search
{
  "retriever": {
    "text_similarity_reranker": {
      "retriever": { "standard": { "query": { "match": { "text": "query" } } } },
      "field": "text",
      "inference_id": ".rerank-v1-elasticsearch",
      "inference_text": "search query",
      "chunk_rescorer": {
        "size": 3,
        "chunking_settings": {
          "max_chunk_size": 100
        }
      },
      "rank_window_size": 10
    }
  }
}

# Auto-resolve to defaults
POST my-index/_search
{
  "retriever": {
    "text_similarity_reranker": {
      "retriever": { "standard": { "query": { "match": { "text": "query" } } } },
      "field": "text",
      "inference_id": ".rerank-v1-elasticsearch",
      "inference_text": "search query",
      "chunk_rescorer": {
        "size": 3
      },
      "rank_window_size": 10
    }
  }
}

# No chunking
POST my-index/_search
{
  "retriever": {
    "text_similarity_reranker": {
      "retriever": { "standard": { "query": { "match": { "text": "query" } } } },
      "field": "text",
      "inference_id": ".rerank-v1-elasticsearch",
      "inference_text": "search query",
      "rank_window_size": 10
    }
  }
}

mridula-s109 · 2025-10-30T18:31:35Z

@kderusso, this is a WIP, would love to hear your thoughts on the POC when you have a moment.

kderusso

I performed a very high level review of the approach. Please update this PR with the suggested changes.

server/src/main/java/org/elasticsearch/TransportVersions.java

...e/src/main/java/org/elasticsearch/xpack/inference/rank/textsimilarity/ChunkScorerConfig.java

...rg/elasticsearch/xpack/inference/rank/textsimilarity/TextSimilarityRankRetrieverBuilder.java

...pack/inference/rank/textsimilarity/TextSimilarityRankFeaturePhaseRankCoordinatorContext.java

mridula-s109 · 2025-11-12T16:19:00Z

I am still unable to run the yaml tests locally, but opening up for early feedback! Work in progress.

kderusso

High level functional review, still needs to be addressed.

docs/changelog/137397.yaml

...e/src/main/java/org/elasticsearch/xpack/inference/rank/textsimilarity/ChunkScorerConfig.java

...k/inference/rank/textsimilarity/TextSimilarityRerankingRankFeaturePhaseRankShardContext.java

.../test/java/org/elasticsearch/xpack/inference/rank/textsimilarity/ChunkScorerConfigTests.java

…he right chunking size

Copilot

Pull request overview

This PR enhances the text_similarity_rank_retriever to automatically determine optimal chunking settings based on the inference endpoint's window size when chunking settings are not explicitly provided. The retriever now queries the inference endpoint for its window size and uses it to configure chunking, while still allowing users to override these defaults with explicit settings.

Key changes:

Automatic resolution of chunking settings from inference endpoint window size when not explicitly provided
Support for partial chunking configuration where only max_chunk_size is specified
Introduction of async query rewriting to fetch window size from inference endpoints

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
70_text_similarity_rank_retriever.yml	Added integration tests covering auto-resolution, partial configuration, and explicit override scenarios
TextSimilarityRankFeaturePhaseRankCoordinatorContextTests.java	Added unit tests for chunking settings resolution logic and removed obsolete tests
ChunkScorerConfigTests.java	New test file covering serialization, deserialization, and chunking settings creation
TextSimilarityRerankingRankFeaturePhaseRankShardContext.java	Added validation to ensure chunking settings are resolved before shard execution
TextSimilarityRankRetrieverBuilder.java	Implemented async query rewriting to fetch window size and resolve chunking settings
TextSimilarityRankFeaturePhaseRankCoordinatorContext.java	Added chunking settings resolution logic and integrated window size fetching
ChunkScorerConfig.java	Modified to support null chunking settings and improved settings creation methods
InferenceFeatures.java	Added feature flag for the new chunking behavior
137397.yaml	Added changelog entry for the enhancement

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

...e/src/main/java/org/elasticsearch/xpack/inference/rank/textsimilarity/ChunkScorerConfig.java

...rg/elasticsearch/xpack/inference/rank/textsimilarity/TextSimilarityRankRetrieverBuilder.java

mridula-s109 · 2025-12-16T22:35:05Z

@kderusso Thanks for your patience! I have addressed the comments and also made sure the functionality is working as intended for different reranking models. Please let me know if there are any concerns or optimisation needed.

kderusso

Looks good, thanks for all your hard work iterating!

Can you please also update https://github.com/elastic/elasticsearch/blob/main/docs/reference/elasticsearch/rest-apis/retrievers/text-similarity-reranker-retriever.md to say that we default to chunking settings that will fit into the model associated with inference_id's token window? (This can be done as a followup if you want).

...rc/yamlRestTest/resources/rest-api-spec/test/inference/70_text_similarity_rank_retriever.yml

github-actions · 2025-12-19T12:43:18Z

🔍 Preview links for changed docs

docs/reference/elasticsearch/rest-apis/retrievers/text-similarity-reranker-retriever.md

github-actions · 2025-12-19T12:43:19Z

ℹ️ Important: Docs version tagging

👋 Thanks for updating the docs! Just a friendly reminder that our docs are now cumulative. This means all 9.x versions are documented on the same page and published off of the main branch, instead of creating separate pages for each minor version.

We use applies_to tags to mark version-specific features and changes.

Expand for a quick overview

When to use applies_to tags:

✅ At the page level to indicate which products/deployments the content applies to (mandatory)
✅ When features change state (e.g. preview, ga) in a specific version
✅ When availability differs across deployments and environments

What NOT to do:

❌ Don't remove or replace information that applies to an older version
❌ Don't add new information that applies to a specific version without an applies_to tag
❌ Don't forget that applies_to tags can be used at the page, section, and inline level

🤔 Need help?

Check out the cumulative docs guidelines
Reach out in the #docs Slack channel

mridula-s109 · 2025-12-19T13:22:39Z

Looks good, thanks for all your hard work iterating!

Can you please also update https://github.com/elastic/elasticsearch/blob/main/docs/reference/elasticsearch/rest-apis/retrievers/text-similarity-reranker-retriever.md to say that we default to chunking settings that will fit into the model associated with inference_id's token window? (This can be done as a followup if you want).

Thanks @kderusso for approving. I have addressed the doc update. Please do have a look and let me know if there are suggestions or happy to go ahead with the merge.

docs/reference/elasticsearch/rest-apis/retrievers/text-similarity-reranker-retriever.md

* upstream/main: Update text_similarity_rank_retriever to support inference ID as an argument when reranking on chunks (elastic#137397) Mute org.elasticsearch.test.rest.yaml.CssSearchYamlTestSuiteIT test {p0=search.retrievers/result-diversification/10_mmr_result_diversification_retriever/Test MMR result diversification multiple indexes} elastic#139826 Mute org.elasticsearch.index.mapper.SkipperSettingsTests testTSDBSkipperSettingDefaults elastic#139824 Unmute test fix elastic#129517 (elastic#139782) Add back support for deserializing old refresh token in test (elastic#139811) Add documentation for exponential_histogram field type (elastic#139684) make ES|QL sample CSV test looser (elastic#139814) Add `frozen_after` field to data stream lifecycle (elastic#139042) Quieten many `ERROR` logs to `WARN` (elastic#139799)

mridula-s109 added 2 commits October 30, 2025 17:52

Accept inference id in parser

cd45b91

Merge branch 'main' into add-inferenceid-support-textsimilarity

c879fcb

elasticsearchmachine added the v9.3.0 label Oct 30, 2025

mridula-s109 and others added 2 commits October 30, 2025 18:16

Wrote the functionality

bf58dce

[CI] Auto commit changes from spotless

d5ac95d

Merge branch 'main' into add-inferenceid-support-textsimilarity

53b4cae

kderusso requested changes Oct 30, 2025

View reviewed changes

mridula-s109 and others added 17 commits November 11, 2025 09:57

Removed unnecessary transport versioning

a7d1bfd

Merge conflict resolved

cb02634

Modified chunk scorer to allow null

65120ca

Merge conflict

3132025

Fixed compile error

02e4497

Merge branch 'main' into add-inferenceid-support-textsimilarity

3185b76

Unnecesary change reverted

ee92591

Nit - removed white space

c1a6aaf

[CI] Auto commit changes from spotless

4b10608

Fixed comments

55ca78d

Fixed null pointer

c472852

Fixed a NPE possibility

f7535e0

Merge branch 'main' into add-inferenceid-support-textsimilarity

05c6775

Unit tests added

8d7ded0

comitting to trigger a run

641481b

Merge branch 'main' into add-inferenceid-support-textsimilarity

6c6e4bc

[CI] Auto commit changes from spotless

df222f1

mridula-s109 marked this pull request as ready for review November 12, 2025 16:19

mridula-s109 requested a review from kderusso November 12, 2025 16:19

elasticsearchmachine added the needs:triage Requires assignment of a team area label label Nov 12, 2025

mridula-s109 added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Nov 12, 2025

Merge branch 'main' into add-inferenceid-support-textsimilarity

ae8a0e2

kderusso requested changes Dec 11, 2025

View reviewed changes

mridula-s109 added 2 commits December 16, 2025 20:54

Modified files to make sure the feature is working as intended with t…

69d0679

…he right chunking size

Merge branch 'main' into add-inferenceid-support-textsimilarity

f644263

mridula-s109 requested a review from Copilot December 16, 2025 20:56

Copilot AI reviewed Dec 16, 2025

View reviewed changes

...e/src/main/java/org/elasticsearch/xpack/inference/rank/textsimilarity/ChunkScorerConfig.java Show resolved Hide resolved

...rg/elasticsearch/xpack/inference/rank/textsimilarity/TextSimilarityRankRetrieverBuilder.java Outdated Show resolved Hide resolved

elasticsearchmachine and others added 3 commits December 16, 2025 21:04

[CI] Auto commit changes from spotless

7909164

Refactored/cleanedup code a bit

83fbccd

Merge branch 'main' into add-inferenceid-support-textsimilarity

138b4c4

mridula-s109 requested a review from kderusso December 16, 2025 22:34

kderusso approved these changes Dec 17, 2025

View reviewed changes

...rc/yamlRestTest/resources/rest-api-spec/test/inference/70_text_similarity_rank_retriever.yml Show resolved Hide resolved

elasticsearchmachine added v9.4.0 and removed v9.3.0 labels Dec 17, 2025

mridula-s109 added 2 commits December 19, 2025 12:41

Updated docs

8bb0b8c

Merge branch 'main' into add-inferenceid-support-textsimilarity

3677fc5

mridula-s109 added 2 commits December 19, 2025 12:47

Updated docs

067f775

Added applies to

e6f41d3

mridula-s109 force-pushed the add-inferenceid-support-textsimilarity branch from ef75bea to e6f41d3 Compare December 19, 2025 13:01

Applies to tag

6cec8eb

kderusso reviewed Dec 19, 2025

View reviewed changes

docs/reference/elasticsearch/rest-apis/retrievers/text-similarity-reranker-retriever.md Outdated Show resolved Hide resolved

mridula-s109 added 3 commits December 19, 2025 14:21

Updated doc to 9.2

e0f0aa3

Merge branch 'main' into add-inferenceid-support-textsimilarity

71675ea

Merge branch 'main' into add-inferenceid-support-textsimilarity

f63b488

mridula-s109 enabled auto-merge (squash) December 19, 2025 14:35

mridula-s109 merged commit e577424 into elastic:main Dec 19, 2025
35 checks passed

Conversation

mridula-s109 commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mridula-s109 commented Oct 30, 2025

Uh oh!

kderusso left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mridula-s109 commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kderusso left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

mridula-s109 commented Dec 16, 2025

Uh oh!

kderusso left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔍 Preview links for changed docs

Uh oh!

github-actions bot commented Dec 19, 2025

ℹ️ Important: Docs version tagging

When to use applies_to tags:

What NOT to do:

🤔 Need help?

Uh oh!

mridula-s109 commented Dec 19, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mridula-s109 commented Oct 30, 2025 •

edited

Loading

mridula-s109 commented Nov 12, 2025 •

edited

Loading

github-actions bot commented Dec 19, 2025 •

edited

Loading