Skip to content

Improves visibility of vector index options and inference configuration#141653

Merged
kosabogi merged 18 commits intoelastic:mainfrom
kosabogi:dense_vector_sem_text_docs_improvements
Feb 24, 2026
Merged

Improves visibility of vector index options and inference configuration#141653
kosabogi merged 18 commits intoelastic:mainfrom
kosabogi:dense_vector_sem_text_docs_improvements

Conversation

@kosabogi
Copy link
Copy Markdown
Member

@kosabogi kosabogi commented Feb 2, 2026

This PR improves the discoverability of vector indexing and inference configuration in the docs by:

  • Refactoring the Automatically quantize vectors for kNN search section to lead with an example and include a concrete bbq_disk configuration.

  • Extending the first dense_vector mapping example to show index_options, with a numbered annotation linking to the quantization section.

  • Expanding the semantic_text example to set an inference_id and include index_options, linking back to the dense_vector quantization docs.

Relates to elastic/docs-content#3804 and elastic/docs-content#4832

@kosabogi kosabogi requested review from Mikep86 and kderusso February 2, 2026 12:21
@kosabogi kosabogi self-assigned this Feb 2, 2026
@kosabogi kosabogi added >docs General docs changes Team:Docs Meta label for docs team labels Feb 2, 2026
@elasticsearchmachine
Copy link
Copy Markdown
Collaborator

Pinging @elastic/core-docs (Team:Docs)

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Feb 2, 2026

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Feb 2, 2026

ℹ️ Important: Docs version tagging

👋 Thanks for updating the docs! Just a friendly reminder that our docs are now cumulative. This means all 9.x versions are documented on the same page and published off of the main branch, instead of creating separate pages for each minor version.

We use applies_to tags to mark version-specific features and changes.

Expand for a quick overview

When to use applies_to tags:

✅ At the page level to indicate which products/deployments the content applies to (mandatory)
✅ When features change state (e.g. preview, ga) in a specific version
✅ When availability differs across deployments and environments

What NOT to do:

❌ Don't remove or replace information that applies to an older version
❌ Don't add new information that applies to a specific version without an applies_to tag
❌ Don't forget that applies_to tags can be used at the page, section, and inline level

🤔 Need help?

@kosabogi kosabogi requested a review from a team February 2, 2026 16:35
@kderusso kderusso removed their request for review February 3, 2026 13:00
Copy link
Copy Markdown
Contributor

@Mikep86 Mikep86 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work! I left some comments on some things we should expand on in the semantic text documentation.

Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
@kosabogi
Copy link
Copy Markdown
Member Author

kosabogi commented Feb 11, 2026

Thank you for your review, @Mikep86 !

Based on your suggestions, I prepared a plan for continuing this work. I’d like to address the following items in this PR:

Default indexing strategy (version-specific)

9.0:
Default method for dense_vector fields: int8_hnsw
9.1:
Default method for dense_vector fields:

  • Vectors ≥ 384 dimensions: bbq_hnsw
  • Vectors < 384 dimensions: int8_hnsw

Is this correct, right?

semantic_text index_options section

  1. Create new sections under Set up and configure, including configuration examples and explanations for both:
  • Set index_options for sparse_vectors
  • Set index_options for dense_vectors
  1. Update this sentence:

“To learn more about available index options and how they affect vector quantization…”
Instead of linking to the dense_vector quantization page, link to the newly added semantic_text section.

Full semantic_text example with all optional parameters

Keep the current example as a Basic semantic_text mapping example
Add a separate Extended semantic_text mapping example that specifies all optional parameters of semantic_text

@kosabogi kosabogi marked this pull request as draft February 11, 2026 12:39
@kosabogi kosabogi requested a review from Mikep86 February 13, 2026 10:36
@kosabogi kosabogi marked this pull request as ready for review February 13, 2026 10:36
Copy link
Copy Markdown
Contributor

@Mikep86 Mikep86 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Layout and content looks good 🚀 ! I spotted two inaccuracies we should fix before we merge.

@kosabogi kosabogi requested review from a team and Mikep86 February 18, 2026 14:11
Copy link
Copy Markdown
Contributor

@Mikep86 Mikep86 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks for the iterations!

Copy link
Copy Markdown
Contributor

@szabosteve szabosteve left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left a few suggestions, mostly to remove legacy snippet test comments, and a question.

"prune": true, <1>
"pruning_config": {
"tokens_freq_ratio_threshold": 10, <2>
"tokens_weight_threshold": 0.4 <3>
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just to double check. Should it be 0.4 or 0.5?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point! I meant to write 0.5. I've fixed it in my latest commit.

@kosabogi kosabogi merged commit e86a8a1 into elastic:main Feb 24, 2026
12 checks passed
szybia added a commit to szybia/elasticsearch that referenced this pull request Feb 24, 2026
…on-sliced-reindex

* upstream/main:
  Update docs for v9.3.1 release (elastic#142887)
  Update docs for v9.2.6 release (elastic#142888)
  Improves visibility of vector index options and inference configuration (elastic#141653)
  Disable CAE in microsoft-graph-authz plugin (elastic#142848)
  Small improvements to `GetSnapshotsIT#testAllFeatures` (elastic#142825)
  Fix IndexSettingsTests synthetic ID tests (elastic#142654)
  [Test] Unmute tests of SnapshotShutdownIT (elastic#142921)
  Fixing metrics_info.json kibana definition file name (elastic#142813)
  [Packaging] Disable glibc 2.43 malloc huge pages in Wolfi images (elastic#142894)
  Mute org.elasticsearch.xpack.searchablesnapshots.SearchableSnapshotsTSDBSyntheticIdIntegTests testSearchableSnapshot elastic#142918
  Add shard heap usage to ClusterInfo (elastic#139557)
  ESQL: Load script fields row-by-row (elastic#142807)
  ESQL: Consolidate doc values memory tracking (elastic#142816)
  ES-14124  Create Index Count Limit User documentation Page (elastic#142570)
  Add a es819 codec test to verify tryRead returns null if may contain duplicates (elastic#142409)
  Support arithmetic operations for dense_vectors: scalar version (elastic#141060)
  [Transform] Allow project_routing (elastic#142421)
  Refactor query rewrite async actions for knn and sparse_vector queries (elastic#142889)
  Do not mark bulk indexing requests as retried after primary relocations (elastic#142157)
sidosera pushed a commit to sidosera/elasticsearch that referenced this pull request Feb 24, 2026
…on (elastic#141653)

* Improve visibility of vector index options and inference configuration

* Fixes link

* Removes incorrect note

* Update docs/reference/elasticsearch/mapping-reference/semantic-text.md

Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>

* Adresses suggestions

* Syntax fix

* Fixes syntax, adds list

* Applies suggestions

* Update docs/reference/elasticsearch/mapping-reference/dense-vector.md

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Update docs/reference/elasticsearch/mapping-reference/semantic-text-setup-configuration.md

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Update docs/reference/elasticsearch/mapping-reference/semantic-text.md

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Update docs/reference/elasticsearch/mapping-reference/semantic-text.md

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Fixes syntax and code example

---------

Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

>docs General docs changes Team:Docs Meta label for docs team v9.4.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants