Improves visibility of vector index options and inference configuration#141653
Improves visibility of vector index options and inference configuration#141653kosabogi merged 18 commits intoelastic:mainfrom
Conversation
|
Pinging @elastic/core-docs (Team:Docs) |
🔍 Preview links for changed docs |
ℹ️ Important: Docs version tagging👋 Thanks for updating the docs! Just a friendly reminder that our docs are now cumulative. This means all 9.x versions are documented on the same page and published off of the main branch, instead of creating separate pages for each minor version. We use applies_to tags to mark version-specific features and changes. Expand for a quick overviewWhen to use applies_to tags:✅ At the page level to indicate which products/deployments the content applies to (mandatory) What NOT to do:❌ Don't remove or replace information that applies to an older version 🤔 Need help?
|
Mikep86
left a comment
There was a problem hiding this comment.
Nice work! I left some comments on some things we should expand on in the semantic text documentation.
docs/reference/elasticsearch/mapping-reference/semantic-text.md
Outdated
Show resolved
Hide resolved
docs/reference/elasticsearch/mapping-reference/semantic-text.md
Outdated
Show resolved
Hide resolved
Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
|
Thank you for your review, @Mikep86 ! Based on your suggestions, I prepared a plan for continuing this work. I’d like to address the following items in this PR: Default indexing strategy (version-specific)9.0:
Is this correct, right? semantic_text index_options section
“To learn more about available index options and how they affect vector quantization…” Full semantic_text example with all optional parametersKeep the current example as a Basic semantic_text mapping example |
Mikep86
left a comment
There was a problem hiding this comment.
Layout and content looks good 🚀 ! I spotted two inaccuracies we should fix before we merge.
docs/reference/elasticsearch/mapping-reference/semantic-text-setup-configuration.md
Outdated
Show resolved
Hide resolved
docs/reference/elasticsearch/mapping-reference/semantic-text-setup-configuration.md
Outdated
Show resolved
Hide resolved
Mikep86
left a comment
There was a problem hiding this comment.
LGTM, thanks for the iterations!
szabosteve
left a comment
There was a problem hiding this comment.
Left a few suggestions, mostly to remove legacy snippet test comments, and a question.
docs/reference/elasticsearch/mapping-reference/semantic-text-setup-configuration.md
Outdated
Show resolved
Hide resolved
| "prune": true, <1> | ||
| "pruning_config": { | ||
| "tokens_freq_ratio_threshold": 10, <2> | ||
| "tokens_weight_threshold": 0.4 <3> |
There was a problem hiding this comment.
Just to double check. Should it be 0.4 or 0.5?
There was a problem hiding this comment.
Good point! I meant to write 0.5. I've fixed it in my latest commit.
docs/reference/elasticsearch/mapping-reference/semantic-text.md
Outdated
Show resolved
Hide resolved
docs/reference/elasticsearch/mapping-reference/semantic-text.md
Outdated
Show resolved
Hide resolved
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
…etup-configuration.md Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
…on-sliced-reindex * upstream/main: Update docs for v9.3.1 release (elastic#142887) Update docs for v9.2.6 release (elastic#142888) Improves visibility of vector index options and inference configuration (elastic#141653) Disable CAE in microsoft-graph-authz plugin (elastic#142848) Small improvements to `GetSnapshotsIT#testAllFeatures` (elastic#142825) Fix IndexSettingsTests synthetic ID tests (elastic#142654) [Test] Unmute tests of SnapshotShutdownIT (elastic#142921) Fixing metrics_info.json kibana definition file name (elastic#142813) [Packaging] Disable glibc 2.43 malloc huge pages in Wolfi images (elastic#142894) Mute org.elasticsearch.xpack.searchablesnapshots.SearchableSnapshotsTSDBSyntheticIdIntegTests testSearchableSnapshot elastic#142918 Add shard heap usage to ClusterInfo (elastic#139557) ESQL: Load script fields row-by-row (elastic#142807) ESQL: Consolidate doc values memory tracking (elastic#142816) ES-14124 Create Index Count Limit User documentation Page (elastic#142570) Add a es819 codec test to verify tryRead returns null if may contain duplicates (elastic#142409) Support arithmetic operations for dense_vectors: scalar version (elastic#141060) [Transform] Allow project_routing (elastic#142421) Refactor query rewrite async actions for knn and sparse_vector queries (elastic#142889) Do not mark bulk indexing requests as retried after primary relocations (elastic#142157)
…on (elastic#141653) * Improve visibility of vector index options and inference configuration * Fixes link * Removes incorrect note * Update docs/reference/elasticsearch/mapping-reference/semantic-text.md Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co> * Adresses suggestions * Syntax fix * Fixes syntax, adds list * Applies suggestions * Update docs/reference/elasticsearch/mapping-reference/dense-vector.md Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co> * Update docs/reference/elasticsearch/mapping-reference/semantic-text-setup-configuration.md Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co> * Update docs/reference/elasticsearch/mapping-reference/semantic-text.md Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co> * Update docs/reference/elasticsearch/mapping-reference/semantic-text.md Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co> * Fixes syntax and code example --------- Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co> Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
This PR improves the discoverability of vector indexing and inference configuration in the docs by:
Refactoring the Automatically quantize vectors for kNN search section to lead with an example and include a concrete bbq_disk configuration.
Extending the first dense_vector mapping example to show index_options, with a numbered annotation linking to the quantization section.
Expanding the semantic_text example to set an inference_id and include index_options, linking back to the dense_vector quantization docs.
Relates to elastic/docs-content#3804 and elastic/docs-content#4832