-
Notifications
You must be signed in to change notification settings - Fork 25.8k
_uid should be indexed in Lucene in binary form, not base64 #18154
Copy link
Copy link
Closed
Labels
:Core/Infra/CoreCore issues without another labelCore issues without another label:Search Foundations/MappingIndex mappings, including merging and defining field typesIndex mappings, including merging and defining field typesTeam:Search FoundationsMeta label for the Search Foundations team in ElasticsearchMeta label for the Search Foundations team in Elasticsearchstalledv6.0.0-beta1
Description
@rmuir had this idea:
Today, when ES auto-generates an ID (TimeBasedUUIDGenerator.getBase64UUID), it uses 15 bytes, but then we immediately Base64 encode that to 20 bytes, a 33% "waste".
This is really a holdover from the past when Lucene could not index fully binary terms.
I think we should explore passing the raw binary form to Lucene instead? We could implement back-compat based on the version as of when the index was created.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
:Core/Infra/CoreCore issues without another labelCore issues without another label:Search Foundations/MappingIndex mappings, including merging and defining field typesIndex mappings, including merging and defining field typesTeam:Search FoundationsMeta label for the Search Foundations team in ElasticsearchMeta label for the Search Foundations team in Elasticsearchstalledv6.0.0-beta1
Type
Fields
Give feedbackNo fields configured for issues without a type.