Fix bulk scoring to process last batch instead of falling through to scalar tail by ldematte · Pull Request #145316 · elastic/elasticsearch

ldematte · 2026-03-31T11:36:56Z

This PR fixes a small issue in bulk scoring functions where the last batch of vectors was unnecessarily dropped to the single-vector tail loop.

Bulk loops used c + 2 * batches - 1 < count as the loop condition, which exits when there aren't enough vectors for both the current batch AND a next batch to prefetch. This means the last full batch (where there's no next batch to prefetch) was always processed one-by-one in the scalar tail.

This PR changes the loop condition to c + batches - 1 < count (process all full batches), and guard the prefetch with const bool has_next = c + 2 * batches - 1 < count. This pattern was already used in vec_i4_2.cpp (AVX-512 int4) — now applied consistently everywhere.

Also fixes > to >= in SIMD stride checks across all files, so that when dims equals exactly the stride length, we use the SIMD path instead of falling through to scalar.

Relates to #145411

Test plan

JDKVectorLibrary*Tests pass locally on Apple Silicon (aarch64)
JDKVectorLibraryInt8Tests pass on AMD c8a (x64 AVX-512)

elasticsearchmachine · 2026-03-31T11:39:19Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

…t-batch

ChrisHegarty

LGTM

libs/simdvec/native/src/vec/c/amd64/vec_2.cpp

…rics * upstream/main: (21 commits) Mute org.elasticsearch.xpack.esql.qa.mixed.MixedClusterEsqlSpecIT test {csv-spec:external-basic.topSnippetsFunction} elastic#145353 Mute org.elasticsearch.xpack.esql.qa.mixed.MixedClusterEsqlSpecIT test {csv-spec:external-basic.scoreFunction} elastic#145352 [DiskBBQ] Fix bug in NeighborQueue#popRawAndAddRaw (elastic#145324) Fix dense_vector default index options when using BFLOAT16 (elastic#145202) Use checked exceptions in entitlement constructor rules (elastic#145234) ESQL: DS: datasource file plugins should not return TEXT types (elastic#145334) Plumb DLM error store through to DlmFrozenTransition classes (elastic#145243) Make Settings.Builder.remove() fluent (elastic#145294) Add FLS tests for METRICS_INFO and TS_INFO (elastic#145211) Fix flaky SecurityFeatureResetTests (elastic#145063) [DOCS] Fix conflict markers in ESQL processing command list (elastic#145338) Skip certain metric assertions on Windows (elastic#144933) [ES|QL] Add schema reconciliation for multi-file external sources (elastic#145220) Simplify DiskBBQ dynamic visit ratio to linear (elastic#142784) ESQL: Disallow unmapped_fields=load with partial non-KEYWORD (elastic#144109) [Transform] Track Linked Projects (elastic#144399) Fix bulk scoring to process last batch instead of falling through to scalar tail (elastic#145316) Clean up TickerScheduleEngineTests (elastic#145303) [CI] ShardBulkInferenceActionFilterIT testRestart - Ensuring that secrets-inference index is available after full restart and unmuting test (elastic#145317) Add CRUD doc to the DistributedArchitectureGuide (elastic#144710) ...

…scalar tail (elastic#145316) This PR fixes a small issue in bulk scoring functions where the last batch of vectors was unnecessarily dropped to the single-vector tail loop. Bulk loops used c + 2 * batches - 1 < count as the loop condition, which exits when there aren't enough vectors for both the current batch AND a next batch to prefetch. This means the last full batch (where there's no next batch to prefetch) was always processed one-by-one in the scalar tail. This PR changes the loop condition to c + batches - 1 < count (process all full batches), and guard the prefetch with const bool has_next = c + 2 * batches - 1 < count. This pattern was already used in vec_i4_2.cpp (AVX-512 int4) — now applied consistently everywhere. Also fixes > to >= in SIMD stride checks across all files, so that when dims equals exactly the stride length, we use the SIMD path instead of falling through to scalar.

ldematte added 2 commits March 30, 2026 17:22

Fix: do not drop out of batch processing one batch too early

cd52d4d

Fix: SIMD process dims also when it is exactly the stride size

25094b2

elasticsearchmachine added v9.4.0 needs:triage Requires assignment of a team area label labels Mar 31, 2026

ldematte added :Search Relevance/Vectors Vector search >non-issue and removed needs:triage Requires assignment of a team area label labels Mar 31, 2026

ldematte requested review from ChrisHegarty and thecoop and removed request for thecoop March 31, 2026 11:38

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Mar 31, 2026

ldematte added 2 commits March 31, 2026 13:40

Merge remote-tracking branch 'upstream/main' into native/fix-bulk-las…

121a37d

…t-batch

Publish vec binaries + update version

b3ef3c8

ldematte requested a review from a team as a code owner March 31, 2026 11:44

ChrisHegarty approved these changes Mar 31, 2026

View reviewed changes

thecoop reviewed Mar 31, 2026

View reviewed changes

libs/simdvec/native/src/vec/c/amd64/vec_2.cpp Show resolved Hide resolved

thecoop approved these changes Mar 31, 2026

View reviewed changes

ldematte enabled auto-merge (squash) March 31, 2026 13:34

ldematte merged commit 1b0c52d into elastic:main Mar 31, 2026
35 checks passed

ldematte deleted the native/fix-bulk-last-batch branch March 31, 2026 14:01

ldematte mentioned this pull request Apr 1, 2026

Comparative benchmarking and fine-tuning of optimized native scorers #145411

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bulk scoring to process last batch instead of falling through to scalar tail#145316

Fix bulk scoring to process last batch instead of falling through to scalar tail#145316
ldematte merged 4 commits intoelastic:mainfrom
ldematte:native/fix-bulk-last-batch

ldematte commented Mar 31, 2026 •

edited

Loading

Uh oh!

elasticsearchmachine commented Mar 31, 2026

Uh oh!

ChrisHegarty left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ldematte commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test plan

Uh oh!

elasticsearchmachine commented Mar 31, 2026

Uh oh!

ChrisHegarty left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ldematte commented Mar 31, 2026 •

edited

Loading