Parameterize the vector result benchmarks on implementation and function by thecoop · Pull Request #139423 · elastic/elasticsearch

thecoop · 2025-12-12T10:12:00Z

This makes it a lot more scalable, so more functions can be added without adding more test methods.

So that the benchmark tests continue to work, the random data generation is pulled out into VectorData classes that can be re-used across several different Benchmark classes for specific implementations, to ensure the results are still as expected.

elasticsearchmachine · 2025-12-12T10:12:25Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

ldematte

I like it! Some small comments but look good to me

ldematte · 2025-12-12T11:11:01Z

...arks/src/main/java/org/elasticsearch/benchmark/vector/scorer/VectorScorerInt7uBenchmark.java

-    float vec2CorrectionConstant;
-    float scoreCorrectionConstant;
+    @Param
+    public Implementation implementation;


ldematte · 2025-12-12T11:11:15Z

...arks/src/main/java/org/elasticsearch/benchmark/vector/scorer/VectorScorerInt7uBenchmark.java

-        IOUtils.rm(path);
-    }
+    @Param
+    public Function function;


...arks/src/main/java/org/elasticsearch/benchmark/vector/scorer/VectorScorerInt7uBenchmark.java

ldematte · 2025-12-12T11:15:32Z

...arks/src/main/java/org/elasticsearch/benchmark/vector/scorer/VectorScorerInt7uBenchmark.java

    @Benchmark
-    public float squareDistanceNativeQuery() throws IOException {
-        return nativeSqrScorerQuery.score(1);
+    public float scoreQuery() throws IOException {


What happens if these are run on Java 21? (supportsHeapSegments() returns false?)
Is there a way to filter these benchmarks out?

The method will throw an exception (NPE in this case), and the test will just be ignored by JMH

A bit brutal :D but if it works...

.../src/main/java/org/elasticsearch/benchmark/vector/scorer/VectorScorerInt7uBulkBenchmark.java

ldematte

LGTM but I'd like for @ChrisHegarty to give it a look too.
I remember he was thinking at a naming convention or another way to run a subset of benchmarks -- JMH uses pattern matching on the function names, so being consisten pays.
Adding parameters for dimensions like implementation or function might help here, as they can be used as "filters" (e.g. just run all the benchmarks for native and dot product).

Also, do you have an example of what the output will look like?

ChrisHegarty · 2025-12-16T09:34:19Z

I remember he was thinking at a naming convention or another way to run a subset of benchmarks -- JMH uses pattern matching on the function names, so being consisten pays.
..
Also, do you have an example of what the output will look like?

Right. I think that the refactoring is probably ok, but I'd like to see how what the output looks like, and how we can more easily select subsets with various regex patterns.

thecoop · 2025-12-17T09:45:43Z

This command:

./gradlew -Druntime.java=25 -p benchmarks run --args 'VectorScorerInt7uBenchmark -p dims=96,1024 -p function=DOT_PRODUCT'

produces this output:

Benchmark                              (dims)   (function)  (implementation)   Mode  Cnt   Score    Error   Units
VectorScorerInt7uBenchmark.score           96  DOT_PRODUCT            SCALAR  thrpt    5  30.679 ±  0.747  ops/us
VectorScorerInt7uBenchmark.score           96  DOT_PRODUCT            LUCENE  thrpt    5  74.437 ±  4.182  ops/us
VectorScorerInt7uBenchmark.score           96  DOT_PRODUCT            NATIVE  thrpt    5  60.110 ±  4.005  ops/us
VectorScorerInt7uBenchmark.score         1024  DOT_PRODUCT            SCALAR  thrpt    5   3.178 ±  2.831  ops/us
VectorScorerInt7uBenchmark.score         1024  DOT_PRODUCT            LUCENE  thrpt    5  13.406 ±  0.825  ops/us
VectorScorerInt7uBenchmark.score         1024  DOT_PRODUCT            NATIVE  thrpt    5  23.472 ± 16.359  ops/us
VectorScorerInt7uBenchmark.scoreQuery      96  DOT_PRODUCT            LUCENE  thrpt    5  60.697 ± 21.170  ops/us
VectorScorerInt7uBenchmark.scoreQuery      96  DOT_PRODUCT            NATIVE  thrpt    5  57.546 ±  8.331  ops/us
VectorScorerInt7uBenchmark.scoreQuery    1024  DOT_PRODUCT            LUCENE  thrpt    5  12.945 ±  3.980  ops/us
VectorScorerInt7uBenchmark.scoreQuery    1024  DOT_PRODUCT            NATIVE  thrpt    5  34.916 ±  2.796  ops/us

ldematte · 2025-12-17T15:36:23Z

Thanks for the output. I think it is nice, as it gives us a way (using parameters) to filter/execute benchmarks over multiple dimensions (function, like you did in the example, but also implementation).
We could for example easily filter for int7 bulk (both via name) dot product (param) native (param). That would be more difficult with just names.

thecoop added 2 commits December 11, 2025 16:26

Refactor benchmark classes

8036c15

Update benchmark tests

f340576

thecoop requested review from ChrisHegarty and ldematte December 12, 2025 10:12

thecoop added >test Issues or PRs that are addressing/adding tests :Search Relevance/Vectors Vector search labels Dec 12, 2025

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Dec 12, 2025

elasticsearchmachine added the v9.3.0 label Dec 12, 2025

ldematte reviewed Dec 12, 2025

View reviewed changes

thecoop added 3 commits December 12, 2025 13:43

Rename methods

0153831

Update benchmark utils methods

746ae2a

Merge branch 'main' into parameterized-vector-benchmarks

1e7a983

thecoop requested a review from ldematte December 12, 2025 16:38

Merge branch 'main' into parameterized-vector-benchmarks

9cac74b

thecoop mentioned this pull request Dec 15, 2025

Parameterize VectorSimilarityFunctionsTests #139516

Merged

ldematte approved these changes Dec 15, 2025

View reviewed changes

Merge branch 'main' into parameterized-vector-benchmarks

378cca5

thecoop force-pushed the parameterized-vector-benchmarks branch from e9d4e7f to bff2990 Compare December 17, 2025 10:20

Use VectorSimilarityType

348cd94

thecoop force-pushed the parameterized-vector-benchmarks branch from bff2990 to 348cd94 Compare December 17, 2025 10:21

thecoop merged commit 8cccfd9 into elastic:main Dec 17, 2025
35 checks passed

thecoop deleted the parameterized-vector-benchmarks branch December 17, 2025 16:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parameterize the vector result benchmarks on implementation and function#139423

Parameterize the vector result benchmarks on implementation and function#139423
thecoop merged 8 commits intoelastic:mainfrom
thecoop:parameterized-vector-benchmarks

thecoop commented Dec 12, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Dec 12, 2025

Uh oh!

ldematte left a comment

Uh oh!

ldematte Dec 12, 2025

Uh oh!

ldematte Dec 12, 2025

Uh oh!

Uh oh!

ldematte Dec 12, 2025

Uh oh!

thecoop Dec 12, 2025

Uh oh!

ldematte Dec 12, 2025

Uh oh!

Uh oh!

ldematte left a comment

Uh oh!

ChrisHegarty commented Dec 16, 2025

Uh oh!

thecoop commented Dec 17, 2025

Uh oh!

ldematte commented Dec 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

thecoop commented Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Dec 12, 2025

Uh oh!

ldematte left a comment

Choose a reason for hiding this comment

Uh oh!

ldematte Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

ldematte Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ldematte Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

thecoop Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

ldematte Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ldematte left a comment

Choose a reason for hiding this comment

Uh oh!

ChrisHegarty commented Dec 16, 2025

Uh oh!

thecoop commented Dec 17, 2025

Uh oh!

ldematte commented Dec 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

thecoop commented Dec 12, 2025 •

edited

Loading