use HashMap instead of TreeMap for OnHeapHnswGraph neighbors by jbellis · Pull Request #12248 · apache/lucene

jbellis · 2023-04-27T16:43:43Z

Switches OnHeapHnswGraph from representing a single node's neighbors as a TreeMap, to a HashMap, and updates its callers to no longer assume that NodeIterator results are ordered by ordinal.

This means that looking up neighbors goes from an O(log N) operation to an O(1) operation for all upper levels. Only callers that need ordered neighbors need to pay the cost of sorting.

On my machine, building the graph and running the queries from the SIFT dataset at http://corpus-texmex.irisa.fr/ sees about a 4% speedup. These are dimension 128 vectors; the relative importance of looking up neighbors vs computing similarities will vary inversely with dimensionality.

(First I did three runs for each codebase, but there was enough variance that I then ran five more.)

[TreeMap]
Run 0 took 708.876145328 seconds
Run 1 took 754.700354185 seconds
Run 2 took 710.236167725 seconds
Run 0 took 717.61476478 seconds
Run 1 took 742.494343683 seconds
Run 2 took 736.983143255 seconds
Run 3 took 719.305541186 seconds
Run 4 took 722.831859596 seconds

[HashMap]
Run 0 took 724.875610053 seconds
Run 1 took 682.65666933 seconds
Run 2 took 682.655977613 seconds
Run 0 took 716.165341298 seconds
Run 1 took 684.314657618 seconds
Run 2 took 686.456432263 seconds
Run 3 took 717.067567184 seconds
Run 4 took 702.009706983 seconds

The timing harness is here: https://github.com/jbellis/hnswdemo/tree/lucene-bench

Switch OnHeapHnswGraph from representing a single node's neighbors as a TreeMap, to a HashMap, and update its callers to no longer assume that NodeIterator results are ordered by ordinal.

…nd Collections.sort instead

msokolov

I left some style questions but overall seems reasonable - thank you! We only need the nodes sorted when we finally write the graph. In the meantime we want fast insertion and lookup and don't care about the sorting.

Could you say how you used that test harness? I looked at it, and didn't see a method that would measure and print out multiple iterations of indexing times as you showed in the overview.

lucene/core/src/test/org/apache/lucene/util/hnsw/HnswGraphTestCase.java

lucene/core/src/java/org/apache/lucene/util/hnsw/HnswGraph.java

jbellis · 2023-04-28T19:54:04Z

(My mistake; it looks like tidy/spotlessApply don't care about imports; it must have been intellij's fault.)

jbellis · 2023-04-29T02:21:47Z

I saw NeighborQueue construction taking up a decent chunk of time on the profiler data, so I refactored HGSearcher a bit to only allocate once per call to search. This takes out about another 1% build time.

[HashMap build only]
Run 4 took 696.4547865 seconds
Run 1 took 696.7912881 seconds
Run 0 took 698.4082808 seconds
Run 3 took 699.6830504 seconds
Run 2 took 700.3700336 seconds

[HashMap + NQ change, build only]
Run 4 took 689.9413937 seconds
Run 3 took 690.78149 seconds
Run 2 took 691.8227998 seconds
Run 1 took 692.4830887 seconds
Run 0 took 694.5696814 seconds

msokolov · 2023-04-29T12:55:50Z

I'm not entirely sure why just from inspection, but this seems to have broken some of the backwards-compatibility tests. What it means is this can no longer read indexes written by a prior point release. Maybe consider reverting the change to pre-allocate the NeighborQueue so we can move forward with the change from TreeMap to HashMap?

lucene/core/src/test/org/apache/lucene/util/hnsw/HnswGraphTestCase.java

lucene/core/src/java/org/apache/lucene/util/hnsw/OnHeapHnswGraph.java

lucene/core/src/java/org/apache/lucene/util/hnsw/HnswGraphSearcher.java

lucene/core/src/java/org/apache/lucene/codecs/lucene95/Lucene95HnswVectorsWriter.java

jbellis · 2023-04-29T21:31:23Z

I've added the types in, but I'd like to push back a bit on requiring Locale for string.formatted in test code -- the alternative isn't really using Locale and localizing the assertion messages, the realistic alternative is doing old school string concatenation with +, and that's just worse all around.

jbellis · 2023-04-29T21:38:27Z

^ I think that's everything, I'll look at what's going on w/ the tests for the NeighborQueue piece and create a new PR

jbellis · 2023-04-29T22:38:24Z

^ last commit addresses the legacy test failures, not clear to me why they didn't fail before on this branch, but with this change everything passes with and w/o the NQ commit

msokolov · 2023-04-30T17:39:19Z

I've added the types in, but I'd like to push back a bit on requiring Locale for string.formatted in test code -- the alternative isn't really using Locale and localizing the assertion messages, the realistic alternative is doing old school string concatenation with +, and that's just worse all around.

I don't understand - can't we use String.format(Locale, String ...) ? This isn't about localization, it's about using a consistent Locale for string formatting.

msokolov · 2023-04-30T17:41:44Z

^ last commit addresses the legacy test failures, not clear to me why they didn't fail before on this branch, but with this change everything passes with and w/o the NQ commit

Lucene's tests are executed with randomized settings, so to reproduce them you often have to provide the same test seed that was used. This gets printed out in a reproduce command in the failure message; see where it has "-Dtests.seed=xxxxx"

msokolov · 2023-04-30T17:49:29Z

also - in the future I recommend not using force-push with github PRs since it loses all the history and (I think) can erase the comments that were tied to specific commits. At any rate it makes things difficult for reviewers since we have to re-review the entire contribution rather than being able to see what changed since the last review

jbellis · 2023-04-30T17:59:57Z

My bad, I missed that one.

…fying the linter

msokolov · 2023-04-30T22:06:04Z

I cleaned up the last few String.format issues and pushed 3c16374

Thank you @jbellis!

jbellis · 2023-05-01T00:52:26Z

Thanks, Michael!

Jonathan Ellis added 4 commits April 27, 2023 11:42

Use HashMap for OnHeapHnswGraph neighbors

21676c8

Switch OnHeapHnswGraph from representing a single node's neighbors as a TreeMap, to a HashMap, and update its callers to no longer assume that NodeIterator results are ordered by ordinal.

run tidy

3641519

add pretty-printing to graph equality tests

851fa73

using IntStream.iterate to sort doesn't work here, use a while loop a…

23875d4

…nd Collections.sort instead

jbellis mentioned this pull request Apr 28, 2023

add ConcurrentOnHeapHnswGraph and Builder #12254

Closed

msokolov reviewed Apr 28, 2023

View reviewed changes

lucene/core/src/test/org/apache/lucene/util/hnsw/HnswGraphTestCase.java Outdated Show resolved Hide resolved

lucene/core/src/java/org/apache/lucene/util/hnsw/HnswGraph.java Outdated Show resolved Hide resolved

move prettyPrint to HnswGraphTestCase

c8d1484

msokolov requested changes Apr 29, 2023

View reviewed changes

assertGraphInitializedFromGraph also needs to sort nodes on level

b3b1d11

jbellis force-pushed the hnsw-hashmap branch from 6f7c942 to c875dbc Compare April 29, 2023 21:23

explicit types

1d547de

jbellis force-pushed the hnsw-hashmap branch from c875dbc to 1d547de Compare April 29, 2023 21:29

switch to for loop for nodesOnLevel iteration

42f93b8

inline assertion messages

c8331c0

jbellis force-pushed the hnsw-hashmap branch from aa860ac to c8331c0 Compare April 29, 2023 21:41

sort nodes on level in legacy writer classes, too

1cb8776

no wildcard imports

e0d9ebf

r/m use of .formatted

dd0cc8d

remove use of .formatted in test code, reducing readability but satis…

5bd7ac9

…fying the linter

msokolov closed this Apr 30, 2023

msokolov mentioned this pull request May 9, 2023

allocate one NeighborQueue per search for results #12255

Merged

Conversation

jbellis commented Apr 27, 2023

Uh oh!

msokolov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jbellis commented Apr 28, 2023

Uh oh!

jbellis commented Apr 29, 2023

Uh oh!

msokolov commented Apr 29, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jbellis commented Apr 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jbellis commented Apr 29, 2023

Uh oh!

jbellis commented Apr 29, 2023

Uh oh!

msokolov commented Apr 30, 2023

Uh oh!

msokolov commented Apr 30, 2023

Uh oh!

msokolov commented Apr 30, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jbellis commented Apr 30, 2023

Uh oh!

msokolov commented Apr 30, 2023

Uh oh!

jbellis commented May 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jbellis commented Apr 29, 2023 •

edited

Loading

msokolov commented Apr 30, 2023 •

edited

Loading