LUCENE-9614: Fix KnnVectorQuery failure when numDocs is 0 by jtibshirani · Pull Request #413 · apache/lucene

jtibshirani · 2021-10-25T19:26:28Z

When the reader has no live docs, KnnVectorQuery can error out. This happens
because IndexReader#numDocs is 0, and we end up passing an illegal value of
k = 0 to the search method.

This commit removes the problematic optimization in KnnVectorQuery and
replaces it with a lower-level one based on the total number of vectors in the segment.
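
To make the failure mode concrete, here is a minimal, self-contained sketch. It does not use Lucene; `KnnZeroDocsSketch`, `search`, `searchBoundedByNumDocs` and `searchUnbounded` are hypothetical names, and the stand-in `search` is only assumed to reject `k < 1` the way the description above says the real search method does. Filtering results by live docs is left out.

```java
// Illustrative sketch only: these names are hypothetical and are not Lucene APIs.
final class KnnZeroDocsSketch {

  /** Stand-in for a kNN search that rejects k < 1 and returns at most vectorCount ordinals. */
  static int[] search(int k, int vectorCount) {
    if (k < 1) {
      throw new IllegalArgumentException("k must be at least 1, got: " + k);
    }
    // Lower-level bound: never size the result beyond the vectors that exist.
    int n = Math.min(k, vectorCount);
    int[] hits = new int[n];
    for (int i = 0; i < n; i++) {
      hits[i] = i;
    }
    return hits;
  }

  /** Old call shape: when numDocs is 0, the bound collapses k to 0 before the search. */
  static int[] searchBoundedByNumDocs(int k, int numDocs, int vectorCount) {
    return search(Math.min(k, numDocs), vectorCount);
  }

  /** New call shape: pass k through unchanged and rely on the lower-level bound. */
  static int[] searchUnbounded(int k, int vectorCount) {
    return search(k, vectorCount);
  }
}
```

With `numDocs == 0`, `searchBoundedByNumDocs(10, 0, 3)` throws in this sketch, while `searchUnbounded(10, 3)` completes and returns at most three ordinals.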

jtibshirani · 2021-10-25T19:27:22Z

This is a pretty obscure bug; I only noticed it because we use a custom FilterLeafReader as part of our security implementation. I wasn't able to reproduce it using regular deletes or even soft deletes.

jpountz · 2021-10-25T20:15:12Z

lucene/core/src/java/org/apache/lucene/search/KnnVectorQuery.java

    for (LeafReaderContext ctx : reader.leaves()) {
-      perLeafResults[ctx.ord] = searchLeaf(ctx, Math.min(k, reader.numDocs()));
+      int numDocs = ctx.reader().numDocs();
+      perLeafResults[ctx.ord] = numDocs > 0 ? searchLeaf(ctx, Math.min(k, numDocs)) : NO_RESULTS;


This makes me wonder why we pass min(k, numDocs) rather than just k. Is it to avoid oversizing the heap that collects nearest neighbors?

One reason I'm wondering is that in the past we tried to avoid calling numDocs() unless it was strictly necessary, because it's expensive on reader views that hide subsets of documents (LUCENE-9003).
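
As a rough illustration of why numDocs() can cost more than it looks on such a view, here is a hedged sketch. `HidingViewSketch` is a hypothetical class, not a Lucene reader, and it assumes the view only knows which documents are visible, so the live count has to be derived by scanning.

```java
// Hypothetical reader view that hides some documents; not a Lucene class.
final class HidingViewSketch {
  private final boolean[] visible; // visible[doc] is false for hidden documents
  private int cachedNumDocs = -1;  // -1 means "not computed yet"

  HidingViewSketch(boolean[] visible) {
    this.visible = visible.clone();
  }

  /** Constant time: the size of the doc ID space, hidden documents included. */
  int maxDoc() {
    return visible.length;
  }

  /** Linear in maxDoc on the first call, because every document must be checked. */
  int numDocs() {
    if (cachedNumDocs < 0) {
      int count = 0;
      for (boolean v : visible) {
        if (v) {
          count++;
        }
      }
      cachedNumDocs = count;
    }
    return cachedNumDocs;
  }
}
```

In this sketch a caller that only needs an upper bound can use `maxDoc()` and skip the scan entirely.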

I also guessed it was to avoid oversizing the heap for tiny segments. I'm not sure how helpful this is though; maybe we can simplify and just use k here.

I was thinking of passing k here, and moving the logic to avoid oversizing the heap to Lucene90HnswVectorsReader by doing k = min(k, size()) (where size() is the number of docs that have a vector).

This makes sense to me, so I pushed a change. Instead of Lucene90HnswVectorsReader, I thought it could make sense to apply the bound in HnswGraph. But this turned out messier because there are separate concepts for topK and numSeed (we're cleaning this up as part of LUCENE-10054).

Thanks for fixing this - it makes sense to me to use size() instead of numDocs(), or even simply k; I wasn't aware of the costly nature of that call. Indeed the idea here was just to avoid spending extra work on tiny segments; something I noticed all the time in tests, but which is probably not much of an issue in reality.
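
The idea discussed in this thread, passing k through unchanged and applying k = min(k, size()) inside the vectors reader, might look roughly like the sketch below. `VectorsReaderSketch` is a hypothetical stand-in, not the Lucene90HnswVectorsReader API, and it swaps the HNSW graph search for a brute-force scan with squared Euclidean distance to stay self-contained.

```java
// Hypothetical per-segment vectors reader; brute force replaces the HNSW graph search.
final class VectorsReaderSketch {
  private final float[][] vectors; // one entry per document that has a vector

  VectorsReaderSketch(float[][] vectors) {
    this.vectors = vectors;
  }

  /** Number of documents that have a vector. */
  int size() {
    return vectors.length;
  }

  /** Returns the ordinals of up to k nearest vectors, closest first. */
  int[] search(float[] target, int k) {
    k = Math.min(k, size()); // the bound lives here, so callers can pass k as-is
    int[] result = new int[k];
    boolean[] taken = new boolean[size()];
    for (int i = 0; i < k; i++) {
      int best = -1;
      float bestDist = Float.POSITIVE_INFINITY;
      for (int ord = 0; ord < size(); ord++) {
        if (taken[ord]) {
          continue;
        }
        float d = squaredDistance(target, vectors[ord]);
        if (best == -1 || d < bestDist) {
          best = ord;
          bestDist = d;
        }
      }
      taken[best] = true;
      result[i] = best;
    }
    return result;
  }

  private static float squaredDistance(float[] a, float[] b) {
    float sum = 0f;
    for (int i = 0; i < a.length; i++) {
      float diff = a[i] - b[i];
      sum += diff * diff;
    }
    return sum;
  }
}
```

A segment without vectors returns an empty array here, with no special case in the caller and no call to numDocs().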

jpountz · 2021-10-27T07:04:01Z

lucene/core/src/test/org/apache/lucene/search/TestKnnVectorQuery.java

-import org.apache.lucene.index.Term;
-import org.apache.lucene.index.VectorSimilarityFunction;
+import org.apache.lucene.document.*;
+import org.apache.lucene.index.*;


Oh, I thought we failed the build on wildcard imports, but apparently we don't. Maybe still use explicit imports to reduce line changes of this PR?

I also noticed our static analysis is totally fine with it (surprisingly?). I'll need to fix my IntelliJ setup :)

LUCENE-9614: Fix KnnVectorQuery failure when LeafReader#numDocs is 0

832aec5

jtibshirani mentioned this pull request Oct 25, 2021

Ensure kNN search respects authorization elastic/elasticsearch#79693

Merged

jpountz reviewed Oct 25, 2021

jtibshirani requested a review from msokolov October 25, 2021 20:56

jtibshirani added 2 commits October 26, 2021 16:41

Push optimization down to vectors format

da2dfbf

Reformat

a321c3e

jpountz approved these changes Oct 27, 2021

jtibshirani added 2 commits October 27, 2021 08:54

Remove star imports

c2b049d

Fix spotless

f305804

jtibshirani merged commit abd5ec4 into apache:main Oct 27, 2021

jtibshirani deleted the knn-query branch October 27, 2021 18:08

jtibshirani merged 5 commits into apache:main from jtibshirani:knn-query
