Implement distance_feature for runtime dates by nik9000 · Pull Request #60851 · elastic/elasticsearch

nik9000 · 2020-08-06T21:23:02Z

This implements the distance_feature for date valued runtime_scripts. This produces the same numbers running against an indexed date, but it doesn't have the same performance characteristics at all. Which is normal for runtime_scripts. But distance_feature` against an indexes fields does a lot of work to refine the query as it goes, limiting the number of documents that it has to visit. We can't do that because we don't have an index. So we just spit out the same numbers and hope it is good enough.

…ature

elasticmachine · 2020-08-06T21:23:04Z

Pinging @elastic/es-search (:Search/Search)

mayya-sharipova

Thanks @nik9000, I have left a couple of comments

mayya-sharipova · 2020-08-13T16:18:02Z

...in/java/org/elasticsearch/xpack/runtimefields/query/LongScriptFieldDistanceFeatureQuery.java

+
+        @Override
+        public float getMaxScore(int upTo) throws IOException {
+            return boost;


Should a potential max score be: weight instead of boost?
This method and several other methods don't seem to throw IOException

Yes! good catch.

mayya-sharipova · 2020-08-13T16:34:04Z

...in/java/org/elasticsearch/xpack/runtimefields/query/LongScriptFieldDistanceFeatureQuery.java

+
+    @Override
+    public int hashCode() {
+        return Objects.hash(super.hashCode(), origin, pivot);


should we incorporate the initial boost to hashCode, equals, toString ?

mayya-sharipova · 2020-08-13T16:57:55Z

...in/java/org/elasticsearch/xpack/runtimefields/query/LongScriptFieldDistanceFeatureQuery.java

+        protected DistanceScorer(Weight weight, AbstractLongScriptFieldScript script, int maxDoc, float boost) {
+            super(weight);
+            this.script = script;
+            twoPhase = new TwoPhaseIterator(DocIdSetIterator.all(maxDoc)) {


I am not familiar with runtime fields, but I am wondering if we intend always to create an iterator across all documents? Do we plan to add support to limit number of docs (e.g. only docs returned by a top filter)?

I believe we're hoping for bool queries to AND together a "normal" query and a runtime field query. I've experimented with this for our term and match style queries and it seems to work pretty well. If the "normal" query is selective then the runtime query won't be asked if it matches most documents. On the flip side, if the runtime field query non-selective then we'll quickly fill up the 10,000 hits and terminate early.

nik9000

I'll push a patch to address your comments soon!

nik9000 · 2020-08-13T18:28:23Z

...in/java/org/elasticsearch/xpack/runtimefields/query/LongScriptFieldDistanceFeatureQuery.java

+
+        @Override
+        public float getMaxScore(int upTo) throws IOException {
+            return boost;


Yes! good catch.

nik9000 · 2020-08-13T18:28:36Z

...in/java/org/elasticsearch/xpack/runtimefields/query/LongScriptFieldDistanceFeatureQuery.java

+
+    @Override
+    public int hashCode() {
+        return Objects.hash(super.hashCode(), origin, pivot);


nik9000 · 2020-08-13T18:31:51Z

...in/java/org/elasticsearch/xpack/runtimefields/query/LongScriptFieldDistanceFeatureQuery.java

+        protected DistanceScorer(Weight weight, AbstractLongScriptFieldScript script, int maxDoc, float boost) {
+            super(weight);
+            this.script = script;
+            twoPhase = new TwoPhaseIterator(DocIdSetIterator.all(maxDoc)) {


I believe we're hoping for bool queries to AND together a "normal" query and a runtime field query. I've experimented with this for our term and match style queries and it seems to work pretty well. If the "normal" query is selective then the runtime query won't be asked if it matches most documents. On the flip side, if the runtime field query non-selective then we'll quickly fill up the 10,000 hits and terminate early.

…ature

javanna · 2020-08-14T09:14:57Z

...ds/src/main/java/org/elasticsearch/xpack/runtimefields/mapper/ScriptDateMappedFieldType.java

+                boost
+            );
+        });
+    }


I wonder what the plan is for the instanceof checks in DistanceFeatureQueryBuilder#doToQuery . Are we ok with keeping those?

oh actually those are gone upstream, great! sorry for the noise then, you already did what I would asked you to do

javanna

LGTM

nik9000 added 2 commits August 6, 2020 17:13

WIP

2a1c4b1

Merge branch 'feature/runtime_fields' into runtime_fields_distance_fe…

05d5c5a

…ature

nik9000 added the :Search/Search Search-related issues that do not fall into other categories label Aug 6, 2020

nik9000 requested a review from javanna August 6, 2020 21:23

elasticmachine added the Team:Search Meta label for search team label Aug 6, 2020

javanna mentioned this pull request Aug 6, 2020

Add support for runtime fields #59332

Closed

30 tasks

Finish test

f788941

mayya-sharipova approved these changes Aug 13, 2020

View reviewed changes

nik9000 commented Aug 13, 2020

View reviewed changes

nik9000 added 2 commits August 13, 2020 15:10

Merge branch 'feature/runtime_fields' into runtime_fields_distance_fe…

dfd27ff

…ature

Update tests

9739a0c

javanna reviewed Aug 14, 2020

View reviewed changes

javanna approved these changes Aug 14, 2020

View reviewed changes

nik9000 merged commit f3b65eb into elastic:feature/runtime_fields Aug 17, 2020

Conversation

nik9000 commented Aug 6, 2020

Uh oh!

elasticmachine commented Aug 6, 2020

Uh oh!

mayya-sharipova left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nik9000 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

javanna left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants