Query refactoring: unify boost and query name by javanna · Pull Request #11974 · elastic/elasticsearch

javanna · 2015-07-01T12:31:10Z

Following the discussion in #11744, move boost and query _name to base class AbstractQueryBuilder with their getters and setters. Unify their serialization code and equals/hashcode handling in the base class too. This guarantess that every query supports both _name and boost and nothing needs to be done around those in subclasses besides properly parsing the fields in the parsers and printing them out as part of the doXContent method in the builders. More specifically, these are the performed changes:

Introduced printBoostAndQueryName utility method in AbstractQueryBuilder that subclasses can use to print out _name and boost in their doXContent method.
readFrom and writeTo are now final methods that take care of _name and boost serialization. Subclasses have to implement doReadFrom and doWriteTo instead.
toQuery is a final method too that takes care of properly applying _name and boost to the lucene query. Subclasses have to implement doToQuery instead. The query returned will have boost and queryName applied automatically.
Removed BoostableQueryBuilder interface, given that every query is boostable after this change. This won't have any negative effect on filters, as the boost simply gets ignored in that case.
Extended equals and hashcode to handle queryName and boost automatically as well.
Update the query test infra so that queryName and boost are tested automatically, and whenever they are forgotten in parser or doXContent tests fail, so this makes things a lot less error-prone
Introduced DEFAULT_BOOST constant to make sure we don't repeat 1.0f all the time for default boost values.

SpanQueryBuilder is again a marker interface only. The convenient toQuery that allowed us to override the return type to SpanQuery cannot be supported anymore due to a clash with the toQuery implementation from AbstractQueryBuilder. We have to go back to castin lucene Query to SpanQuery when dealing with span queries unfortunately.

Note that this change touches not only the already refactored queries but also the untouched ones, by making sure that we parse _name and boost whenever we need to and that we print them out as part of QueryBuilder#doXContent. This will result in printing out the default boost all the time rather than skipping it in non refactored queries, something that we would have changed anyway as part of the query refactoring.

The following are the queries that support boost now while previously they didn't (parser now parses it and builder prints it out): and, exists, fquery, geo_bounding_box, geo_distance, geo_distance_range, geo_hash_cell, geo_polygon, indices, limit, missing, not, or, script, type.

The following are the queries that support _name now while previously they didn't (parser now parses it and builder prints it out): boosting, constant_score, function_score, limit, match_all, type.

Range query parser supports now _name at the same level as boost too (_name is still supported on the outer object though for bw comp).

There are two exceptions that despite have getters and setters for queryName and boost don't really support boost and queryName: query filter and span multi term query. The reason for this is that they only support a single inner object which is another query that they wrap, no other elements.

Relates to #11744
Closes #10776

cbuescher · 2015-07-01T12:51:45Z

core/src/main/java/org/apache/lucene/queryparser/classic/MissingFieldQueryExtension.java

Is this because of your change or could newFilter() always return null even before the change?
Is it safe to return null if it does?

it could always return null, it does it explicitly.

javanna · 2015-07-01T13:21:48Z

This change is breaking for the java api, until we move to serialize queries in Streamable format rather than using their json representation. When using json (or smile etc.) the query builder prints out new fields (e.g. boost, _name) which the new version of the parser supports, but when the same request gets sent to an older node the parser will most likely throw error because it doesn't support boost (or _name).

cbuescher · 2015-07-01T13:36:30Z

core/src/main/java/org/elasticsearch/index/query/FQueryFilterBuilder.java

Can queryName be deleted here?

yea good catch!

I found a couple of other places where I have the same leftovers, going to clean those up

cbuescher · 2015-07-01T15:06:27Z

Did a round of review, LGTM.

javanna · 2015-07-01T15:12:54Z

I pushed new commits that address your comments :)

Following the discussion in elastic#11744, move boost and query _name to base class AbstractQueryBuilder with their getters and setters. Unify their serialization code and equals/hashcode handling in the base class too. This guarantess that every query supports both _name and boost and nothing needs to be done around those in subclasses besides properly parsing the fields in the parsers and printing them out as part of the doXContent method in the builders. More specifically, these are the performed changes: - Introduced printBoostAndQueryName utility method in AbstractQueryBuilder that subclasses can use to print out _name and boost in their doXContent method. - readFrom and writeTo are now final methods that take care of _name and boost serialization. Subclasses have to implement doReadFrom and doWriteTo instead. - toQuery is a final method too that takes care of properly applying _name and boost to the lucene query. Subclasses have to implement doToQuery instead. The query returned will have boost and queryName applied automatically. - Removed BoostableQueryBuilder interface, given that every query is boostable after this change. This won't have any negative effect on filters, as the boost simply gets ignored in that case. - Extended equals and hashcode to handle queryName and boost automatically as well. - Update the query test infra so that queryName and boost are tested automatically, and whenever they are forgotten in parser or doXContent tests fail, so this makes things a lot less error-prone - Introduced DEFAULT_BOOST constant to make sure we don't repeat 1.0f all the time for default boost values. SpanQueryBuilder is again a marker interface only. The convenient toQuery that allowed us to override the return type to SpanQuery cannot be supported anymore due to a clash with the toQuery implementation from AbstractQueryBuilder. We have to go back to castin lucene Query to SpanQuery when dealing with span queries unfortunately. Note that this change touches not only the already refactored queries but also the untouched ones, by making sure that we parse _name and boost whenever we need to and that we print them out as part of QueryBuilder#doXContent. This will result in printing out the default boost all the time rather than skipping it in non refactored queries, something that we would have changed anyway as part of the query refactoring. The following are the queries that support boost now while previously they didn't (parser now parses it and builder prints it out): and, exists, fquery, geo_bounding_box, geo_distance, geo_distance_range, geo_hash_cell, geo_polygon, indices, limit, missing, not, or, script, type. The following are the queries that support _name now while previously they didn't (parser now parses it and builder prints it out): boosting, constant_score, function_score, limit, match_all, type. Range query parser supports now _name at the same level as boost too (_name is still supported on the outer object though for bw comp). There are two exceptions that despite have getters and setters for queryName and boost don't really support boost and queryName: query filter and span multi term query. The reason for this is that they only support a single inner object which is another query that they wrap, no other elements. Relates to elastic#11744 Closes elastic#10776 Closes elastic#11974

javanna added >enhancement review labels Jul 1, 2015

cbuescher reviewed Jul 1, 2015
View reviewed changes

javanna added the >breaking label Jul 1, 2015

cbuescher reviewed Jul 1, 2015
View reviewed changes

javanna force-pushed the enhancement/unify_query_name_boost branch from deddf73 to cab3a68 Compare July 1, 2015 15:49

javanna merged commit cab3a68 into elastic:feature/query-refactoring Jul 1, 2015

kevinkluge removed the review label Jul 1, 2015

cbuescher mentioned this pull request Dec 9, 2015

Range query does not support _name anymore in 2.x #15306

Closed

clintongormley added :Search/Search Search-related issues that do not fall into other categories and removed :Query Refactoring labels Feb 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query refactoring: unify boost and query name#11974

Query refactoring: unify boost and query name#11974
javanna merged 1 commit intoelastic:feature/query-refactoringfrom
javanna:enhancement/unify_query_name_boost

javanna commented Jul 1, 2015

Uh oh!

cbuescher Jul 1, 2015

Uh oh!

javanna Jul 1, 2015

Uh oh!

javanna commented Jul 1, 2015

Uh oh!

cbuescher Jul 1, 2015

Uh oh!

javanna Jul 1, 2015

Uh oh!

javanna Jul 1, 2015

Uh oh!

cbuescher commented Jul 1, 2015

Uh oh!

javanna commented Jul 1, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

javanna commented Jul 1, 2015

Uh oh!

cbuescher Jul 1, 2015

Choose a reason for hiding this comment

Uh oh!

javanna Jul 1, 2015

Choose a reason for hiding this comment

Uh oh!

javanna commented Jul 1, 2015

Uh oh!

cbuescher Jul 1, 2015

Choose a reason for hiding this comment

Uh oh!

javanna Jul 1, 2015

Choose a reason for hiding this comment

Uh oh!

javanna Jul 1, 2015

Choose a reason for hiding this comment

Uh oh!

cbuescher commented Jul 1, 2015

Uh oh!

javanna commented Jul 1, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants