Disable scoring of Keyword Term search by default, allow user use old logic using useSimilarity parameter#17889
Conversation
|
❌ Gradle check result for 6b60c9c: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
{"run-benchmark-test": "id_1"} |
|
{"run-benchmark-test": "id_4"} |
|
The Jenkins job url is https://build.ci.opensearch.org/job/benchmark-pull-request/2828/ . Final results will be published once the job is completed. |
|
Edit: n/m, code out of sync I'm getting this error when trying to update the mapping: Without that useSimilarity it works: |
|
❌ Gradle check result for 493a1d3: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
* will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
* will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
493a1d3 to
ef419d5
Compare
Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
|
❌ Gradle check result for ef419d5: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
❌ Gradle check result for 16882dc: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
* also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
|
Failure was real. In case I also where we do |
|
❌ Gradle check result for c8e4dcd: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
❕ Gradle check result for c8e4dcd: UNSTABLE Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure. |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #17889 +/- ##
============================================
+ Coverage 72.43% 72.47% +0.03%
- Complexity 66952 66976 +24
============================================
Files 5467 5470 +3
Lines 309605 309707 +102
Branches 45043 45052 +9
============================================
+ Hits 224272 224466 +194
+ Misses 66978 66904 -74
+ Partials 18355 18337 -18 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
4308f4c
into
opensearch-project:main
… logic using useSimilarity parameter (opensearch-project#17889) * Initial attempt to add constant scorer for term keyword search * will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Add changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit test Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Apply style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * since term is a basic query type, it affects many that build on top * none of these should cause a regression, infact this change should fix any possible regression these query types, which may not be copatured in big5 datasets Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix more unit tests Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix flakly test * due to the random nature of org.opensearch.index.query.SimpleQueryStringBuilderTests.testToQuery, this can sometimes fail with `but: was <ConstantScore(mapped_string_2:za)>` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting * Fix style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Change parameter to use snake_case: use_similarity Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix integ tests * also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> --------- Signed-off-by: Asim Mahmood <asim.seng@gmail.com> Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting Signed-off-by: Asim M <asim.seng@gmail.com> Signed-off-by: Sriram Ganesh <srignsh22@gmail.com>
… logic using useSimilarity parameter (opensearch-project#17889) * Initial attempt to add constant scorer for term keyword search * will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Add changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit test Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Apply style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * since term is a basic query type, it affects many that build on top * none of these should cause a regression, infact this change should fix any possible regression these query types, which may not be copatured in big5 datasets Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix more unit tests Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix flakly test * due to the random nature of org.opensearch.index.query.SimpleQueryStringBuilderTests.testToQuery, this can sometimes fail with `but: was <ConstantScore(mapped_string_2:za)>` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting * Fix style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Change parameter to use snake_case: use_similarity Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix integ tests * also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> --------- Signed-off-by: Asim Mahmood <asim.seng@gmail.com> Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting Signed-off-by: Asim M <asim.seng@gmail.com> Signed-off-by: Harsh Kothari <techarsh@amazon.com>
… logic using useSimilarity parameter (opensearch-project#17889) * Initial attempt to add constant scorer for term keyword search * will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Add changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit test Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Apply style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * since term is a basic query type, it affects many that build on top * none of these should cause a regression, infact this change should fix any possible regression these query types, which may not be copatured in big5 datasets Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix more unit tests Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix flakly test * due to the random nature of org.opensearch.index.query.SimpleQueryStringBuilderTests.testToQuery, this can sometimes fail with `but: was <ConstantScore(mapped_string_2:za)>` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting * Fix style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Change parameter to use snake_case: use_similarity Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix integ tests * also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> --------- Signed-off-by: Asim Mahmood <asim.seng@gmail.com> Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting Signed-off-by: Asim M <asim.seng@gmail.com> Signed-off-by: Harsh Kothari <techarsh@amazon.com>
Description
Related Issues
Resolves #17823
Check List
Testing with useSimilarity:true, took time 10ms, score 1.0
Testing with useSimilarity:true, took time 200ms, score 0.8...