Increase filename boost #785
Merged
Golden queries evals (note the change in "Explain file", a class of queries that contain a filename): [Before and After screenshots not preserved]
CodeSearchNet evals (results unchanged): [After screenshot not preserved]
mmanela
approved these changes
May 28, 2024
Just checking: did the filename boost actually decrease recall here? When I add up all the numerators, I get 64 instead of 60.
@chenkc805 good catch! I checked our eval snapshots for the change right before this one and confirmed that 60/89 is in fact right. I had a copy-paste error in the explanation, and I've updated the comment above. I even called out how "Explain file" improved, so we can be sure I was looking at the right numbers!
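When we introduced filename boosting in BM25, we set it to a very conservative
weight. This PR increases the weight from 2.0 to 5.0, which improves results on
the golden queries evals (notably the "Explain file" class of queries, which
contain a filename) and leaves the CodeSearchNet evals unchanged.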
Relates to SPLF-88
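
For readers unfamiliar with filename boosting, here is a minimal sketch of one way such a weight could enter a BM25 score. It is not the code changed in this PR. It assumes a BM25F-style scheme in which a term's count in the filename is multiplied by the boost and added to its count in the file body. The names `tokenize`, `bm25_rank`, and `filename_boost`, the `k1` and `b` defaults, and the toy documents are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and split on non-alphanumerics, so "auth.py" -> ["auth", "py"].
    return [t for t in re.split(r"[^a-z0-9]+", text.lower()) if t]


def bm25_rank(query, docs, filename_boost=5.0, k1=1.2, b=0.75):
    """Rank (filename, body) pairs for a query; returns [(filename, score), ...].

    Assumption for illustration: the boost scales filename term counts
    before they are added to body term counts (BM25F-style). Length
    normalization uses the body length only, which is a simplification.
    """
    fields = [(Counter(tokenize(name)), Counter(tokenize(body)))
              for name, body in docs]
    lengths = [sum(body_tf.values()) for _, body_tf in fields]
    avg_len = sum(lengths) / len(lengths) or 1.0
    n = len(docs)
    query_terms = set(tokenize(query))

    scores = []
    for (name_tf, body_tf), length in zip(fields, lengths):
        norm = k1 * (1 - b + b * length / avg_len)
        score = 0.0
        for term in query_terms:
            # Document frequency counts a match in either field.
            df = sum(1 for nf, bf in fields if term in nf or term in bf)
            idf = math.log(1 + (n - df + 0.5) / (df + 0.5))
            tf = body_tf[term] + filename_boost * name_tf[term]
            score += idf * tf * (k1 + 1) / (tf + norm)
        scores.append(score)

    names = [name for name, _ in docs]
    return sorted(zip(names, scores), key=lambda pair: pair[1], reverse=True)


# Toy corpus: the query names auth.py, but only utils.py mentions "auth" in its body.
docs = [
    ("auth.py", "def login user password check"),
    ("utils.py", "helper for auth auth auth token refresh auth"),
]
no_boost = bm25_rank("explain auth.py", docs, filename_boost=0.0)
boosted = bm25_rank("explain auth.py", docs, filename_boost=5.0)
print(no_boost[0][0], boosted[0][0])
```

In this toy setup, the file named in the query ranks first only once filename matches contribute to the score, which is the behavior that "Explain file" style queries depend on. How the real weight is applied may differ from this sketch.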