Summary
Evaluating two records on a feature, where one of those records doesn't have a value, penalizes the comparison too much. For example, if two birthdates don't match (1980-01-01 and 1990-06-05) that should result in a worse score than a record with 1980-01-01 and a record that is missing a birthdate. Right now, the former gets a better score.
Acceptance Criteria
Details / Tasks
The compare method needs to use the new "defaults/compare_missing_percentage" and the "defaults/compare_minimum_percentage" when calculating the value. When comparing the incoming record to an existing one, compare_missing_percentage of the log odds value should be rewarded for the feature comparison if either value is missing.
When comparing two records, if more than compare_minimum_percentage is missing from the comparison, the evaluation as a whole should result in a 0 score.
Dependencies
#230
Summary
Evaluating two records on a feature, where one of those records doesn't have a value, penalizes the comparison too much. For example, if two birthdates don't match (1980-01-01 and 1990-06-05) that should result in a worse score than a record with 1980-01-01 and a record that is missing a birthdate. Right now, the former gets a better score.
Acceptance Criteria
Details / Tasks
The compare method needs to use the new "defaults/compare_missing_percentage" and the "defaults/compare_minimum_percentage" when calculating the value. When comparing the incoming record to an existing one,
compare_missing_percentageof the log odds value should be rewarded for the feature comparison if either value is missing.When comparing two records, if more than
compare_minimum_percentageis missing from the comparison, the evaluation as a whole should result in a 0 score.Dependencies
#230