[ML] Fix end offset for first_non_blank_line char_filter by droberts195 · Pull Request #73882 · elastic/elasticsearch

droberts195 · 2021-06-08T07:53:32Z

When the input gets chopped by a char_filter immediately after
a token, that token must be reported as ending at the very end
of the original input, otherwise analysis will have incorrect
offsets when multiple field values are analyzed in the same
_analyze request.

The pattern_replace filter works like this. This PR changes
the new first_non_blank_line filter to work in the same way.

Backport of #73828

When the input gets chopped by a char_filter immediately after a token, that token must be reported as ending at the very end of the original input, otherwise analysis will have incorrect offsets when multiple field values are analyzed in the same _analyze request. The pattern_replace filter works like this. This PR changes the new first_non_blank_line filter to work in the same way. Backport of elastic#73828

Now that elastic#73882 is merged the test should pass on master. Relates elastic#73828

Now that #73882 is merged the test should pass on master. Relates #73828

droberts195 added backport v7.14.0 labels Jun 8, 2021

droberts195 merged commit e49b21f into elastic:7.x Jun 8, 2021

droberts195 deleted the fix_end_offset_for_first_non_blank_line_filter_7x branch June 8, 2021 08:38

droberts195 added a commit to droberts195/elasticsearch that referenced this pull request Jun 8, 2021

[ML] Unmute REST compat test after backport

d0135f1

Now that elastic#73882 is merged the test should pass on master. Relates elastic#73828

droberts195 mentioned this pull request Jun 8, 2021

[ML] Unmute REST compat test after backport #73885

Merged

droberts195 added a commit that referenced this pull request Jun 8, 2021

[ML] Unmute REST compat test after backport (#73885)

58e51a0

Now that #73882 is merged the test should pass on master. Relates #73828

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Fix end offset for first_non_blank_line char_filter#73882

[ML] Fix end offset for first_non_blank_line char_filter#73882
droberts195 merged 1 commit intoelastic:7.xfrom
droberts195:fix_end_offset_for_first_non_blank_line_filter_7x

droberts195 commented Jun 8, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

droberts195 commented Jun 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

droberts195 commented Jun 8, 2021 •

edited

Loading