Skip to content

[ML] Token highlighting is completely haywire on long messages in categorization wizard with new default analyzer #101255

@droberts195

Description

@droberts195

Kibana version:

7.14.0-SNAPSHOT

Elasticsearch version:

7.14.0-SNAPSHOT

Server OS version:

macOS Catalina

Browser version:

Chrome 90.0.4430.212

Browser OS version:

macOS Catalina

Original install method (e.g. download page, yum, from source, etc.):

gradle run for Elasticsearch and yarn start for Kibana.

Describe the bug:

When creating a categorization job on some Elasticsearch logs using the new default categorization analyzer we are adding for 7.14 the token highlighting is completely wrong for some of the longer messages. It's not just that the wrong characters are highlighted - the highlighting process seems to have completely messed up the text too. Highlighted tokens seem to be printed on top of non-highlighted text in the wrong places.

Also, some other messages don't seem to have any tokens highlighted at all.

Steps to reproduce:

  1. Upload a file called logs.csv using file upload - ping me to get a copy of this - and have the uploader create an index pattern for it
  2. Create a new categorization job using the categorization wizard
  3. On the first tab, ask to use the full time range of the data
  4. On the second tab let the validation take place, then look at the example messages with highlighted tokens, and hopefully you'll see something similar to the screenshot below

Expected behavior:

Token highlighting should make sense and be rendered correctly.

Screenshots (if relevant):

Screenshot 2021-06-03 at 11 13 05

Metadata

Metadata

Assignees

No one assigned

    Labels

    :mlFeature:Anomaly DetectionML anomaly detectionbugFixes for quality problems that affect the customer experiencev7.14.0

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions