v2.0.0 #1

@niftylettuce

Description

  • When parsing tokens, the input needs to be pre-processed with sanitize-html before it reaches striptags, so that blocks like <style>, <stylesheet>, <meta>, <head>, etc. are removed.
  • Modify scanner.getPhishingResults to check against OpenPhish and PhishTank datasets.
  • Tokenize and stem other mail headers (e.g. to, from, cc, bcc, reply-to, in-reply-to).
  • Determine a solution to the performance issue with classifier.train() in classifier.js, per NaturalNode/natural#520.
  • Headers should NOT be converted; they should be preserved as-is for URL/Received-By purposes. Only content should be converted.
  • Take inspiration from the files listed by ls /usr/share/spamassassin if needed.
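
For the header tokenization item, here is a minimal sketch of what the step might look like in plain JavaScript. Every name in it (HEADERS_TO_TOKENIZE, tokenizeHeaderValue, tokenizeHeaders) is hypothetical and not taken from the project. Stemming is left as a pluggable function that defaults to a no-op, on the assumption that a real stemmer would be passed in by the caller.

```javascript
// Hypothetical sketch; none of these names exist in the project.
// Header names are taken from the list in this issue.
const HEADERS_TO_TOKENIZE = ['to', 'from', 'cc', 'bcc', 'reply-to', 'in-reply-to'];

// Placeholder stemmer that returns the token unchanged.
// A real stemmer could be supplied by the caller instead.
const identityStem = (token) => token;

// Split one header value into lowercase tokens, breaking on any
// character that is not a Unicode letter or number.
function tokenizeHeaderValue(value, stem = identityStem) {
  return String(value)
    .toLowerCase()
    .split(/[^\p{L}\p{N}]+/u)
    .filter(Boolean)
    .map((token) => stem(token));
}

// Build an object mapping each header of interest to its tokens.
// Headers not in HEADERS_TO_TOKENIZE are skipped, and repeated
// headers (array values) are flattened into one token list.
function tokenizeHeaders(headers, stem = identityStem) {
  const result = {};
  for (const [name, value] of Object.entries(headers)) {
    const key = name.toLowerCase();
    if (!HEADERS_TO_TOKENIZE.includes(key)) continue;
    const values = Array.isArray(value) ? value : [value];
    result[key] = values.flatMap((v) => tokenizeHeaderValue(v, stem));
  }
  return result;
}
```

Calling tokenizeHeaders({ From: 'Alice Smith <alice@example.com>' }) would split the display name and the address into separate tokens under the from key. Whether address parts should be kept whole rather than split on @ and . is an open design choice that this sketch does not settle.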

Metadata

Assignees

No one assigned

    Labels

    help wanted: Extra attention is needed