v2.0.0 #1

- [x] When parsing tokens, pre-process the HTML with `sanitize-html` before passing it to `striptags`, so that blocks like `<style>`, `<stylesheet>`, `<meta>`, `<head>`, etc. are removed.
- [x] Modify `scanner.getPhishingResults` to check against ~[OpenPhish][]~ and [PhishTank][] datasets.
- [x] Tokenize and stem other mail headers (e.g. to, from, cc, bcc, reply-to, in-reply-to).
- [x] Determine solution to performance issue with `classifier.train()` in `classifier.js` per [NaturalNode/natural#520](https://github.com/NaturalNode/natural/issues/520).
- [x] Headers should NOT be converted; they should be preserved as-is for URL/Received-By purposes. Only content should be converted.
- [x] Get inspiration from `ls /usr/share/spamassassin` if needed
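
For the header tokenization item above, a rough dependency-free sketch of what this could look like. Everything here is an assumption for illustration: `tokenizeHeaders`, `tokenizeValue`, and `ADDRESS_HEADERS` are hypothetical names, not the scanner's actual API, and `headers` is assumed to be a plain object keyed by header name. Stemming is left as a pluggable `stem` function (identity by default), so a real stemmer could be passed in without changing the tokenizer.

```javascript
// Hypothetical sketch, not the project's actual implementation.
// Headers whose values we want to tokenize (taken from the checklist item).
const ADDRESS_HEADERS = ['to', 'from', 'cc', 'bcc', 'reply-to', 'in-reply-to'];

// Lowercase a header value and split it into alphanumeric word tokens.
function tokenizeValue(value) {
  return String(value)
    .toLowerCase()
    .split(/[^a-z0-9]+/)
    .filter(Boolean);
}

// `headers`: plain object keyed by header name (assumed shape).
// `stem`: optional stemmer applied to each token; identity by default.
function tokenizeHeaders(headers, stem = (token) => token) {
  const tokens = [];
  for (const [name, value] of Object.entries(headers)) {
    // Skip headers that are not in the list above (e.g. Subject).
    if (!ADDRESS_HEADERS.includes(name.toLowerCase())) continue;
    const values = Array.isArray(value) ? value : [value];
    for (const v of values) {
      for (const token of tokenizeValue(v)) tokens.push(stem(token));
    }
  }
  return tokens;
}

const example = tokenizeHeaders({
  From: 'Alice Smith <alice@example.com>',
  'Reply-To': 'billing@example.com',
  Subject: 'ignored here'
});
console.log(example);
```

The original header values are never mutated here, which fits the requirement that headers stay preserved for URL/Received-By purposes while tokens are derived separately.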

[openphish]: https://openphish.com/
[phishtank]: https://phishtank.com/
[nsfw]: https://github.com/infinitered/nsfwjs
[toxicity]: https://github.com/tensorflow/tfjs-models/tree/master/toxicity
