Improved doc search by Alkarex · Pull Request #6785 · FreshRSS/FreshRSS

Alkarex · 2024-09-07T07:39:27Z

No description provided.

mtalexan · 2024-09-07T15:16:34Z

docs/en/users/10_filter.md

 Additional reading: [De Morgan’s laws](https://en.wikipedia.org/wiki/De_Morgan%27s_laws).

-> ℹ️ Searches are applied to the raw HTML content
+> ℹ️ Searches are applied to the HTML content, and are automatically XML-encoded (so one can search for `'A & B'` without having to encode the `&amp;`).


You call it "XML encoded" here, but "HTML encoded" down in the regex section. It's the same encoding, but XML is probably also worth mentioning in the regex section, since searching even a plain-text article is affected (i.e. matching a plain-text title containing `Q&A:` needs to be done with `/Q&amp;A:/`). The HTML example down there is still relevant too.
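To illustrate the difference discussed here, the following is a minimal Python sketch, not FreshRSS code. It assumes the stored title is already XML/HTML-encoded, that a normal search encodes the user's term before matching, and that a regex search uses the pattern as written. The helper names `plain_search` and `regex_search` are hypothetical.

```python
import html
import re

def plain_search(term: str, stored_html: str) -> bool:
    # Assumption: a normal search encodes the term the same way the stored text is encoded.
    return html.escape(term, quote=False) in stored_html

def regex_search(pattern: str, stored_html: str) -> bool:
    # Assumption: a regex search is matched against the stored text without any encoding.
    return re.search(pattern, stored_html) is not None

# A plain-text title "Q&A: feeds" as it would look once encoded for storage.
stored_title = html.escape("Q&A: feeds", quote=False)

print(stored_title)
print(plain_search("Q&A:", stored_title))
print(regex_search(r"Q&A:", stored_title))
print(regex_search(r"Q&amp;A:", stored_title))
```

Under these assumptions the plain search matches with the unencoded term, while the regex only matches when the `&` is written as `&amp;`.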

For the record, the title is also an HTML field, hence the same syntax

If I'm not mistaken, it's supported, but up to the actual feed to decide whether it will specify it in plain text or HTML, correct? Plain text is just a subset of HTML with the exception of the 4 characters that have to be escaped, and those also have to be escaped for XML.

I guess I never thought too much about it, but I suppose HTML in the RSS XML probably isn't being double-escaped, is it?

When we sanitize the title (and other text fields), the end result is always HTML. Otherwise we would not be able to display those different fields safely.

Oh duh, or CDATA in the XML.
So when FreshRSS is importing the XML content, does it require CDATA sections for the title and content, or does it unwrap CDATA and decode non-CDATA fields?

All that is handled by the sanitization / normalisation.
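As a rough illustration of why CDATA and entity-escaped fields can end up identical, here is a short Python sketch using the standard library XML parser. It shows generic XML parsing only and is not FreshRSS's sanitization code. The final `html.escape` step is an assumed stand-in for "the end result is always HTML", and the function names are hypothetical.

```python
import html
import xml.etree.ElementTree as ET

# The same title, once entity-escaped and once wrapped in a CDATA section.
escaped_item = "<item><title>Q&amp;A: feeds</title></item>"
cdata_item = "<item><title><![CDATA[Q&A: feeds]]></title></item>"

def title_text(item_xml: str) -> str:
    # The XML parser decodes entities and unwraps CDATA, returning plain text either way.
    return ET.fromstring(item_xml).findtext("title")

def normalised_title(item_xml: str) -> str:
    # Assumption: normalisation re-encodes the plain text so the stored result is HTML.
    return html.escape(title_text(item_xml), quote=False)

print(title_text(escaped_item) == title_text(cdata_item))
print(normalised_title(cdata_item))
```

In this sketch both forms yield the same stored string, which is why a regex would need `&amp;` regardless of how the feed wrote the title.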

Improved doc search

3528137

Alkarex added Documentation 📚 Search 🔍 labels Sep 7, 2024

Alkarex added this to the 1.25.0 milestone Sep 7, 2024

Alkarex mentioned this pull request Sep 7, 2024

Regex search #6706

Merged

mtalexan reviewed Sep 7, 2024

<&">

758f67e

Alkarex merged commit af37d88 into FreshRSS:edge Sep 7, 2024

Alkarex deleted the doc-search branch September 7, 2024 21:25

Alkarex merged 2 commits into FreshRSS:edge from Alkarex:doc-search
