Skip to content

Fix feeds encoded in UTF-16LE#40

Merged
Alkarex merged 1 commit intofreshrssfrom
fix-UTF-16LE
Jun 24, 2025
Merged

Fix feeds encoded in UTF-16LE#40
Alkarex merged 1 commit intofreshrssfrom
fix-UTF-16LE

Conversation

@Alkarex
Copy link
Member

@Alkarex Alkarex commented Jun 24, 2025

Example of valid feed not working in SimplePie: https://haveibeenpwned.com/feed/breaches/
Regression due to simplepie#445
The final character > of a feed is encoded as 3E00 in UTF-16LE, so calling trim() was removing the \x00, breaking the multibyte encoding and making the feed invalid.
Downstream issue FreshRSS/FreshRSS#7690
Upstream PR simplepie#916

Example of valid feed not working in SimplePie: https://haveibeenpwned.com/feed/breaches/
Regression due to simplepie#445
The final character `>` of a feed is encoded as `3E00` in UTF-16LE, so calling `trim()` was removing the `\x00`, breaking the multibyte encoding and making the feed invalid.
Downstream issue FreshRSS/FreshRSS#7690
@Alkarex Alkarex added the bug label Jun 24, 2025
@Alkarex Alkarex merged commit 370de7c into freshrss Jun 24, 2025
20 checks passed
Alkarex added a commit to Alkarex/FreshRSS that referenced this pull request Jun 24, 2025
Fix FreshRSS#7690
FreshRSS/simplepie#40
The final character `>` of a feed is encoded as `3E00` in UTF-16LE, so calling `trim()` was removing the `\x00`, breaking the multibyte encoding and making the feed invalid.

Upstream PR simplepie/simplepie#916
Alkarex added a commit that referenced this pull request Jun 25, 2025
simplepie#917
Slight refactor of simplepie#916 (#40) to cover all paths.
Missing paths included the fsock method without gzip (e.g. deflate or plain).
Alkarex added a commit to Alkarex/FreshRSS that referenced this pull request Jun 25, 2025
FreshRSS/simplepie#42
simplepie/simplepie#917
Slight refactor of simplepie/simplepie#916 (FreshRSS/simplepie#40) to cover all paths.
Missing paths included the fsock method without gzip (e.g. deflate or plain).
Alkarex added a commit to FreshRSS/FreshRSS that referenced this pull request Jun 25, 2025
* Fix feeds encoded in UTF-16LE
Fix #7690
FreshRSS/simplepie#40
The final character `>` of a feed is encoded as `3E00` in UTF-16LE, so calling `trim()` was removing the `\x00`, breaking the multibyte encoding and making the feed invalid.

Upstream PR simplepie/simplepie#916

* Trim body for all paths
FreshRSS/simplepie#42
simplepie/simplepie#917
Slight refactor of simplepie/simplepie#916 (FreshRSS/simplepie#40) to cover all paths.
Missing paths included the fsock method without gzip (e.g. deflate or plain).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant