ROB: Silently ignore Adobe Ascii85 whitespace by mbierma · Pull Request #3528 · py-pdf/pypdf

mbierma · 2025-11-20T08:43:48Z

The PDF standard specifies that "the ASCII85Decode filter shall ignore all white-space characters" (spaces, tabs, newlines, etc.)

While the code currently strips leading and trailing whitespace, it fails to remove whitespace characters contained within the main data stream
This issue manifests because the Python standard library a85decode is performing its end-of-data check (~>) before honoring the ignorechars parameter for internal whitespace

This can cause failures when decoding certain PDF files and the fix improves the robustness of #2996.

The update ensures that all whitespace characters immediately preceding the final > are removed prior to passing the data to the a85decode decoder.

Based on the PDF standards "the ASCII85Decode filter shall ignore all white-space characters".

codecov · 2025-11-20T08:55:08Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 97.16%. Comparing base (e9e3735) to head (dec66f5).
⚠️ Report is 86 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #3528   +/-   ##
=======================================
  Coverage   97.16%   97.16%           
=======================================
  Files          57       57           
  Lines        9807     9809    +2     
  Branches     1780     1781    +1     
=======================================
+ Hits         9529     9531    +2     
  Misses        167      167           
  Partials      111      111

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

pypdf/filters.py

tests/test_filters.py

stefan6419846

Thanks.

stefan6419846 · 2025-11-21T09:09:08Z

Please note that it might take another day to merge this as https://docs.python.org is currently offline and we rely on it for generating documentation, which would fail our CI on the merge commit otherwise.

@stefan6419846

## What's new ### Security (SEC) - Reduce default limit for LZW decoding by @stefan6419846 ### New Features (ENH) - Parse and format comb fields in text widget annotations (#3519) by @PJBrs ### Robustness (ROB) - Silently ignore Adobe Ascii85 whitespace for suffix detection (#3528) by @mbierma [Full Changelog](6.3.0...6.4.0)

mbierma added 2 commits November 20, 2025 00:19

ROB: Silently ignore Adobe Ascii85 whitespace

df794b0

Based on the PDF standards "the ASCII85Decode filter shall ignore all white-space characters".

linting

2e9459e

stefan6419846 reviewed Nov 20, 2025

View reviewed changes

pypdf/filters.py Outdated Show resolved Hide resolved

stefan6419846 reviewed Nov 20, 2025

View reviewed changes

pypdf/filters.py Outdated Show resolved Hide resolved

mbierma added 2 commits November 20, 2025 07:31

Reduce whitespace replacement to just before the final >

21b2c07

Add additional check for final >

7b3e1e0

stefan6419846 reviewed Nov 20, 2025

View reviewed changes

tests/test_filters.py Outdated Show resolved Hide resolved

mbierma added 2 commits November 20, 2025 07:52

Update test to check result matches expected value

26d1974

Update length check

dec66f5

stefan6419846 approved these changes Nov 21, 2025

View reviewed changes

stefan6419846 merged commit 82faf98 into py-pdf:main Nov 21, 2025
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ROB: Silently ignore Adobe Ascii85 whitespace#3528

ROB: Silently ignore Adobe Ascii85 whitespace#3528
stefan6419846 merged 6 commits intopy-pdf:mainfrom
mbierma:main

mbierma commented Nov 20, 2025 •

edited

Loading

Uh oh!

codecov bot commented Nov 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stefan6419846 left a comment

Uh oh!

stefan6419846 commented Nov 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mbierma commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stefan6419846 left a comment

Choose a reason for hiding this comment

Uh oh!

stefan6419846 commented Nov 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mbierma commented Nov 20, 2025 •

edited

Loading

codecov bot commented Nov 20, 2025 •

edited

Loading