-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Closed
Labels
is-robustness-issueFrom a users perspective, this is about robustnessFrom a users perspective, this is about robustness
Description
Hi
We are processing quite a lot of PDFs, and from time to time we see the following assert fail on specific PDFs when trying to get the mediabox property of a page.
pypdf/pypdf/generic/_rectangle.py
Line 24 in 27edc06
| assert len(arr) == 4 |
Here is the content of page:
{'/Contents': [IndirectObject(34, 0, 131870331607200)],
'/CropBox': [0, 0, 595, 841, 0, 0, 595, 841],
'/MediaBox': [0, 0, 595, 841, 0, 0, 595, 841],
'/Parent': IndirectObject(1, 0, 131870331607200),
'/Resources': IndirectObject(5, 0, 131870331607200),
'/Type': '/Page'}Is this "just" a malformed PDF (it opens without problem in a wide range of pdf readers)? Unfortunately, I can't share the PDF, since it contains sensitive customer information.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
is-robustness-issueFrom a users perspective, this is about robustnessFrom a users perspective, this is about robustness