ROB: Deal with DictionaryObjects having streams of length 0 by stefan6419846 · Pull Request #3114 · py-pdf/pypdf

stefan6419846 · 2025-02-10T13:24:38Z

Closes #3052.

Likend · 2025-10-01T17:18:01Z

            if length > 0:
                data["__streamdata__"] = stream. Read(length)
            elif length < 0:
                data["__streamdata__"] = read_until_regex(
                    stream, re.compile(b"endstream")
                )

...

        if "__streamdata__" in data:
            return StreamObject.initialize_from_dictionary(data)
        retval = DictionaryObject()
        retval.update(data)
        return retval

I think the problem lies in this piece of code. A StreamObject of length 0 will not have "__streamdata__" in data, and thus it will be parsed into a DictionaryObject instead of a StreamObject.

I've run pytest in my Windows,. The tests failed because tika-950337.pdf contains a length-0 content stream in page 2 (in test_compress_raised, test_workflows.py), which will be parsed to be a DictionaryObject. However compress_content_streams() need to call get_data() method of the content, so it failed.

ROB: Deal with DictionaryObjects having streams of length 0

ad6c2e7

stefan6419846 closed this Feb 10, 2025

Likend mentioned this pull request Oct 1, 2025

BUG: Fix handling of zero-length StreamObject #3485

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ROB: Deal with DictionaryObjects having streams of length 0#3114

ROB: Deal with DictionaryObjects having streams of length 0#3114
stefan6419846 wants to merge 1 commit intopy-pdf:mainfrom
stefan6419846:issue3052

stefan6419846 commented Feb 10, 2025

Uh oh!

Likend commented Oct 1, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

stefan6419846 commented Feb 10, 2025

Uh oh!

Likend commented Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Likend commented Oct 1, 2025 •

edited

Loading