Skip to content

text_extraction invalid for habibi.pdf #1619

@pubpub-zz

Description

@pubpub-zz
          I've opened https://github.com/py-pdf/sample-files/pull/13 to put `habibi.pdf` in the sample-files repo.  i recommend including a test for it before merging this.

the extracted show the arab characters to be reversed

Originally posted by @dkg in #1126 (comment)

Metadata

Metadata

Assignees

No one assigned

    Labels

    is-bugFrom a users perspective, this is a bug - a violation of the expected behavior with a compliant PDF

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions