pdfdocencode table is wrong #151

@jribbens

In the _pdfDocEncoding translation table, characters 9, 10 and 13 (tab, line feed and carriage return) are marked as illegal when in fact they should translate to themselves. This means that decode_pdfdocencoding() fails on any string containing one of these characters, which includes all multi-line strings.
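A minimal sketch of the failure mode, assuming a table-driven decoder in which an illegal byte is marked by a sentinel entry. The names `build_table`, `decode` and `ILLEGAL` are hypothetical, and the table is a simplified stand-in covering only printable ASCII, not the library's actual _pdfDocEncoding table or its decode_pdfdocencoding() implementation.

```python
# Hypothetical stand-in for a byte-to-character translation table:
# index = byte value, entry = decoded character, or ILLEGAL if unmapped.
ILLEGAL = None


def build_table(allow_whitespace_controls):
    """Build a simplified 256-entry table (printable ASCII only)."""
    table = [ILLEGAL] * 256
    for code in range(0x20, 0x7F):
        table[code] = chr(code)
    if allow_whitespace_controls:
        # The fix described above: 9, 10 and 13 translate to themselves.
        for code in (9, 10, 13):
            table[code] = chr(code)
    return table


def decode(data, table):
    """Decode bytes via the table, rejecting bytes marked ILLEGAL."""
    chars = []
    for byte in data:
        ch = table[byte]
        if ch is ILLEGAL:
            raise ValueError(f"byte {byte} has no entry in translation table")
        chars.append(ch)
    return "".join(chars)


broken = build_table(allow_whitespace_controls=False)
fixed = build_table(allow_whitespace_controls=True)
sample = b"line one\nline two\r\n\tindented"
```

With the `broken` table, `decode(sample, broken)` raises on the first line feed, while `decode(sample, fixed)` returns the text unchanged.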

Metadata

    Labels

    is-bug: From a user's perspective, this is a bug - a violation of the expected behavior with a compliant PDF
    needs-change: The PR/issue cannot be handled as issue and needs to be improved
    workflow-text-extraction: From a user's perspective, text extraction is the affected feature/workflow
