Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: py-pdf/pypdf
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: 2.1.1
Choose a base ref
...
head repository: py-pdf/pypdf
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: 2.2.0
Choose a head ref
  • 2 commits
  • 14 files changed
  • 2 contributors

Commits on Jun 13, 2022

  1. ENH: Text Extraction improvements (#969)

    * Improvements around /Encoding / /ToUnicode
    * Extraction of CMaps improved
    * Fallback for font def missing
    * Support for /Identity-H and /Identity-V: utf-16-be
    * Support for /GB-EUC-H / /GB-EUC-V / GBp/c-EUC-H / /GBpc-EUC-V (beta release for evaluation)
    * Arabic (for evaluation)
    * Whitespace extraction improvements
    pubpub-zz authored Jun 13, 2022
    Configuration menu
    Copy the full SHA
    72fcaae View commit details
    Browse the repository at this point in the history
  2. REL: 2.2.0

    The 2.2.0 release improves text extraction again via (#969):
    
    * Improvements around /Encoding / /ToUnicode
    * Extraction of CMaps improved
    * Fallback for font def missing
    * Support for /Identity-H and /Identity-V: utf-16-be
    * Support for /GB-EUC-H / /GB-EUC-V / GBp/c-EUC-H / /GBpc-EUC-V (beta release for evaluation)
    * Arabic (for evaluation)
    * Whitespace extraction improvements
    
    Those changes should mainly improve the text extraction for non-ASCII alphabets,
    e.g. Russian / Chinese / Japanese / Korean / Arabic.
    
    Full Changelog: 2.1.1...2.2.0
    MartinThoma committed Jun 13, 2022
    Configuration menu
    Copy the full SHA
    f0cd829 View commit details
    Browse the repository at this point in the history
Loading