fix issues with destination (#604) by pubpub-zz · Pull Request #821 · py-pdf/pypdf

pubpub-zz · 2022-04-25T14:29:54Z

Root cause: probably the destination is not extracted properly from the document.

Changes:

* `getDestinationPageNumber` returns -1 when the destination page is a NullObject
* with `strict=False`, a destination to the first page is returned to prevent the error (no change with `strict=True`)
  Note: a warning is generated.

A test was added using the sample file.
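The two changes above can be sketched roughly as follows. This is a simplified illustration, not the code from the PR: `NullObject` and `Destination` are minimal stand-ins for the PyPDF2 classes, `pages` is a plain list, and the helper names `get_destination_page_number` and `build_destination` as well as the use of `None` to model a malformed destination are assumptions made for the example.

```python
import warnings


class NullObject:
    """Stand-in for PyPDF2's NullObject (hypothetical, simplified)."""


class Destination:
    """Stand-in destination holding a title and a page reference."""

    def __init__(self, title, page):
        self.title = title
        self.page = page


def get_destination_page_number(pages, destination):
    """Return the 0-based page index, or -1 if the page reference is null."""
    if isinstance(destination.page, NullObject):
        return -1
    return pages.index(destination.page)


def build_destination(title, page, pages, strict):
    """Build a destination; on a malformed target, raise or fall back."""
    if page is None:  # assumption: None models a destination that failed to parse
        if strict:
            raise ValueError(f"Unknown destination: {title!r}")
        # Non-strict mode: warn and point at the first page instead of failing.
        warnings.warn(f"Unknown destination {title!r}; using first page")
        page = pages[0]
    return Destination(title, page)
```

With this shape, callers that add 1 to the result for display get 0 for a missing destination, so they may want to check for -1 explicitly.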

MartinThoma · 2022-04-28T16:55:21Z

PyPDF2/pdf.py

        :rtype: int
        """
        indirectRef = destination.page
+        if type(indirectRef) is NullObject:


We might want `isinstance` here instead of the `type(...) is` check (reasons). What do you think about it?
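For context, a minimal sketch of how the two checks differ. The classes here are stand-ins, not the PyPDF2 ones, and `TracedNullObject` is a hypothetical subclass invented for the example: `type(x) is C` matches only exact instances, while `isinstance(x, C)` also accepts subclasses.

```python
class NullObject:
    """Stand-in for PyPDF2's NullObject (hypothetical, simplified)."""


class TracedNullObject(NullObject):
    """Hypothetical subclass, used only to show the difference."""


plain = NullObject()
derived = TracedNullObject()

# Exact type comparison: the subclass instance is not recognised.
exact_match = [type(obj) is NullObject for obj in (plain, derived)]

# isinstance: instances of subclasses are recognised as well.
subclass_aware = [isinstance(obj, NullObject) for obj in (plain, derived)]

print(exact_match)     # [True, False]
print(subclass_aware)  # [True, True]
```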

MartinThoma · 2022-04-28T16:58:15Z

@pubpub-zz Looks good to me, except for the type <-> isinstance part.

I did a lot of changes (applying the black formatter + splitting the pdf module into many submodules). Do you want me to deal with the merge conflicts or can you do that?

codecov · 2022-04-29T16:02:37Z

Codecov Report

Merging #821 (2dd0986) into main (80f2f25) will increase coverage by 0.22%.
The diff coverage is 100.00%.

❗ Current head 2dd0986 differs from pull request most recent head 7596ce3. Consider uploading reports for the commit 7596ce3 to get more accurate results

@@            Coverage Diff             @@
##             main     #821      +/-   ##
==========================================
+ Coverage   75.35%   75.58%   +0.22%     
==========================================
  Files          12       12              
  Lines        3563     3571       +8     
  Branches      822      824       +2     
==========================================
+ Hits         2685     2699      +14     
+ Misses        661      657       -4     
+ Partials      217      215       -2

| Impacted Files | Coverage Δ |
|---|---|
| PyPDF2/pdf.py | `82.34% <100.00%> (+0.33%)` ⬆️ |
| PyPDF2/generic.py | `68.12% <0.00%> (+0.28%)` ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 80f2f25...7596ce3.

pubpub-zz · 2022-04-29T21:20:57Z

> @pubpub-zz Looks good to me, except for the type <-> isinstance part.
>
> I did a lot of changes (applying the black formatter + splitting the pdf module into many submodules). Do you want me to deal with the merge conflicts or can you do that?

@MartinThoma, sure: I will propose a new PR.

py-pdf#604

Root cause: probably the destination is not extracted properly from the document.

Changes:

* `getDestinationPageNumber` returns -1 when the destination page is a NullObject
* with `strict=False`, a destination to the first page is returned to prevent the error (no change with `strict=True`)
  Note: a warning is generated.

A test was added using the sample file.

(duplicate of py-pdf#821 to match refactoring)

If a destination is missing, getDestinationPageNumber now returns -1.
If `strict=False`, the first page is used as a fallback.

The code triggering the exception was

```python
from PyPDF2 import PdfFileReader

# https://github.com/mstamy2/PyPDF2/files/6045010/thyroid.pdf
with open("thyroid.pdf", "rb") as f:
    reader = PdfFileReader(f)
    bookmarks = reader.getOutlines()
    for b in bookmarks:
        print(reader.getDestinationPageNumber(b) + 1)  # page count starts from 0
```

The error message was:

    PyPDF2.utils.PdfReadError: Unknown Destination Type: 0

Closes #604
Closes #821

pubpub-zz added 3 commits April 25, 2022 16:25

late cleanup

34cd64b

cleanup

2dd0986

MartinThoma reviewed Apr 28, 2022

cleanup

7596ce3

pubpub-zz mentioned this pull request Apr 30, 2022

fix issues with missing destinations (#604) #840

Merged

This pull request was closed.

pubpub-zz wants to merge 4 commits into py-pdf:main from pubpub-zz:iss604
