.extractText() reads / as 1.

I'm trying to automate sorting pdfs by the date on the pdf. However the issue I continue having is that the /'s in the dates continually get read as 1's. Wouldn't be a problem 90% of the time unfortunately it reads a lot of January and November dates as the same

1/11/2022
11/1/2022

Both end up as 111112022

I tried getting the new pdfs to change to a new format to have 01/11/2022 but they aren't able to do that. Is there a way to fix this?

```python
from PyPDF2 import PdfReader

reader = PdfReader("TestPackingSlip637860440227283947.pdf")
print(f"Total pages= {len(reader.pages)}")

for i, page in enumerate(reader.pages, start=1):
    print(f"Page: {i}")
    print(page.extract_text())
```

The info on the pdf I'm uploading is randomized and does not represent anyone's real info.


[TestPackingSlip637860440227283947.pdf](https://github.com/py-pdf/PyPDF2/files/8526720/TestPackingSlip637860440227283947.pdf)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.extractText() reads / as 1. #789

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

.extractText() reads / as 1. #789

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions