-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Closed
Labels
workflow-text-extractionFrom a users perspective, text extraction is the affected feature/workflowFrom a users perspective, text extraction is the affected feature/workflow
Description
I need to read this PDF.
However, it does not correctly extracts the text.
f = open('myfile.pdf', 'rb')
reader = PdfFileReader(f)
content = reader.getPage(0).extractText()
f.close()
print(content)
This print
Resultado da Prova de Sele“‰o...
But I expected
Resultado da Prova de Seleção...
Accordance with the answer on Stack Overflow, this problem is in PyPDF
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
workflow-text-extractionFrom a users perspective, text extraction is the affected feature/workflowFrom a users perspective, text extraction is the affected feature/workflow