-
-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Improve BibTeX-from-PDF import #11999
Copy link
Copy link
Labels
component: import-loadgood second issueIssues that involve a tour of two or three interweaved components in JabRefIssues that involve a tour of two or three interweaved components in JabRef📌 Pinned📍 AssignedAssigned by assign-issue-action (or manually assigned)Assigned by assign-issue-action (or manually assigned)
Metadata
Metadata
Assignees
Labels
component: import-loadgood second issueIssues that involve a tour of two or three interweaved components in JabRefIssues that involve a tour of two or three interweaved components in JabRef📌 Pinned📍 AssignedAssigned by assign-issue-action (or manually assigned)Assigned by assign-issue-action (or manually assigned)
Type
Fields
Give feedbackNo fields configured for feature.
!! This is more an issue to experiment with heuristics. How can a machine with "traditional" (non-AI) code create useful information. !!
When importing the PDF se2paper.pdf
one gets following BibTeX entry
However, the title should be better:
The properties of the file show
Tasks:
org.jabref.logic.importer.fileformat.PdfMergeMetadataImporter#importDatabase(java.nio.file.Path))stripper.setSortByPosition(true);needs to be removed fromorg.jabref.logic.importer.fileformat.PdfContentImporter#getFirstPageContents. Maybe, two methods need to be done to be able to parse the title (depending on positon) and parsing the abstract (more on content)Hint:
org.jabref.logic.importer.fileformat.PdfContentImporterTestand ´org.jabref.logic.importer.fileformat.PdfMergeMetadataImporterTest`