Closed
Labels
bug, implemented
Description
Sentences sometimes have wrong coordinates.
Sample files used (PDF, TEI and training files): 60806_R1.zip
Notes:
- Borders are rendered by our application, based on the s[coords] values of the TEI elements (which are usually correct)
- The GROBID segmentation model has been trained on these PDFs (and the fulltext model "recognises the refs correctly")
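For readers who want to reproduce the border rendering, here is a minimal sketch of how an s[coords] value could be split into boxes. It assumes the GROBID coordinate convention in which each box is written as page,x,y,width,height and several boxes are joined with ; in a single attribute. The names `Box` and `parse_coords` and the sample string are hypothetical and are not taken from our application or from the attached files.

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    """One bounding box of a sentence (hypothetical helper type)."""
    page: int
    x: float
    y: float
    width: float
    height: float


def parse_coords(coords: str) -> List[Box]:
    """Split a coords attribute value into boxes.

    Assumption: boxes are separated by ';' and each box has the five
    comma-separated fields page,x,y,width,height.
    """
    boxes = []
    for chunk in coords.split(";"):
        chunk = chunk.strip()
        if not chunk:
            continue  # tolerate a trailing ';'
        parts = chunk.split(",")
        if len(parts) != 5:
            raise ValueError(f"expected 5 fields, got {len(parts)}: {chunk!r}")
        page, x, y, width, height = parts
        boxes.append(Box(int(page), float(x), float(y), float(width), float(height)))
    return boxes


# Made-up value for a sentence that spans two lines on page 1.
sample = "1,10.0,20.0,30.0,5.0;1,10.0,26.0,12.5,5.0"
for box in parse_coords(sample):
    print(box)
```

A sentence that wraps over several lines yields one box per line, so a renderer would draw each returned box separately on its page.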
Case: sentence with a <ref> element containing the ; character
Incorrect coordinates
Example 1
PDF (coordinates rendering)
TEI (processing)
Note: the right part of the ref (after the ; character) is no longer in this file
TEI (training)
Note: the entire ref is in this file
Correct coordinates
Example 1
PDF (coordinates rendering)
TEI (processing)
TEI (training)
Example 2
PDF (coordinates rendering)
TEI (processing)
TEI (training)