Extracting from pdfs

Hello, I am using Grobid for my project and I am working with PDF drug labels. I have noticed two problems when a PDF is extracted into XML:
1. It often fails to extract the text that comes right after an image.
2. It sometimes merges a new section heading into the preceding section. For example, after extracting section 12.3, it extracts section 12.4 as a continuation of section 12.3 instead of as a new section.
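
In case it helps with reproducing the second problem, here is a minimal sketch of how the merged headings could be flagged in the extracted XML. It is only a heuristic under stated assumptions: that the output is TEI in the `http://www.tei-c.org/ns/1.0` namespace, that each section is a `<div>` with a `<head>` (optionally carrying its number in an `n` attribute) followed by `<p>` elements, and that drug label headings look like "12.4 Microbiology". The function name `find_swallowed_headings` and the regular expression are hypothetical and not part of Grobid.

```python
import re
import xml.etree.ElementTree as ET

# Assumption: the extracted XML uses the standard TEI namespace.
TEI = "http://www.tei-c.org/ns/1.0"
NS = {"tei": TEI}

# Heuristic: a section label is "<number>.<number>" followed by a capitalised
# word, e.g. "12.4 Microbiology". This can also match ordinary prose.
SECTION_LABEL = re.compile(r"\b(\d{1,2}\.\d{1,2})\s+[A-Z][a-z]+")


def find_swallowed_headings(tei_xml):
    """Return (head_text, section_number) pairs where a section label other
    than the section's own number appears inside that section's head or
    paragraphs."""
    root = ET.fromstring(tei_xml)
    hits = []
    for div in root.iter("{%s}div" % TEI):
        head = div.find("tei:head", NS)
        if head is None:
            continue
        head_text = "".join(head.itertext()).strip()

        # The section's own number: the n attribute if present, otherwise a
        # label at the very start of the head text.
        own_number = head.get("n", "").rstrip(".")
        if not own_number:
            first = SECTION_LABEL.match(head_text)
            own_number = first.group(1) if first else ""

        # Look in the head itself and in the direct paragraphs of this div.
        texts = [head_text]
        texts += ["".join(p.itertext()) for p in div.findall("tei:p", NS)]
        for text in texts:
            for match in SECTION_LABEL.finditer(text):
                if match.group(1) != own_number:
                    hits.append((head_text, match.group(1)))
    return hits
```

Calling `find_swallowed_headings` on the XML of one extracted label returns the sections that appear to contain another section's heading, which may help narrow down which pages trigger the behaviour. It does not detect the missing text after images.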

Could this be looked at please?

Extracting from pdfs #1279