Skip to content

PDF source file containing "pdf" before ".pdf" extension breaks naming of training files #776

@cboulanger

Description

@cboulanger

I have files named XXX.pdfa.pdf, which differentiate them as PDF/A files from the non-PDF/A version XXX.pdf. When fed into createTraining, it produces training files such as xxx.training.segmentation.tei.xmla.training.segmentation.tei.xml - note the xmla and the duplication of segmentation.tei.xml. Looks like a simple replacement of all occurrences of "pdf".

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugFrom Hemiptera and especially its suborder HeteropteraimplementedThe issue has been implemented

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions