Skip to content

page to text: rewrite#151

Merged
stweil merged 1 commit intoUB-Mannheim:masterfrom
bertsky:patch-3
Oct 23, 2022
Merged

page to text: rewrite#151
stweil merged 1 commit intoUB-Mannheim:masterfrom
bertsky:patch-3

Conversation

@bertsky
Copy link
Copy Markdown
Contributor

@bertsky bertsky commented Sep 14, 2022

  • supports recursive ReadingOrder (can be disabled via param order=document)
  • supports setting the hierarchy level to extract from (default level=highest behaves as before)
  • supports setting line/paragraph boundary strings (params lb and pb)

- supports recursive ReadingOrder (can be disabled via param order=document)
- supports setting the hierarchy level to extract from (default level=highest behaves as before)
- supports setting line/paragraph boundary strings (params lb and pb)
Copy link
Copy Markdown
Collaborator

@kba kba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a significant improvement over the bare-bones export previously. @stweil if you're okay with merging here, I'd like to include in the next ocrd_fileformat release.

@stweil stweil merged commit 693ca49 into UB-Mannheim:master Oct 23, 2022
@stweil
Copy link
Copy Markdown
Member

stweil commented Oct 23, 2022

So a new release here would also be useful. Should we make a new minor version or a bug fix version?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants