Add support for OCR'ing more languages #942

@wallace11

Hi there,
I see that although tesseract already supports over 100 languages, only a handful are available in docspell.
Would it be possible to add more languages to the OCR?
It looks like most of the work lies in configuring the language objects in https://github.com/eikek/docspell/blob/master/modules/common/src/main/scala/docspell/common/Language.scala, after which the UI needs to be adjusted accordingly.
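
To illustrate the kind of change I have in mind, here is a rough sketch. It is not copied from docspell: the trait layout, the field names `iso2` and `iso3`, the `all` list and the `fromString` helper are all assumptions for illustration, and the real `Language.scala` may be organised differently. It also assumes that the three-letter code is what selects tesseract's language data.

```scala
// Hypothetical sketch only: these names are assumptions, not docspell's actual code.
sealed trait Language {
  def iso2: String // two-letter code, e.g. for display in the UI
  def iso3: String // three-letter code, assumed to match tesseract's language data name
}

object Language {
  case object German extends Language { val iso2 = "de"; val iso3 = "deu" }
  case object English extends Language { val iso2 = "en"; val iso3 = "eng" }

  // Adding a language would then be one more case object ...
  case object Japanese extends Language { val iso2 = "ja"; val iso3 = "jpn" }

  // ... plus one more entry in the list of known languages.
  val all: List[Language] = List(German, English, Japanese)

  // Look up a language by either code, case-insensitively.
  def fromString(code: String): Either[String, Language] = {
    val lower = code.toLowerCase
    all
      .find(l => l.iso2 == lower || l.iso3 == lower)
      .toRight(s"Unsupported language: $code")
  }
}
```

With this sketch, `Language.fromString("jpn")` yields `Right(Japanese)`, and an unknown code yields a `Left` with an error message.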

If adding a new language is more complicated than that, would it be possible to add a contribution guide so that the community can take care of it via PRs?

Cheers.

Labels: documentation (Improvements or additions to documentation)
