Feature Request: Support dots.ocr #16161

@AbdullahMPrograms

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

dots.ocr is an OCR model released by rednote that surpasses Gemini 2.5 Pro on benchmarks such as OmniDocBench:
https://huggingface.co/rednote-hilab/dots.ocr

Motivation

dots.ocr seems to be a fantastic local model for VLM OCR purposes. In my own tests, the online dots.ocr demo transcribed my texts better than InternVL 1B and 2B, and it also performed better than LFM2-VL 0.45B and 1.6B for OCR.

Possible Implementation

No response
