PDFs suck.
If you want to build standout RAG apps, you need tooling that is tailored to your needs.
You should own your doc parsing infra.
Today we’re launching chunkr.ai on @ycombinator
ycombinator.com/launches/Mud-c…
Because of how good our training data was, our model performs consistently better in every document domain (finance, medical, etc.) than any other variant.
Benchmarked on 1,013 human-reviewed docs, outperforming AWS, Azure, Docling, and frontier VLMs, even on the hardest documents.
80.9 mAP@50 | 88.4 P | 85.6 R | 86.9 F1
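For anyone reading those numbers: a rough sketch of how the metrics relate, not our eval code. The function names and the example boxes are made up for illustration. mAP@50 counts a predicted layout box as a match only when its IoU with a ground-truth box is at least 0.5, and F1 is the harmonic mean of precision and recall.

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x0, y0, x1, y1)."""
    # Overlap width and height, clamped at zero when the boxes do not touch.
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (same scale as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical boxes: half-overlapping, so IoU is 1/3 and not a match at the 0.5 threshold.
pred, truth = (0, 0, 10, 10), (5, 0, 15, 10)
is_match_at_50 = iou(pred, truth) >= 0.5

# Plugging in the rounded P and R from the post.
f1 = f1_score(88.4, 85.6)
```

Plugging the rounded P and R in gives roughly 87.0, within about 0.1 of the reported F1. A gap that small is what you would expect from rounding the inputs, or from averaging F1 per class rather than computing it from the aggregate P and R.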
Today we're thrilled to announce the release of our newest layout-analysis model: chunkr-layout-1
Trained on millions of the hardest documents, across all verticals, chunkr-layout-1 is built to understand real documents, not just the clean ones.
Introducing chunkr-layout-1. Layout is where document intelligence starts, and we just leveled it up.
- Benched on 1,013 hand-tagged samples.
80.9 mAP@50 | 88.4 P | 85.6 R | 86.9 F1
- Beats AWS, Azure, Docling, Gemini 2.5 Pro, and OSS models
- Identifies 16 class labels
Live now!
I’m thrilled to announce our new VLMs, chunkr-parse-1 and chunkr-parse-1-thinking.
- Parses complex forms, tables & more
- Inline OCR (redlining and formulas)
- Multilingual (>100 languages)
- Beats AWS Textract, Gemini 2.5 Pro, Mistral at OCR
Live now on the Chunkr API
hello!
we are hiring @chunkrai
we are building best-in-class tools to help make the most out of documents. we're growing fast and are looking for a founding engineer to join!
if you
- like rust
- can handle features e2e
- are funny
let's talk!
ycombinator.com/companies/chun…