chunkr (@chunkrai) / X

chunkr

80 posts

chunkr

@chunkrai

vision based document parsing

San Francisco, CA

Joined January 2025

Pinned
chunkr
@chunkrai
Feb 26, 2025
PDFs suck. If you want to build standout RAG apps, you need tooling that is tailored to your needs. You should own your doc parsing infra. Today we’re launching chunkr.ai on @ycombinator ycombinator.com/launches/Mud-c…
24K
chunkr
@chunkrai
Oct 15, 2025
Replying to @chunkrai
chunkr-layout-1 and our other models are live now on our API! 🌐 Website: chunkr.ai 📷 Hugging Face Dataset: huggingface.co/datasets/Chunk… 📷Blog: chunkr.ai/blog/introduci…
355
chunkr
@chunkrai
Oct 15, 2025
Replying to @chunkrai
We can even parse floor plans/technical diagrams.
387
chunkr
@chunkrai
Oct 15, 2025
Replying to @chunkrai
Metrics don't tell the full story, vibe testing shows us in-practice, how good the model actually is. (our outputs in red)
122
chunkr
@chunkrai
Oct 15, 2025
Replying to @chunkrai
because of how good our training data was, our model has consistently better performance per document domain (finance, medical etc.) than any other variant.
112
chunkr
@chunkrai
Oct 15, 2025
Replying to @chunkrai
We also cover more class labels than any other variants
111
chunkr
@chunkrai
Oct 15, 2025
Replying to @chunkrai
Benchmarked on 1,013 human-reviewed docs, outperforming AWS, Azure, Docling, and frontier VLMs, even on the hardest documents. 80.9 mAP@50 | 88.4 P | 85.6 R | 86.9 F1
161
chunkr
@chunkrai
Oct 15, 2025
Today we're thrilled to announce the release of our newest layout-analysis model: chunkr-layout-1 Trained on millions of the hardest documents, across all verticals, chunkr-layout-1 is built to understand real documents, not just the clean ones.
5.5K
chunkr reposted
Ishaan Kapoor
@Ishaank1999
Oct 15, 2025
Introducing chunkr-layout-1. Layout is where document intelligence starts, we just leveled it up. - Benched on 1,013 hand-tagged samples. 80.9 mAP@50 | 88.4 P | 85.6 R | 86.9 F1 - Beats AWS, Azure, Docling, Gemini 2.5 Pro, OSS models - Identifies 16 class labels Live now!
28K
chunkr reposted
Ishaan Kapoor
@Ishaank1999
Oct 9, 2025
I’m thrilled to announce our new VLMs, chunkr-parse-1 and chunkr-parse-1-thinking. - Parses complex forms, tables & more - Inline OCR (redlining and formulas) - Multilingual (>100 languages) - Beats AWS Textract, Gemini 2.5 Pro, Mistral at OCR Live now on the Chunkr API
65K
chunkr reposted
Philip Kung
@philipkung
Aug 22, 2025
Sweetspot does that: it can auto-detect fields (including from scanned PDFs) and fill out forms of 30+ pages using context from your organization
00:00
Geoffrey Litt
@geoffreylitt
Aug 22, 2025
Cursor for filling out PDF forms
2.4K
chunkr
@chunkrai
Aug 17, 2025
webhooks, live now! docs.chunkr.ai/docs/webhooks/…
Ishaan Kapoor
@Ishaank1999
Aug 17, 2025
web hooks - live on @chunkrai now chad @notakhilesh99 shipped it in like a day docs below
1.2K
chunkr reposted
Ishaan Kapoor
@Ishaank1999
Aug 12, 2025
hello! we are hiring @chunkrai we are building best-in-class tools to help make the most out of documents. we're growing fast and are looking for a founding engineer to join! if you - like rust - can handle features e2e - are funny lets talk! ycombinator.com/companies/chun…
2.8K