Serverless on Azure · Powered by Cosmos DB

Turn Cosmos DB into
your RAG engine

Serverless RAG indexing on Cosmos DB. Scales linearly. Goes global.

Drop the limitations of Azure AI Search. RAG DB turns Cosmos DB into a full-featured vector database with 30+ file type ingestion, four search modes, and the global scale Cosmos was built for.

Any File (30+ types)

PDFDOCXAudioImagesEmailCSV

RAG.DB Engine

Ingest
Chunk
Embed
Key VaultManaged IdentityApp InsightsContainer Apps
Cosmos DB

Cosmos DB

Vector SearchFull-Text Search

The Problem

Cosmos DB is a great vector database.
But who builds the ingestion pipeline?

You chose Cosmos DB for its global scale and vector search. But getting your data in — that's the hard part.

0%

Unstructured data

PDFs, DOCX, audio, video, images, emails — you need high-fidelity Markdown extraction that preserves tables, layouts, and structure from the originals.

0

Built-in sync

Files get added, edited, and deleted in Blob Storage. Without an event-driven pipeline, your index drifts out of sync silently.

0

Search modes needed

Vector, hybrid, full-text, and semantic search. You need all four for production RAG — plus query rewriting for better recall.

N

Chunking strategies

Naive splitting destroys context. Great extraction is wasted without intelligent chunking that respects headings, tables, and section boundaries.

0+

Container limit

Cosmos DB caps at 500 containers per account. At scale, you need automatic provisioning across multiple accounts.

24/7

Monitoring & recovery

Stuck jobs, failed conversions, stale indexes. Without self-healing, your pipeline needs constant babysitting.

How It Works

From upload to search
in minutes

Any File

Any File

30+ types

Event Grid

Event Grid

Auto-detect

RAG.DB Engine

Chunk + Embed

Cosmos DB

Cosmos DB

Vector + Full-Text

Query API

4 search modes

Powered by Azure

Key VaultKey VaultManaged IdentityManaged IdentityApp InsightsApp InsightsContainer AppsContainer AppsService BusService BusContainer RegistryContainer Registry

Key Capabilities

The full RAG pipeline, out of the box

4 Search Modes

Vector, hybrid, full-text, and semantic — all included in every index. Plus AI-powered query rewriting that expands queries for better recall.

Per-Index Isolation

Every index gets its own Container App, Service Bus topic, and Cosmos DB container. Fully isolated — zero noisy neighbors, independent scaling.

Event-Driven Ingestion

Fully serverless. Drop files into Blob Storage, Event Grid triggers processing, Container Apps scale to zero when idle. Pay only for what you use.

30+ File Types

PDF, DOCX, audio, images, email, archives — converted via the open-source markitdown-pro library with OCR and transcription.

Zero-Secret Security

Managed identity, Key Vault, dual API keys with rotation, user delegation SAS tokens. No credentials in code.

Multi-Cosmos Scale-Out

Cosmos DB gives you global distribution out of the box. RAG DB scales linearly across accounts — no search unit limits, no partition caps.

Self-Healing Monitoring

CronJob monitors every 5 minutes. Detects stale indexes, recovers stuck runs, and retries failed files automatically.

Observability Built-In

Application Insights telemetry, custom histograms for indexing and query performance, Cosmos DB RU tracking per query.

Optimistic Concurrency

ETag-based concurrency control throughout the pipeline. No race conditions, no data corruption, even at scale.

Open Source

Powered by markitdown-pro

Our open-source conversion library turns 30+ file types into clean Markdown. It cascades through multiple extraction methods until it gets the best possible content.

Cascading Conversion Pipeline

Each file goes through up to 4 extraction methods. If one method can't extract enough content, the next one takes over automatically.

1

MarkItDown

Microsoft's base converter handles common formats natively

2

Unstructured.io

Advanced document parsing for complex layouts and tables

3

Azure Document Intelligence

OCR and structured extraction for scanned documents

4

GPT-4o-mini Vision

AI-powered OCR as a last resort for difficult content

30+ Supported File Types

From PDFs and Office docs to audio transcription, email extraction, and OCR on scanned images — one library handles it all.

Documents

PDF, DOCX, PPTX, XLSX, ODT, EPUB, RTF

Images + OCR

PNG, JPG, WEBP, HEIC, TIFF, SVG, GIF

Audio

WAV, OGG, FLAC, M4A, AAC, WMA, OPUS

Email

EML, PST (with attachment extraction)

Data

CSV, JSON, XML, TSV

Code

Python, JS, TS, Java, C#, Go, Rust, ...

Web

HTML, Markdown, reStructuredText

Archives

ZIP (recursive processing)

$ pip install markitdown-pro

MIT License · Python ≥ 3.12 · Published by Kinetic Solutions Group

See It In Action

One API to index and search

POST/indexes
{
"name": "contracts-q1",
"storageAccount": "acmestorage",
"blobContainer": "documents",
"blobFolder": "contracts/2026"
}
201 Created — Index provisioning started
POST/indexes/{indexId}/query/semantic
{
"query": "Q1 revenue growth projections",
"top_k": 5,
"results": [{
"score": 0.94,
"chunk": "Revenue grew 23% YoY in Q1...",
"name": "Q1-report.pdf",
"location": "https://...?sv=2024&se=...&sr=c&sp=r&sig=..."
}]
}
Semantic search + AI query rewriting — 52ms

30+

File types

4

Search modes

Global scale

Ready to turn Cosmos DB into your RAG engine?

Drop your files, RAG DB handles the rest — extraction, chunking, embeddings, sync, and search. Serverless, globally scalable, built on Cosmos DB.