Serverless on Azure · Powered by Cosmos DB

Turn Cosmos DB into
your RAG engine

Serverless RAG indexing on Cosmos DB. Scales linearly. Goes global.

Drop the limitations of Azure AI Search. RAG DB turns Cosmos DB into a full-featured vector database with 30+ file type ingestion, four search modes, and the global scale Cosmos was built for.

Read the Docs

Any File (30+ types)

PDFDOCXAudioImagesEmailCSV

RAG.DB Engine

Ingest

→

Chunk

→

Embed

Key Vault

Managed Identity

App Insights

Container Apps

Cosmos DB

Vector SearchFull-Text Search

The Problem

Cosmos DB is a great vector database.
But who builds the ingestion pipeline?

You chose Cosmos DB for its global scale and vector search. But getting your data in — that's the hard part.

Unstructured data

PDFs, DOCX, audio, video, images, emails — you need high-fidelity Markdown extraction that preserves tables, layouts, and structure from the originals.

Built-in sync

Files get added, edited, and deleted in Blob Storage. Without an event-driven pipeline, your index drifts out of sync silently.

Search modes needed

Vector, hybrid, full-text, and semantic search. You need all four for production RAG — plus query rewriting for better recall.

Chunking strategies

Naive splitting destroys context. Great extraction is wasted without intelligent chunking that respects headings, tables, and section boundaries.

Container limit

Cosmos DB caps at 500 containers per account. At scale, you need automatic provisioning across multiple accounts.

24/7

Monitoring & recovery

Stuck jobs, failed conversions, stale indexes. Without self-healing, your pipeline needs constant babysitting.

How It Works

From upload to search
in minutes

Any File

30+ types

Event Grid

Auto-detect

RAG.DB Engine

Chunk + Embed

Cosmos DB

Vector + Full-Text

Query API

4 search modes

Any File

30+ types

Event Grid

Auto-detect

RAG.DB Engine

Chunk + Embed

Cosmos DB

Vector + Full-Text

Query API

4 search modes

Key Vault

Managed Identity

App Insights

Container Apps

Service Bus

Container Registry

Key Capabilities

The full RAG pipeline, out of the box

4 Search Modes

Vector, hybrid, full-text, and semantic — all included in every index. Plus AI-powered query rewriting that expands queries for better recall.

Per-Index Isolation

Every index gets its own Container App, Service Bus topic, and Cosmos DB container. Fully isolated — zero noisy neighbors, independent scaling.

Event-Driven Ingestion

Fully serverless. Drop files into Blob Storage, Event Grid triggers processing, Container Apps scale to zero when idle. Pay only for what you use.

30+ File Types

PDF, DOCX, audio, images, email, archives — converted via the open-source markitdown-pro library with OCR and transcription.

Zero-Secret Security

Managed identity, Key Vault, dual API keys with rotation, user delegation SAS tokens. No credentials in code.

Multi-Cosmos Scale-Out

Cosmos DB gives you global distribution out of the box. RAG DB scales linearly across accounts — no search unit limits, no partition caps.

Self-Healing Monitoring

CronJob monitors every 5 minutes. Detects stale indexes, recovers stuck runs, and retries failed files automatically.

Observability Built-In

Application Insights telemetry, custom histograms for indexing and query performance, Cosmos DB RU tracking per query.

Optimistic Concurrency

ETag-based concurrency control throughout the pipeline. No race conditions, no data corruption, even at scale.

Open Source

Powered by markitdown-pro

Our open-source conversion library turns 30+ file types into clean Markdown. It cascades through multiple extraction methods until it gets the best possible content.

PyPI GitHub

Cascading Conversion Pipeline

Each file goes through up to 4 extraction methods. If one method can't extract enough content, the next one takes over automatically.

MarkItDown

Microsoft's base converter handles common formats natively

Unstructured.io

Advanced document parsing for complex layouts and tables

Azure Document Intelligence

OCR and structured extraction for scanned documents

GPT-4o-mini Vision

AI-powered OCR as a last resort for difficult content

30+ Supported File Types

From PDFs and Office docs to audio transcription, email extraction, and OCR on scanned images — one library handles it all.

Documents

PDF, DOCX, PPTX, XLSX, ODT, EPUB, RTF

Images + OCR

PNG, JPG, WEBP, HEIC, TIFF, SVG, GIF

Audio

WAV, OGG, FLAC, M4A, AAC, WMA, OPUS

EML, PST (with attachment extraction)

Data

CSV, JSON, XML, TSV

Code

Python, JS, TS, Java, C#, Go, Rust, ...

Web

HTML, Markdown, reStructuredText

One API to index and search

POST/indexes

{

"name": "contracts-q1",

"storageAccount": "acmestorage",

"blobContainer": "documents",

"blobFolder": "contracts/2026"

}

201 Created — Index provisioning started

POST/indexes/{indexId}/query/semantic

{

"query": "Q1 revenue growth projections",

"top_k": 5,

"results": [{

"score": 0.94,

"chunk": "Revenue grew 23% YoY in Q1...",

"name": "Q1-report.pdf",

"location": "https://...?sv=2024&se=...&sr=c&sp=r&sig=..."

}]

}

Semantic search + AI query rewriting — 52ms

30+

File types

Search modes

∞

Global scale

Ready to turn Cosmos DB into your RAG engine?

Drop your files, RAG DB handles the rest — extraction, chunking, embeddings, sync, and search. Serverless, globally scalable, built on Cosmos DB.

Soon on Azure Marketplace

Turn Cosmos DB intoyour RAG engine

Cosmos DB is a great vector database.But who builds the ingestion pipeline?

From upload to searchin minutes

Any File

Event Grid

RAG.DB Engine

Cosmos DB

Query API

Any File

Event Grid

RAG.DB Engine

Cosmos DB

Query API

The full RAG pipeline, out of the box

4 Search Modes

Per-Index Isolation

Event-Driven Ingestion

30+ File Types

Zero-Secret Security

Multi-Cosmos Scale-Out

Self-Healing Monitoring

Observability Built-In

Optimistic Concurrency

Powered by markitdown-pro

Cascading Conversion Pipeline

MarkItDown

Unstructured.io

Azure Document Intelligence

GPT-4o-mini Vision

30+ Supported File Types

One API to index and search

Ready to turn Cosmos DB into your RAG engine?

Turn Cosmos DB into
your RAG engine

Cosmos DB is a great vector database.
But who builds the ingestion pipeline?

From upload to search
in minutes