AI Gateway for Multi-Provider LLMs
Connect any AI-powered IDE or CLI tool through OmniRoute — free API gateway for unlimited coding.
📡 All agents connect via localhost:20128/v1 or cloud.omniroute.online/v1 — one config, unlimited models and quota
The biggest release ever — 31 new providers, MCP Server, A2A Protocol, and much more.
Auto-provision API keys programmatically with per-provider and per-account quotas. SHA-256 hashed, idempotent issuance, budget limits, and optional GitHub issue reporting.
Map model name patterns (glob) to specific combos. claude-sonnet* → code-combo, gpt-4o* → openai-combo. Dashboard UI with inline management.
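The glob mapping above can be sketched in a few lines. This is a minimal, hypothetical illustration (the combo names mirror the examples; the real routing table lives in the dashboard):

```python
from fnmatch import fnmatch

# Ordered pattern -> combo table; first matching pattern wins.
MODEL_COMBOS = [
    ("claude-sonnet*", "code-combo"),
    ("gpt-4o*", "openai-combo"),
]

def resolve_combo(model: str, default: str = "auto-combo") -> str:
    """Return the combo mapped to the first glob pattern the model matches."""
    for pattern, combo in MODEL_COMBOS:
        if fnmatch(model, pattern):
            return combo
    return default
```

Because patterns are checked in order, a more specific glob placed first takes precedence over a broader one.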
Self-healing routing with 6-factor scoring, 4 mode packs, bandit exploration, progressive cooldown, and probe-based provider re-admission. The AI manages your AI.
Full media generation at /dashboard/media: Image Generation, Video, Music, Audio Transcription (2GB uploads), and Text-to-Speech with multiple voice providers.
5 search integrations: Perplexity, Serper, Brave Search, Exa, and Tavily. Ground AI responses with real-time web data and search analytics dashboard.
Automatically refreshes model lists for connected providers every 24 hours. New models appear without manual updates. Configurable interval.
130+ provider SVG logos via @lobehub/icons. Applied across Dashboard, Providers, and Agents pages with SVG → PNG → generic fallback chain.
Per-API-key request limits with sliding-window enforcement. Set max_requests_per_day and max_requests_per_minute per key. HTTP 429 on exceed.
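A sliding window differs from a fixed window in that old requests age out continuously rather than all at once. A minimal sketch of the per-key enforcement (class name and internals are illustrative, not OmniRoute's actual code):

```python
import time
from collections import defaultdict, deque

class SlidingWindowLimiter:
    """Per-key sliding-window counter; allow() returns False once the key
    has used up its window (the gateway would answer HTTP 429)."""

    def __init__(self, max_requests, window_seconds):
        self.max_requests = max_requests
        self.window = window_seconds
        self.hits = defaultdict(deque)  # key -> timestamps of recent requests

    def allow(self, key, now=None):
        now = time.monotonic() if now is None else now
        q = self.hits[key]
        while q and now - q[0] >= self.window:  # evict entries outside the window
            q.popleft()
        if len(q) >= self.max_requests:
            return False
        q.append(now)
        return True
```

The same structure works for both the per-minute and per-day limits; a key simply gets two limiters with different windows.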
Install once, connect your providers, and code non-stop with automatic 4-tier fallback routing.
Run one command to install OmniRoute globally on your system.
Open Dashboard and add your API keys or OAuth connections. Free providers available!
Configure Claude Code, Cursor, Cline, or any OpenAI-compatible tool.
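Because the gateway speaks the OpenAI wire format, pointing a tool at it usually means changing only the base URL and key. A hedged sketch of what such a request looks like (the key is a placeholder; the payload is built but not sent):

```python
import json
import urllib.request

BASE_URL = "http://localhost:20128/v1"   # or https://cloud.omniroute.online/v1
API_KEY = "sk-your-registered-key"       # placeholder: use a key from the dashboard

def chat_request(model, prompt):
    """Build (but do not send) a standard /chat/completions request
    aimed at the gateway instead of a provider directly."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode()
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
    )
```

Most IDE tools expose the same two knobs as environment variables or settings fields (base URL and API key), so no per-tool plugin is needed.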
Run OmniRoute as a container with persistent data volume.
See how OmniRoute compares to alternatives.
| Feature | OmniRoute | LiteLLM |
|---|---|---|
| Providers Supported | 67+ | 100+ |
| Free Tier Routing | ✓ | ✗ |
| Dashboard UI | ✓ | ✗ |
| Semantic Cache | ✓ | ✗ |
| Circuit Breaker | ✓ | ✓ |
| 9 Routing Strategies | ✓ | ✗ |
| LLM Evaluations | ✓ | ✗ |
| Translator Playground | ✓ | ✗ |
| CLI Tools Manager | ✓ | ✗ |
| Custom Combos | ✓ | ✗ |
| MCP Server (16 tools) | ✓ | ✗ |
| A2A Protocol (Agent-to-Agent) | ✓ | ✗ |
| Desktop App | ✓ | ✗ |
| Usage Analytics | ✓ | ✓ |
| Cost Management | ✓ | ✓ |
| Docker Deploy | ✓ | ✓ |
| Media Playground (Image/Video/Audio/TTS) | ✓ | ✗ |
| Registered Keys API | ✓ | ✗ |
| Auto-Combo Engine (Self-Healing) | ✓ | ✗ |
| Web Search Providers (5) | ✓ | ✗ |
| Per-Model Combo Routing | ✓ | ✗ |
| 130+ Provider Icons (SVG) | ✓ | ✗ |
| Self-hosted & Free | ✓ | ✓ |
Connect via OAuth, API Key, or use completely free providers.
iFlow AI
Qwen Code
Kiro AI
Gemini CLI
Claude Code
OpenAI
Anthropic
Google AI
Antigravity
OpenClaw
Groq
DeepSeek
xAI (Grok)
Mistral
Together AI
Fireworks
Perplexity
Cerebras
Cohere
OpenRouter
GLM (ZhipuAI)
MiniMax
Moonshot
Nebius
NVIDIA
Sambanova
Novita AI
Chutes AI
Kluster AI
InfiniAI
Targon
AI21 Labs
Lambda
Lepton AI
Deepgram
Alibaba DashScope
LongCat AI
Pollinations
AI/ML API
Kimi Coding
Alibaba Coding
Ollama Cloud
Everything you need to route, monitor, and optimize your AI usage.
Subscription → API Key → Cheap → Free. Automatic switching when quota runs out, zero downtime.
When a model is unavailable, automatically falls back to sibling models in the same family before returning an error.
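The two fallback behaviors above compose into one loop: try the requested model across the tiers in order, then retry the whole chain with sibling models before surfacing an error. A hypothetical sketch (tier names come from the text; everything else is illustrative):

```python
# Tier order from the docs: Subscription -> API Key -> Cheap -> Free.
TIERS = ["subscription", "api_key", "cheap", "free"]

def route(model, providers, siblings, call):
    """providers: tier -> list of provider names.
    siblings: model -> same-family fallback models.
    call(provider, model): returns a response or raises on failure."""
    for candidate in [model, *siblings.get(model, [])]:
        for tier in TIERS:
            for provider in providers.get(tier, []):
                try:
                    return call(provider, candidate)
                except Exception:
                    continue  # quota exhausted / unavailable: keep falling back
    raise RuntimeError(f"all tiers and sibling models exhausted for {model}")
```

The key property is that a quota failure never reaches the client: it just shifts the request one step down the chain.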
Round-robin, weighted, random, strict-random, fill-first, P2C, cost-optimized, priority, and auto-combo with 6-factor scoring and self-healing. Per-combo or global.
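P2C ("power of two choices") is worth a quick illustration, since it gets most of the benefit of least-loaded routing without tracking a global ordering: sample two providers at random and send the request to the less loaded one. A minimal sketch, not OmniRoute's implementation:

```python
import random

def p2c_pick(providers, load):
    """Power-of-two-choices: sample two providers, take the less loaded one."""
    a, b = random.sample(providers, 2)
    return a if load[a] <= load[b] else b
```

Because a heavily loaded provider loses every pairing it appears in, load naturally spreads without any coordination.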
Auto-open and close per-provider with configurable cooldowns. Self-healing after failures.
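The open/cooldown/re-admit cycle can be sketched as a small state machine; thresholds and field names here are illustrative, not the gateway's actual defaults:

```python
import time

class CircuitBreaker:
    """Per-provider breaker: opens after `threshold` consecutive failures,
    re-admits traffic after `cooldown` seconds (the self-healing probe)."""

    def __init__(self, threshold=3, cooldown=30.0):
        self.threshold = threshold
        self.cooldown = cooldown
        self.failures = 0
        self.opened_at = None

    def available(self, now=None):
        now = time.monotonic() if now is None else now
        if self.opened_at is None:
            return True
        if now - self.opened_at >= self.cooldown:
            self.opened_at = None   # half-open: let a probe request through
            self.failures = 0
            return True
        return False

    def record(self, success, now=None):
        now = time.monotonic() if now is None else now
        if success:
            self.failures = 0
        else:
            self.failures += 1
            if self.failures >= self.threshold:
                self.opened_at = now  # open the circuit
```

While a provider's circuit is open, the router simply skips it and the fallback chain absorbs the traffic.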
Mutex + automatic rate-limiting for API key providers. Prevents quota exhaustion spikes.
5-second dedup window for duplicate requests. Saves tokens and prevents double-sends.
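Deduplication of this kind typically hashes the request body and remembers digests for the window's duration. A minimal sketch under that assumption:

```python
import hashlib
import time

class DedupWindow:
    """Drop byte-identical requests seen within the last `window` seconds."""

    def __init__(self, window=5.0):
        self.window = window
        self.seen = {}  # request digest -> timestamp of first sighting

    def is_duplicate(self, payload, now=None):
        now = time.monotonic() if now is None else now
        digest = hashlib.sha256(payload).hexdigest()
        # purge digests older than the window
        self.seen = {d: t for d, t in self.seen.items() if now - t < self.window}
        if digest in self.seen:
            return True
        self.seen[digest] = now
        return False
```

Hashing keeps memory bounded by the number of distinct requests in flight rather than their size.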
Two-tier cache (exact + semantic similarity) reduces cost and latency for repeated queries.
Seamless OpenAI ↔ Claude ↔ Gemini format translation. Use any model with any client.
Automatically parse and handle `<think>` tags from reasoning models like DeepSeek R1.
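The core of such handling is splitting the reasoning span from the visible answer. A minimal regex-based sketch (the real parser likely also handles streaming chunks, which this does not):

```python
import re

THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

def split_reasoning(text):
    """Separate <think>...</think> reasoning from the visible answer."""
    reasoning = "\n".join(m.strip() for m in THINK_RE.findall(text))
    answer = THINK_RE.sub("", text).strip()
    return reasoning, answer
```

Clients that don't understand reasoning output then receive only the answer, while the reasoning can be logged or surfaced separately.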
Built-in protection against prompt injection attacks on your AI endpoints.
Automatically selects the best model based on content type — coding, analysis, vision, summarization. 7 task types.
Live token consumption, reset countdown, and cost estimation per provider.
Full dashboard with tokens, costs, trends over time. Filter by provider, model, or period.
Track spending with editable per-model pricing. Set budget alerts and limits.
Dashboard with healthcheck per provider, token validation, and auto-refresh status.
Golden set testing with 4 match strategies: exact, contains, regex, custom JS function.
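The four match strategies reduce to a small dispatcher. A hedged sketch (the dashboard's custom strategy takes a JS function; a Python callable stands in for it here):

```python
import re

def matches(expected, actual, strategy="exact"):
    """Evaluate a model output against a golden answer using one of the
    four strategies named above; `custom` takes a predicate, not a string."""
    if strategy == "exact":
        return actual == expected
    if strategy == "contains":
        return expected in actual
    if strategy == "regex":
        return re.search(expected, actual) is not None
    if strategy == "custom":
        return bool(expected(actual))  # expected is a callable here
    raise ValueError(f"unknown strategy: {strategy}")
```

In practice `contains` and `regex` cover most golden-set cases; `custom` is the escape hatch for structured outputs like JSON.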
Built-in Chat Tester and Test Bench. Test any model in real-time from the dashboard.
Configure Claude Code, Codex, OpenClaw, Kilo, Droid, and Cline directly from the dashboard.
Create unlimited model combinations with 6 balancing strategies. Fine-tune routing per combo.
Add multiple accounts per provider. Round-robin load balancing and automatic failover.
Full media generation: Image (NanoBanana, SD WebUI, ComfyUI), Video, Music, Audio Transcription (2GB, Deepgram, AssemblyAI), and Text-to-Speech (ElevenLabs, Cartesia, PlayHT).
Sync config across devices via Cloudflare Workers. 300+ global edge locations.
Create scoped API keys with model restrictions, time-based access schedules, and enable/disable toggles.
Organize provider connections by environment (dev/prod). Accordion view with smart auto-switch.
Model Context Protocol server with 16 agent-control tools. 3 transport modes: stdio, SSE, Streamable HTTP.
Agent-to-Agent orchestration with JSON-RPC 2.0, task streaming, SSE heartbeat, and smart-routing skill.
Native Electron app for Windows, macOS, and Linux. System tray, auto-update, offline support, single-instance lock.
3-tier pricing resolution synced from LiteLLM. User overrides → synced → defaults. Opt-in via settings.
Monitor everything in real-time. Manage providers, combos, analytics, and more.
Run locally, in a container, on a VM, or at the edge.
Install globally for local development
Container with persistent data volume
Deploy on Akamai, AWS, DigitalOcean
Edge deployment with D1 database