bolt Free & Open Source AI Gateway

Never Stop Coding

AI Gateway for Multi-Provider LLMs

67+ Providers Smart Fallback Semantic Cache MCP Server Deploy Anywhere
terminal — omniroute setup
$ npm install -g omniroute
✓ OmniRoute installed globally
$ omniroute
🎉 Dashboard opens → Connect providers → Code!

🤖 Free AI Provider for your favorite coding agents

Connect any AI-powered IDE or CLI tool through OmniRoute — free API gateway for unlimited coding.

Claude Code
Claude Code
⭐ 67K+
Codex CLI
Codex CLI
⭐ 61K+
Gemini CLI
Gemini CLI
⭐ 95K+
Cline
Cline
⭐ 36K+
Cursor
Cursor
⭐ Editor
OpenCode
OpenCode
⭐ 10K+
Kilo Code
Kilo Code
⭐ 15K+
Roo Code
Roo Code
⭐ 26K+
Continue
Continue
⭐ 24K+
Factory Droid
Factory Droid
⭐ Tool

📡 All agents connect via localhost:20128/v1 or cloud.omniroute.online/v1one config, unlimited models and quota

v3.0.0

By the Numbers

67+
AI Providers
16
MCP Tools
9
Routing Strategies
926
Tests Passing
30
Languages
3
Protocols (MCP/A2A/ACP)

What's New in v3.0.0

The biggest release ever — 31 new providers, MCP Server, A2A Protocol, and much more.

vpn_key

Registered Keys API v3

Auto-provision API keys programmatically with per-provider and per-account quotas. SHA-256 hashed, idempotent issuance, budget limits, and optional GitHub issue reporting.

fork_right

Per-Model Combo Routing v3

Map model name patterns (glob) to specific combos. claude-sonnet* → code-combo, gpt-4o* → openai-combo. Dashboard UI with inline management.

auto_fix_high

Auto-Combo Engine v3

Self-healing routing with 6-factor scoring, 4 mode packs, bandit exploration, progressive cooldown, and probe-based provider re-admission. The AI manages your AI.

palette

Media Playground v3

Full media generation at /dashboard/media: Image Generation, Video, Music, Audio Transcription (2GB uploads), and Text-to-Speech with multiple voice providers.

travel_explore

Web Search Providers v3

5 search integrations: Perplexity, Serper, Brave Search, Exa, and Tavily. Ground AI responses with real-time web data and search analytics dashboard.

sync

Model Auto-Sync v3

Automatically refreshes model lists for connected providers every 24 hours. New models appear without manual updates. Configurable interval.

photo_library

Provider Icons v3

130+ provider SVG logos via @lobehub/icons. Applied across Dashboard, Providers, and Agents pages with SVG → PNG → generic fallback chain.

speed

Per-Key Rate Limits v3

Per-API-key request limits with sliding-window enforcement. Set max_requests_per_day and max_requests_per_minute per key. HTTP 429 on exceed.

How OmniRoute Works

Install once, connect providers, and code non-stop with automatic 4-tier fallback routing.

terminal
Claude Code
code
Codex CLI
smart_toy
Gemini CLI
edit_note
Cursor / Cline
apps
Any CLI Tool
hub OmniRoute
Tier 1 · Subscription
workspace_premium
Claude, Codex, Gemini
Tier 2 · API Key
vpn_key
OpenAI, Groq, DeepSeek
Tier 3 · Cheap
savings
GLM $0.6/1M, MiniMax
Tier 4 · Free
star
iFlow, Qwen, Kiro

Get Started in 60 Seconds

Install globally, connect your providers, and start coding with smart auto-fallback routing.

1

Install Globally

Run one command to install OmniRoute globally on your system.

$ npm install -g omniroute
2

Connect Providers

Open Dashboard and add your API keys or OAuth connections. Free providers available!

Dashboard → Providers → Connect
3

Point Your CLI

Configure Claude Code, Cursor, Cline, or any OpenAI-compatible tool.

http://localhost:20128/v1
🐳

Prefer Docker?

Run OmniRoute as a container with persistent data volume.

$ docker run -d -p 20128:20128 -v omniroute-data:/app/data diegosouzapw/omniroute:latest

Why Choose OmniRoute?

See how OmniRoute compares to alternatives.

Feature OmniRoute LiteLLM
Providers Supported 67+ 100+
Free Tier Routing
Dashboard UI
Semantic Cache
Circuit Breaker
9 Routing Strategies
LLM Evaluations
Translator Playground
CLI Tools Manager
Custom Combos
MCP Server (16 tools)
A2A Protocol (Agent-to-Agent)
Desktop App
Usage Analytics
Cost Management
Docker Deploy
Media Playground (Image/Video/Audio/TTS)
Registered Keys API
Auto-Combo Engine (Self-Healing)
Web Search Providers (5)
Per-Model Combo Routing
130+ Provider Icons (SVG)
Self-hosted & Free

67+ Providers Ready

Connect via OAuth, API Key, or use completely free providers.

Free (Unlimited or High Quota)
OAuth / Subscription
API Key
iFlow
iFlow AI
Qwen
Qwen Code
Kiro
Kiro AI
Gemini CLI
Gemini CLI
Claude Code
Claude Code
OpenAI
OpenAI
Anthropic
Anthropic
Google AI
Google AI
Antigravity
Antigravity
OpenClaw
OpenClaw
Groq
Groq
DeepSeek
DeepSeek
xAI
xAI (Grok)
Mistral
Mistral
Together AI
Together AI
Fireworks
Fireworks
Perplexity
Perplexity
Cerebras
Cerebras
Cohere
Cohere
OpenRouter
OpenRouter
GLM
GLM (ZhipuAI)
MiniMax
MiniMax
Moonshot
Moonshot
Nebius
Nebius
NVIDIA
NVIDIA
SiliconFlow
SiliconFlow
Sambanova
Sambanova
Novita
Novita AI
Chutes
Chutes AI
Kluster
Kluster AI
InfiniAI
InfiniAI
Targon
Targon
AI21
AI21 Labs
Lambda
Lambda
Lepton
Lepton AI
Deepgram
Deepgram
AssemblyAI
AssemblyAI
NanoBanana
NanoBanana
HuggingFace
HuggingFace
Vertex AI
Vertex AI
Alibaba
Alibaba DashScope
LongCat
LongCat AI
Pollinations
Pollinations
Cloudflare AI
Cloudflare AI
Scaleway
Scaleway
AI/ML API
AI/ML API
Puter AI
Puter AI
OpenCode Zen
OpenCode Zen
OpenCode Go
OpenCode Go
Kimi Coding
Kimi Coding
Alibaba Coding
Alibaba Coding
ElevenLabs
ElevenLabs
Cartesia
Cartesia
PlayHT
PlayHT
Ollama Cloud
Ollama Cloud

Powerful Features

Everything you need to route, monitor, and optimize your AI usage.

route Routing & Reliability
swap_vert

Smart 4-Tier Fallback

Subscription → API Key → Cheap → Free. Automatic switching when quota runs out, zero downtime.

account_tree

Intra-Family Model Fallback New

When a model is unavailable, automatically falls back to sibling models in the same family before returning an error.

balance

9 Routing Strategies + Auto-Combo v3

Round-robin, weighted, random, strict-random, fill-first, P2C, cost-optimized, priority, and auto-combo with 6-factor scoring and self-healing. Per-combo or global.

offline_bolt

Circuit Breaker

Auto-open and close per-provider with configurable cooldowns. Self-healing after failures.

shield

Anti-Thundering Herd

Mutex + automatic rate-limiting for API key providers. Prevents quota exhaustion spikes.

fingerprint

Request Idempotency

5-second dedup window for duplicate requests. Saves tokens and prevents double-sends.

psychology Intelligence
cached

Semantic Cache New

Two-tier cache (exact + semantic similarity) reduces cost and latency for repeated queries.

translate

Format Translation

Seamless OpenAI ↔ Claude ↔ Gemini format translation. Use any model with any client.

psychology_alt

Think Tag Parsing

Automatically parse and handle <think> tags from reasoning models like DeepSeek R1.

security

Prompt Injection Guard New

Built-in protection against prompt injection attacks on your AI endpoints.

route

Task-Aware Smart Routing New

Automatically selects the best model based on content type — coding, analysis, vision, summarization. 7 task types.

monitoring Monitoring & Analytics
timer

Real-Time Quota Tracking

Live token consumption, reset countdown, and cost estimation per provider.

analytics

Usage Analytics New

Full dashboard with tokens, costs, trends over time. Filter by provider, model, or period.

payments

Costs & Budget New

Track spending with editable per-model pricing. Set budget alerts and limits.

health_and_safety

Health Monitor New

Dashboard with healthcheck per provider, token validation, and auto-refresh status.

science

LLM Evaluations New

Golden set testing with 4 match strategies: exact, contains, regex, custom JS function.

code Developer Experience
chat

Translator Playground New

Built-in Chat Tester and Test Bench. Test any model in real-time from the dashboard.

build

CLI Tools Manager New

Configure Claude Code, Codex, OpenClaw, Kilo, Droid, and Cline directly from the dashboard.

tune

Custom Combos New

Create unlimited model combinations with 6 balancing strategies. Fine-tune routing per combo.

group

Multi-Account Support

Add multiple accounts per provider. Round-robin load balancing and automatic failover.

image

Media Playground v3

Full media generation: Image (NanoBanana, SD WebUI, ComfyUI), Video, Music, Audio Transcription (2GB, Deepgram, AssemblyAI), and Text-to-Speech (ElevenLabs, Cartesia, PlayHT).

cloud_sync

Cloud Sync

Sync config across devices via Cloudflare Workers. 300+ global edge locations.

key

API Key Access Controls New

Create scoped API keys with model restrictions, time-based access schedules, and enable/disable toggles.

folder_open

Connection Groups New

Organize provider connections by environment (dev/prod). Accordion view with smart auto-switch.

hub Protocol & Integration
extension

MCP Server (16 Tools) New

Model Context Protocol server with 16 agent-control tools. 3 transport modes: stdio, SSE, Streamable HTTP.

lan

A2A v0.3 Protocol New

Agent-to-Agent orchestration with JSON-RPC 2.0, task streaming, SSE heartbeat, and smart-routing skill.

desktop_windows

Desktop App New

Native Electron app for Windows, macOS, and Linux. System tray, auto-update, offline support, single-instance lock.

price_check

External Pricing Sync New

3-tier pricing resolution synced from LiteLLM. User overrides → synced → defaults. Opt-in via settings.

Beautiful Dashboard

Monitor everything in real-time. Manage providers, combos, analytics, and more.

OmniRoute Dashboard

Deploy Anywhere

Run locally, in a container, on a VM, or at the edge.

📦

npm

Install globally for local development

npm install -g omniroute
🐳

Docker

Container with persistent data volume

docker run -d diegosouzapw/omniroute
🖥️

VM / VPS

Deploy on Akamai, AWS, DigitalOcean

nginx → Docker → omniroute

Cloudflare Workers

Edge deployment with D1 database

wrangler deploy