omniroute
OmniRoute: Unified AI proxy. Route any LLM through one endpoint.
Your universal API proxy: one endpoint, 160+ providers, zero downtime. Now with MCP Server (29 tools), A2A Protocol, Memory/Skills Systems & Electron Desktop App.
Chat Completions • Embeddings • Image Generation • Video • Music • Audio • Reranking • Web Search • MCP Server • A2A Protocol • 100% TypeScript
Available in: English | Português (Brasil) | Español | Français | Italiano | Русский | 中文 (简体) | Deutsch | हिन्दी | ไทย | Українська | العربية | 日本語 | Tiếng Việt | Български | Dansk | Suomi | עברית | Magyar | Bahasa Indonesia | 한국어 | Bahasa Melayu | Nederlands | Norsk | Português (Portugal) | Română | Polski | Slovenčina | Svenska | Filipino | Čeština
Dashboard pages shown in the screenshots (images not available here): Providers, Combos, Analytics, Health, Translator, Settings, CLI Tools, Usage Logs, Endpoints.
Connect any AI-powered IDE or CLI tool through OmniRoute, a free API gateway for unlimited coding.
| Agent | GitHub stars |
|---|---|
| OpenClaw | 205K |
| NanoBot | 20.9K |
| PicoClaw | 14.6K |
| ZeroClaw | 9.9K |
| IronClaw | 2.1K |
| OpenCode | 106K |
| Codex CLI | 60.8K |
| Claude Code | 67.3K |
| Gemini CLI | 94.7K |
| Kilo Code | 15.5K |
All agents connect via http://localhost:20128/v1 or http://cloud.omniroute.online/v1: one config, unlimited models and quota.
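As a rough illustration of the single-endpoint idea, the sketch below builds an OpenAI-style chat completion request against the local base URL. It assumes the gateway serves the usual `/chat/completions` path under `/v1` and accepts a bearer key; the helper name `buildChatRequest`, the key value, and the model ID are placeholders for illustration, not documented OmniRoute API.

```typescript
// Hypothetical helper: builds (but does not send) an OpenAI-style request.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface BuiltRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

function buildChatRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  messages: ChatMessage[],
): BuiltRequest {
  // Trim trailing slashes so the base URL and path join cleanly.
  const url = `${baseUrl.replace(/\/+$/, "")}/chat/completions`;
  return {
    url,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      // stream is set explicitly so the response is a single JSON body.
      body: JSON.stringify({ model, messages, stream: false }),
    },
  };
}

const request = buildChatRequest(
  "http://localhost:20128/v1",
  "YOUR_API_KEY", // placeholder, not a real key
  "example/model-id", // placeholder model ID (assumption)
  [{ role: "user", content: "Hello" }],
);
// To send it: const res = await fetch(request.url, request.init);
```

Switching to the cloud endpoint would only mean passing the other base URL; the rest of the request stays the same under these assumptions.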
Stop wasting money and hitting limits:
OmniRoute solves this:
Join our community! WhatsApp Group: get help, share tips, and stay updated.
When opening an issue, please run the system-info command and attach the generated file:
npm run system-info
This generates a system-info.txt with your Node.js version, OmniRoute version, OS details, installed CLI tools (qoder, gemini, claude, codex, antigravity, droid, etc.), Docker/PM2 status, and system packages: everything we need to reproduce your issue quickly. Attach the file directly to your GitHub issue.
┌─────────────┐
│  Your CLI   │  (Claude Code, Codex, Gemini CLI, OpenClaw, Cursor, Cline...)
│   Tool      │
└──────┬──────┘
       │ http://localhost:20128/v1
       ↓
┌─────────────────────────────────────────┐
│        OmniRoute (Smart Router)         │
│  • Format translation (OpenAI ↔ Claude) │
│  • Quota tracking + Embeddings + Images │
│  • Auto token refresh                   │
└──────┬──────────────────────────────────┘
       │
       ├─→ [Tier 1: SUBSCRIPTION] Claude Code, Codex, Gemini CLI
       │   ↓ quota exhausted
       ├─→ [Tier 2: API KEY] DeepSeek, Groq, xAI, Mistral, NVIDIA NIM, etc.
       │   ↓ budget limit
       ├─→ [Tier 3: CHEAP] GLM ($0.6/1M), MiniMax ($0.2/1M)
       │   ↓ budget limit
       └─→ [Tier 4: FREE] Qoder, Qwen, Kiro (unlimited)

Result: Never stop coding, minimal cost
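The tiered flow in the diagram can be read as an ordered fallback chain. The following is a minimal sketch of that idea only, not OmniRoute's actual router: the `Tier` type, the `routeWithFallback` function, and the stub providers are all hypothetical, and a real router would also need to tell quota or budget errors apart from other failures.

```typescript
// Hypothetical type: a tier wraps any provider call behind one signature.
type Tier = {
  name: string;
  call: (prompt: string) => Promise<string>;
};

// Try each tier in order; move to the next when one throws
// (for example on "quota exhausted" or "budget limit").
async function routeWithFallback(
  tiers: Tier[],
  prompt: string,
): Promise<{ tier: string; output: string }> {
  const failures: string[] = [];
  for (const tier of tiers) {
    try {
      const output = await tier.call(prompt);
      return { tier: tier.name, output };
    } catch (err) {
      const reason = err instanceof Error ? err.message : String(err);
      failures.push(`${tier.name}: ${reason}`);
    }
  }
  throw new Error(`All tiers failed (${failures.join("; ")})`);
}

// Stub providers standing in for real upstream calls (assumptions).
const tiers: Tier[] = [
  { name: "subscription", call: async () => { throw new Error("quota exhausted"); } },
  { name: "api-key", call: async () => { throw new Error("budget limit"); } },
  { name: "cheap", call: async (p) => `cheap tier answered: ${p}` },
  { name: "free", call: async (p) => `free tier answered: ${p}` },
];
```

With these stubs, `routeWithFallback(tiers, "hi")` skips the first two tiers and resolves from the third, so the free tier is never called.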
Every developer using AI tools faces these problems daily. OmniRoute was built to solve them all, from cost overruns to regional blocks, from broken OAuth flows to protocol operations and enterprise observability.
Developers pay $20-200/month for Claude Pro, Codex Pro, or GitHub Copilot. Even when paying, quota has a ceiling: 5h of usage, weekly limits, or per-minute rate limits. Mid-session, the provider stops responding and the developer loses flow and productivity.
How OmniRoute solves it:
- … `PROVIDER_LIMITS_SYNC_INTERVAL_MINUTES=70`) with manual refresh available in the UI

OpenAI uses one format, Claude (Anthropic) uses another, Gemini yet another. If a dev wants to test models from different providers or fall back between them, they need to reconfigure SDKs, change endpoints, and deal with incompatible formats. Custom providers (FriendLI, NIM) have non-standard model endpoints.
How OmniRoute solves it:
- `http://localhost:20128/v1` serves as proxy for all 160+ providers
- … `x_groq`, `usage_breakdown`, `service_tier`) that break OpenAI SDK v1.83+
- `developer` → `system` for non-OpenAI providers; `system` → `user` for GLM/ERNIE
- … `<think>` blocks from models like DeepSeek R1 into standardized `reasoning_content`
- `json_schema` → `responseMimeType`/`responseSchema` automatic conversion
- `stream` defaults to `false`: aligns with OpenAI spec, avoiding unexpected SSE in Python/Rust/Go SDKs

Providers like OpenAI/Codex block access from certain geographic regions. Users get errors like `unsupported_country_region_territory` during OAuth and API connections. This is especially frustrating for developers from developing countries.
How OmniRoute solves it:
- … `unsupported_country_region_territory`
- … `wreq-js` to bypass bot detection

Not everyone can pay $20-200/month for AI subscriptions. Students, devs from emerging countries, hobbyists, and freelancers need access to quality models at zero cost.
How OmniRoute solves it:
- … `api.ollama.com` with free "Light usage" tier; use `ollamacloud/<model>` prefix
- … `gc/gemini-3-flash` → `if/kimi-k2-thinking` → `qw/qwen3-coder-plus` = $0/month with zero downtime

When exposing an AI gateway to the network (LAN, VPS, Docker), anyone with the address can consume the developer's tokens/quota. Without protection, APIs are vulnerable to misuse, prompt injection, and abuse.
How OmniRoute solves it:
- … `/dashboard/api-manager` page
- … `openai/*`, wildcard patterns), with Allow All/Restrict toggle
- … `/v1/models` and block specific providers from the listing
- … `withAuth` middleware + CSRF tokens

AI providers can become unstable, return 5xx errors, or hit temporary rate limits. If a dev depends on a single provider, they're interrupted. Without circuit breakers, repeated retries can crash the application.
How OmniRoute solves it:
- … `Retry-After` hints and exponential backoff
- … `429` rate limits stay in Connection Cooldown

Developers use Cursor, Claude Code, Codex CLI, OpenClaw, Gemini CLI, Kilo Code, and more. Each tool needs a different config (API endpoint, key, model). Reconfiguring when switching providers or models is a waste of time.
How OmniRoute solves it:
- … `chatLanguageModels.json` for VS Code with bulk model selection
- … `http://localhost:20128/v1` once, access 160+ providers

Claude Code, Codex, Gemini CLI, and Copilot all use OAuth 2.0 with expiring tokens. Developers need to re-authenticate constantly and deal with `client_secret is missing`, `redirect_uri_mismatch`, and failures on remote servers. OAuth on LAN/VPS is particularly problematic.
How OmniRoute solves it:
- … `redirect_uri` + manual URL mode for remote servers
- … `window.location.origin` for reverse proxy compatibility

Developers use multiple paid providers but have no unified view of spending. Each provider has its own billing dashboard, but there's no consolidated view. Unexpected costs can pile up.
How OmniRoute solves it:
When a call fails, the dev doesn't know if it was a rate limit, expired token, wrong format, or provider error. Logs are fragmented across different terminals. Without observability, debugging is trial-and-error.
How OmniRoute solves it:
docker pull diegosouzapw/omniroute