Documentation

Agent Gateway

Agentgateway is an opensource LLM, MCP, and A2A gateway hosted as part of the Linux Foundation. It focuses on implementing the missing pieces not found in traditional API gateways to support MCP and Agent workloads. The project emphasizes enterprise-grade security, observability, resiliency, reliability, and multi-tenancy features. Agentgateway is built to be the most performant, reliable, and mature LLM/MCP gateway on the market.

ℹ️

New to agentgateway? Get started in minutes with the Quickstart guide!

Figure: agentgateway works across compute environments to provide connectivity to various agentic tools, including MCP servers, agents, and OpenAPI endpoints.

Why Agentgateway?

Traditional API gateways and reverse proxies aren’t built for MCP and A2A, and adapting them isn’t straightforward. These systems are optimized for stateless REST-style interactions—one request in, pick a backend, one response out—with no session context or ongoing connection state.

MCP and A2A are fundamentally different:

Traditional API Gateway	Agentgateway
Stateless request/response	Stateful JSON-RPC sessions with long-lived connections
One request → one backend	Session fan-out across multiple MCP servers
Client-initiated only	Bidirectional: servers can push events (SSE) to clients
Simple routing by path/header	Protocol-aware routing that understands JSON-RPC message bodies
Static backend mapping	Dynamic tool virtualization on a per-client basis

Challenges traditional gateways can’t solve

Multiplexing & fan-out: A single client request like “list available tools” needs to fan out across multiple backend MCP servers, aggregate responses, and return a unified result.
Server-initiated events: MCP servers can push real-time updates via Server-Sent Events (SSE) that must be properly routed back through the client session.
Protocol negotiation: Graceful handling of protocol upgrades and fallbacks as MCP/A2A specs evolve.
Per-session authorization: Different clients may have access to different tools, requiring dynamic adjustment of what gets exposed.
Tool poisoning protection: Defense against direct tampering, shadowing, and rug-pull attacks on tools.

Built for performance

Agentgateway is built in Rust because performance and memory safety are non-negotiable for stateful, long-lived connections and fan-out patterns. Every millisecond and megabyte counts when managing concurrent sessions across multiple backend servers.

Features

Agentgateway is a unified data plane for managing agent connectivity, built in Rust for high performance and reliability. It supports agent protocols including MCP and A2A, and provides a unified interface for LLM consumption.

LLM Gateway

Route traffic to major LLM providers through a unified OpenAI-compatible API. Seamlessly switch between providers without changing your application code.

Provider	Chat Completions	Streaming
OpenAI / Azure OpenAI	✅	✅
Anthropic	✅	✅
Google Gemini	✅	✅
Google Vertex AI	✅	✅
Amazon Bedrock	✅	✅

OpenAI-compatible providers

Don’t see your provider? Many LLMs expose OpenAI-compatible APIs. Agentgateway can route to any provider that supports the OpenAI API format, including:

Cohere, Mistral, Groq, Together AI, Fireworks
Ollama, LM Studio, vLLM, llama.cpp (local models)
Any custom or self-hosted endpoint with OpenAI-compatible /v1/chat/completions

Self-hosted models & Inference routing

Running your own models on GPU infrastructure? Agentgateway implements the Kubernetes Inference Gateway extensions for intelligent routing to local LLM workloads. Route based on:

GPU & KV cache utilization — Send requests to the least-loaded model
Prompt criticality — Prioritize high-priority requests
LoRA adapters — Route to models with specific fine-tuned adapters
Work queue depth — Avoid overloaded inference servers

MCP Gateway

Connect LLMs to tools and external data sources using the Model Context Protocol (MCP). Agentgateway provides:

Tool federation: Aggregate multiple MCP servers behind a single endpoint
Protocol support: stdio, HTTP/SSE, and Streamable HTTP transports
OpenAPI integration: Expose existing REST APIs as MCP-native tools
Authentication & authorization: Built-in MCP auth spec compliance with OAuth providers (Auth0, Keycloak)

A2A Gateway

Enable secure communication between AI agents using the Agent-to-Agent (A2A) protocol. Agents can:

Discover each other’s capabilities
Negotiate interaction modalities (text, forms, media)
Collaborate on long-running tasks
Operate without exposing internal state or tools

Security & Observability

Authentication: JWT, API keys, basic auth, MCP auth spec
Authorization: Fine-grained RBAC with Cedar policy engine
Traffic policies: Rate limiting, CORS, TLS, external authz
Observability: Built-in OpenTelemetry metrics, logs, and distributed tracing

Platform Agnostic

Run agentgateway anywhere—bare metal, VMs, containers, or Kubernetes. Conformant to the Kubernetes Gateway API with support for HTTPRoute, GRPCRoute, TCPRoute, and TLSRoute.

Where Next?

Quickstart About Deploy on Kubernetes LLM consumption Inference routing MCP connectivity Agent connectivity Configuration Reference FAQs