GPU-Bridge is a unified AI API that routes across 6 independent GPU backends with automatic failover. 30 services, 99 models covering LLM, image, video, audio, vision, documents, and search. AI agents pay per request with x402 USDC on Base. Developers use API key + prepaid credits.

Three methods: (1) x402 USDC direct on Base for AI agents (no account needed); (2) Crypto top-up with USDC (0.5% fee); (3) Credit card via Stripe (from $10 with bonus credits). Volume discounts of 5-15% based on usage.

Can AI agents use this autonomously?

Yes. AI agents with USDC on Base can call any endpoint without registration using x402. Payment settles instantly on-chain. MCP tools are also available for model-native integration.

Automatic failover across 6 backends

One Endpoint. Six Backends.
Zero Provider Lock-In.

Call one API for LLM, image, video, audio, vision, OCR, embeddings, reranking, and docs. GPU-Bridge routes across 6 backends with automatic failover. AI agents pay per request with x402 on Base.

Start Free with API Key See Agent Flow

x402 ready MCP available No signup for agents

# GPU-Bridge handles routing and failover
curl -X POST https://api.gpubridge.io/run \
  -H "Authorization: Bearer gpub_live_..." \
  -H "X-Priority: fast" \
  -H "Idempotency-Key: req_7f3a" \
  -d '{"service":"llm-4090","input":{"prompt":"Summarize this"}}'

# No account. No API key. Pay per request with x402.
curl -X POST https://api.gpubridge.io/run \
  -H "X-Payment: <base64 tx proof>" \
  -H "Content-Type: application/json" \
  -d '{"service":"ocr-4090","input":{"image_url":"https://..."}}'

# MCP tools for model-native integration

gpu_run       // Execute any AI service
gpu_catalog   // Browse 30 services + 99 models
gpu_status    // Check job progress
gpu_balance   // Check credit balance
gpu_estimate  // Estimate cost before running

Models

Services

<1s

LLM Latency

Backends

Auto

Failover

Pre-validated

Before Execution

Why GPU-Bridge instead of going direct?

Direct providers are great for one backend. GPU-Bridge is for teams that want one integration, automatic failover, and agent-native payments.

	GPU-Bridge	Direct provider
One API across multiple backends	Yes	No
Automatic failover when a provider degrades	Yes	You build it
Hide provider choice from your users	Yes	No
x402 pay-per-request for AI agents	Yes	No
MCP tools for model-native integration	Yes	Varies
Upfront pricing + auto refund for credits	Yes	Varies

Your app or agent
↓
GPU-Bridge API — routing + failover + payments
↓

Backend A Backend B Backend C Backend D Backend E Backend F

Let your agent buy compute by itself

If your agent can send HTTP and hold USDC on Base, it can pay for inference directly with x402. No signup, no API key. Or connect through MCP.

x402 Direct Payment Permissionless

Send USDC on Base, include tx proof as X-Payment header. Instant on-chain settlement.

Pay per request with USDC
No registration required
OFAC-screened addresses
Same POST /run endpoint
~$0.001 gas per request

MCP Integration Native Tools

Connect GPU-Bridge as an MCP server. Your LLM gets native tools for AI compute.

gpu_run gpu_catalog gpu_status gpu_balance gpu_estimate

One contract for the whole AI stack

Same POST /run endpoint, same auth, same status flow — across every modality

Text

LLM + Embeddings

33 LLM models across 6 backends. DeepSeek, Qwen3, Llama 4, Kimi K2.5, GLM-5, MiniMax. Embeddings via BGE-M3, Qwen3-Embedding. Sub-second responses with smart routing.

Image & Video

Generation & Editing

FLUX 2, Imagen 4, Seedream, SD 3.5. Video from text or image. ControlNet, inpainting, outpainting, portraits, stickers, and product ads.

Audio

Speech & Music

Whisper STT in under 1 second. TTS with 40+ voices. Music generation. Voice cloning. Speaker diarization.

Vision

Understanding

Image captioning, visual Q&A, CLIP interrogation, OCR, segmentation (SAM 2), and NSFW detection.

Documents & Search

Parsing & Reranking

PDF/DOCX/PPTX to structured markdown. Jina reranking for RAG. Background removal, face restoration, video upscale to 4K.

Infrastructure

Smart Routing

Multi-backend failover with circuit breakers. X-Priority: fast|cheap header. Idempotency keys. Webhook delivery with 3x retry.

Predictable Pricing. Automatic Refunds.

Estimate cost before you send the request. No idle cost. No hidden fees.

LLM

from $0.003/req

Sub-second · 33 models

Image Gen

from $0.003/img

FLUX 2 · Imagen 4 · Seedream

Video

from $0.30/clip

Text or image to video

Everything Else

from $0.001/req

STT · TTS · OCR · Embed · 26 more

Credit-based requests are refunded automatically on failure. x402 requests are pre-validated before payment is consumed.

Volume discounts: 5% at $100, 10% at $500, 15% at $1,000+. Full catalog with all 30 services →

Two ways to start

Choose the path that matches how you build.

Developers API Key + Credits

For teams integrating GPU-Bridge into their apps.

Register with email. Get your API key instantly. No credit card.

Add credits from $10 via card or USDC. Bonus on larger packages.

POST /run with your service and input. Smart routing handles the rest.

AI Agents x402 USDC on Base

For autonomous agents that can hold USDC.

Hold USDC on Base in your agent's wallet.

Send request with X-Payment header containing tx proof.

Get results. Payment settles on-chain. No signup needed.

Live system status Dedicated hardware in Ashburn, VA Wyoming LLC · Stripe PCI DSS OFAC-screened addresses TLS 1.3 · Encrypted at rest

Questions teams ask before shipping

What happens if a backend is unavailable?

GPU-Bridge routes every job across multiple independent GPU backends. If the primary backend fails or times out, your request is automatically retried on the next available backend. Circuit breakers prevent routing to degraded backends. Credit-based accounts receive automatic refunds on failure. x402 requests are pre-validated before payment is consumed. Zero manual intervention needed.

How do payments work?

Three options: (1) x402 USDC direct on Base — for AI agents, no account needed, pay per request with on-chain tx proof; (2) Crypto top-up — register an account, add credits with USDC (0.5% fee); (3) Credit card via Stripe — register with email, buy credits from $10 with bonus on larger packages. Volume discounts of 5-15% apply automatically.

How fast are responses?

LLM inference completes in under 1 second. Image generation takes 2-15 seconds. Video generation takes 60-300 seconds. Speech-to-text is faster than real-time. Most utility services complete in 2-10 seconds.

Can AI agents use this without registration?

Yes. Any AI agent with USDC on Base can call any GPU-Bridge endpoint without an account. Include an x402 payment header and payment settles instantly on-chain. MCP tools are also available for model-native integration.

Are there spending limits?

Yes. Every account has a configurable daily spending limit (default $50/day). Adjust via POST /account/spending-limit. Range: $1 to $10,000 per day. Prevents runaway costs from automated workflows.

Ship once. Route everywhere.

Start free with an API key, or call GPU-Bridge today with x402 if you're building autonomous agents.

Start Free with API Key Read the Docs