Updated May 2026 · Prices verified from official docs

DeepSeek Pricing
Made Simple

Web chat is 100% free. API starts at $0.14/1M tokens — up to 100× cheaper than GPT-5.5. 5 million free tokens for new developers. No monthly subscription required.

Start for Free → Calculate My Cost
🎁
Web Chat$0 / forever
💨
V4-Flash Input$0.14 / 1M
V4-Pro (promo)$0.435 / 1M
🧠
R1 Reasoning$0.55 / 1M
V4-Flash Input $0.14 / 1M (cache miss)
V4-Flash Cache Hit $0.014 / 1M (90% off)
V4-Flash Output $0.28 / 1M
V4-Pro Input $1.74 / 1M regular
V4-Pro Promo 75% OFF until May 31, 2026
V3.2 Input $0.28 / 1M
R1 Input $0.55 / 1M
Web Chat FREE — no subscription
New API Accounts 5M free tokens
Legacy Aliases Retire July 24, 2026
V4-Flash Input $0.14 / 1M (cache miss)
V4-Flash Cache Hit $0.014 / 1M (90% off)
V4-Flash Output $0.28 / 1M
V4-Pro Input $1.74 / 1M regular
V4-Pro Promo 75% OFF until May 31, 2026
V3.2 Input $0.28 / 1M
R1 Input $0.55 / 1M
Web Chat FREE — no subscription
New API Accounts 5M free tokens
Legacy Aliases Retire July 24, 2026
DeepSeek Pricing Free

What's Completely Free

No credit card. No trial. No limit on chat usage. DeepSeek's web and mobile products are permanently free for individual users.

💬
Web Chat

Full access to DeepSeek V4-Flash (Instant Mode) and V4-Pro (Expert Mode) at chat.deepseek.com. No usage limits, no rate restrictions for normal use.

Unlimited · Free Forever
📱
Mobile Apps

Free iOS and Android apps with voice input, file uploads, image understanding, conversation history sync, and DeepThink reasoning mode. No in-app purchases needed.

iOS & Android · Free
🎁
API Free Credits

New API accounts at platform.deepseek.com receive 5 million tokens in free credits. No credit card required. Valid for 30 days — enough for ~2,500–5,000 test calls.

5M Tokens · No Card
🤗
Open Weights (Self-Host)

All DeepSeek models are MIT-licensed open source. Download from Hugging Face and run locally on your own GPU hardware with zero API fees. V4-Flash weights = 160GB.

MIT License · Free Forever
ℹ️ Note: There is no paid "Plus" or "Pro" subscription tier for chat users. DeepSeek's chat product is entirely free. The only paid service is the API (pay-per-token for developers building applications).
DeepSeek API Pricing

Every Model, Every Price

Pay only for tokens you use. No monthly fees, no seat licenses. Prices per million (1M) tokens in USD.

🔥 Limited-time promo: V4-Pro is 75% off until May 31, 2026, 15:59 UTC. Regular price $1.74/1M input → promo price $0.435/1M. Cache-hit prices reduced 10× effective April 26, 2026.
🔥 75% OFF until May 31
FLAGSHIP
V4-Pro
model: "deepseek-v4-pro"

Best open-weight model available. 1.6T total / 49B active. 80.6% SWE-bench, Codeforces #1 (3,206). Use for complex reasoning, agentic coding, and frontier-quality tasks.

Input — cache miss$0.435 / 1M (promo)
Input — regular price$1.74 / 1M
Input — cache hit$0.044 / 1M
Output tokens$0.87 / 1M (promo)
Context window1M tokens
Max output384K tokens
Use V4-Pro Now →
🔵
STABLE
V3.2 (Chat)
model: "deepseek-v3.2"

Previous flagship. 671B total / 37B active. 128K context. Excellent for general chat, RAG, summarization, and text generation where V4's extra capability isn't needed.

Input — cache miss$0.28 / 1M
Input — cache hit$0.028 / 1M
Output tokens$0.42 / 1M
Context window128K tokens
Max output (chat)8K tokens
Max output (reasoner)64K tokens
Use V3.2
🧠
REASONING
DeepSeek-R1
model: "deepseek-r1"

Dedicated chain-of-thought reasoning. 97.3% MATH-500. RL-trained without SFT. Best for math olympiad, scientific reasoning, and multi-step logical inference tasks.

Input — cache miss$0.55 / 1M
Input — cache hit$0.14 / 1M
Output tokens$2.19 / 1M
Context window128K tokens
Chain-of-thoughtYes (explicit)
vs OpenAI o196% cheaper
Use R1
🎁
FREE
Web Chat
chat.deepseek.com

100% free. No account required. Includes DeepThink reasoning mode, file uploads, image analysis, and web search. Both V4 models available instantly.

Price$0 / month
V4-Flash access✓ Instant Mode
V4-Pro access✓ Expert Mode
DeepThink (R1)✓ Included
File & image uploads✓ Included
Web search✓ Included
Open Chat Free →
⚠️
RETIRING
Legacy Aliases
retiring: July 24, 2026

deepseek-chat and deepseek-reasoner now route to V4-Flash. They will stop working July 24, 2026 at 15:59 UTC. Migration is changing only the model name — no other code changes needed.

deepseek-chat →V4-Flash (non-think)
deepseek-reasoner →V4-Flash (think)
Retire deadlineJul 24, 2026
Migrate todeepseek-v4-flash
Base URL change?No change needed
Migration Guide ↗

✓ Prices verified April 28, 2026 · Always confirm at api-docs.deepseek.com/quick_start/pricing before production use.

DeepSeek Pricing Per Month

Real Monthly Cost Estimates

No subscription fee — you pay only for tokens consumed. Here's what typical workloads actually cost per month with 70% cache hit rate.

Personal / Hobbyist
Light Usage
~10K calls/mo · 500 input + 200 output tokens/call
V4-Flash~$0.30
V4-Pro~$1.30
V3.2~$0.60
ChatGPT API~$30+

Web chat is free — API only needed for automated apps or integrations.

Small SaaS / Startup
Medium Usage
~100K calls/mo · 800 input + 400 output/call
V4-Flash~$4–8
V4-Pro~$15–30
V3.2~$8–15
ChatGPT API~$200–400

V4-Flash at this scale saves thousands compared to GPT-5.5 or Claude Opus.

Production App
Heavy Usage
~1M calls/mo · 1K input + 500 output/call
V4-Flash~$50–100
V4-Pro~$200–400
V3.2~$80–160
ChatGPT API~$3,000+

Maximize cache hits: consistent system prompts can cut input costs by 90%.

Enterprise / Agent
Very Heavy
~10M calls/mo + long context
V4-Flash~$200–800
V4-Pro~$1K–4K
Self-host~$0 API fees
ChatGPT API~$30K–100K

At enterprise scale, self-hosting V4-Flash (MIT license) can eliminate API costs entirely.

DeepSeek Pricing Calculator

Calculate Your Exact Cost

Enter your usage parameters and instantly see your estimated monthly, daily, and per-call costs with context caching factored in.

💰 Cost Estimator
$0
Monthly Cost
$0
Daily Cost
$0
Per Call Avg
Cheaper Than GPT
Enter values above to see your cost breakdown.
DeepSeek Pricing vs ChatGPT

How DeepSeek Compares

Side-by-side API pricing and subscription comparison against every major AI provider as of May 2026.

100×
V4-Flash output ($0.28) vs GPT-5.5 output ($30.00)
Cheaper on output
36×
V4-Flash input ($0.14) vs GPT-5.5 input ($5.00)
Cheaper on input
$0
DeepSeek web chat vs ChatGPT Plus at $20/month
Free vs $20/mo

API Token Pricing (per 1M tokens)

ModelInput /1MCache Hit /1MOutput /1MContextOpen?
DeepSeek V4-Flash$0.14$0.014$0.281M✓ MIT
DeepSeek V4-Pro$1.74$0.174$3.481M✓ MIT
DeepSeek V3.2$0.28$0.028$0.42128K✓ MIT
DeepSeek R1$0.55$0.14$2.19128K✓ MIT
GPT-5.5$5.00manual$20.00128K✗ Closed
GPT-4o$2.50$1.25$10.00128K✗ Closed
Claude Opus 4.7$5.00$0.50$25.00200K✗ Closed
Gemini 3 Pro$1.25manual$5.001M✗ Closed
GPT-5.4$10.00manual$30.00128K✗ Closed

Chat Subscription Comparison (Individual)

ProductMonthly PriceModels IncludedFree TierAPI Access
DeepSeek Chat$0 / monthV4-Flash + V4-Pro✓ UnlimitedPay-per-token
ChatGPT Plus$20 / monthGPT-5.2 + DALL-E✓ LimitedExtra cost
Claude Pro$20 / monthOpus 4.5 + Sonnet✓ LimitedExtra cost
Gemini AI Pro$20 / monthGemini 3.1 Pro✓ LimitedExtra cost
ChatGPT Go$5 / monthGPT-5 basic✓ Very limitedExtra cost
DeepSeek-R1 API Pricing

R1 — The Reasoning Specialist

96% cheaper than OpenAI o1 for the same class of deep reasoning tasks. Use when accuracy on complex logic matters more than speed.

DeepSeek-R1

Pure RL-trained chain-of-thought model. No supervised fine-tuning. Emergent reasoning, self-verification, and backtracking — the most capable reasoning model you can run for under $1/1M tokens.

Input — cache miss$0.55 / 1M
Input — cache hit$0.14 / 1M
Output tokens$2.19 / 1M
Max CoT output64K tokens
Context window128K tokens
MATH-500 score97.3%
Reasoning typeExplicit CoT
vs OpenAI o1 pricing96% cheaper
R1 vs OpenAI o1 — Cost Comparison
Input (per 1M) — DeepSeek R1$0.55
Input (per 1M) — OpenAI o1$15.00
Output (per 1M) — DeepSeek R1$2.19
Output (per 1M) — OpenAI o1$60.00

When to use R1:

Math olympiad / competition problems
Complex multi-step logical reasoning
Scientific analysis requiring chains of inference
Simple chat or fast-response apps (use Flash)
Latency-sensitive pipelines (R1 is slower)
DeepSeek Coding Plan

Best Plans for Developers

No dedicated "coding plan" exists — but here's how each option maps to common developer use cases and budgets.

🎁
100% Free
Personal Dev
Free Chat + Weights
$0
forever · no signup needed
chat.deepseek.com — V4-Flash + V4-Pro
DeepThink reasoning (R1-powered)
Code generation + review + debugging
File upload + image analysis
Open source weights (Ollama, LM Studio)
No programmatic API access
🧪
Side Projects
API Starter
$5
minimum top-up · ~35M Flash tokens
5M free tokens on new account
All models: V4-Flash, V4-Pro, R1
OpenAI-compatible (drop-in replacement)
Function calling + JSON mode
Streaming SSE responses
Automatic context caching (90% off)
🚀
Professional / SaaS
V4-Flash Production
$0.14
per 1M input tokens · no monthly fee
80.6% SWE-bench (coding benchmark)
93.5% LiveCodeBench
1M token context (full codebase in one prompt)
Repository-level code understanding
Agentic coding (Claude Code compatible)
LangChain / LlamaIndex compatible
🏢
Enterprise / Self-host
Open Weights
$0
API fees · MIT license · your GPU costs only
Full model weights on Hugging Face
V4-Flash: 160GB download
V4-Pro: 865GB download
Commercial use allowed (MIT)
Full data privacy — on-premise
Requires multi-GPU infrastructure
DeepSeek Pricing Reddit

What Developers Are Saying

Community insights from r/deepseek, r/LocalLLaMA, and r/MachineLearning on DeepSeek's pricing and value.

ML
u/mleng_seattle
r/deepseek · 847 upvotes
▲ 847

"Switched our entire RAG pipeline from GPT-4o to V4-Flash last week. Monthly bill dropped from $1,200 to $38. Same quality for our document summarization use case. The 90% cache discount makes the math absolutely bonkers."

RAGcost-savingsV4-Flash
DR
u/devrel_advocate
r/LocalLLaMA · 621 upvotes
▲ 621

"People keep asking if the free tier is legit. It is. chat.deepseek.com gives you V4-Pro (Expert Mode) for free, no rate limits that I've noticed for normal use. There is no 'DeepSeek Plus' — it's just free. For API you need to pay but it starts at $0.14/1M."

free-tierverified
QF
u/quant_finance_dev
r/MachineLearning · 512 upvotes
▲ 512

"R1 for math is insane. 97.3% on MATH-500 at $2.19/1M output vs o1 at $60/1M. That's not a rounding difference — it's 27× cheaper for equivalent reasoning quality on quantitative problems. This changes my cost model for production math agents."

R1mathvs-o1
ST
u/startup_founder_23
r/deepseek · 389 upvotes
▲ 389

"Caveat: V4-Flash is great but be careful with sensitive customer data. Chinese company, servers in China, no HIPAA/SOC2 guarantees on the direct API. We run it through AWS Bedrock for compliance. Adds ~30% cost but worth it for our healthcare SaaS."

privacycomplianceAWS-Bedrock
VS
u/vibe_coder_2026
r/LocalLLaMA · 744 upvotes
▲ 744

"If you're using Cursor, Claude Code, or Windsurf — just point it at DeepSeek V4-Flash. The agentic coding is legitimately on par with V4-Pro for most repo tasks and you're paying $0.14/$0.28 instead of $5/$20. Change two config lines."

cursorcoding-agentssetup-guide
PH
u/ph_dev_indie
r/deepseek · 467 upvotes
▲ 467

"Got the 75% promo on V4-Pro before it expires May 31. At $0.435/1M input it's stupid cheap for frontier reasoning. Already set a calendar reminder to check if the discount extends. DeepSeek has been progressively reducing prices — I wouldn't be shocked if V4-Flash goes even lower."

V4-Propromomay-deadline
FAQ

Pricing Questions Answered

Is DeepSeek really free? What do I get at $0?+

Yes — completely. The web chat at chat.deepseek.com and the official iOS/Android apps are 100% free with no usage limits for normal use. You get access to both V4-Flash (Instant Mode) and V4-Pro (Expert Mode), DeepThink reasoning, file uploads, web search, and image analysis — all at $0/month. There is no paid "Plus" or "Pro" chat subscription. The only paid service is the API for developers building applications.

What is the DeepSeek V4 pricing breakdown?+

DeepSeek V4 has two variants. V4-Flash: $0.14/1M input (cache miss), $0.014/1M (cache hit), $0.28/1M output. V4-Pro: $1.74/1M input, $0.174/1M cache hit, $3.48/1M output at regular price. V4-Pro is currently 75% off until May 31, 2026 — promo prices are $0.435/1M input and $0.87/1M output. Both models have 1M token context and 384K max output. Prices verified April 28, 2026 at api-docs.deepseek.com.

What does DeepSeek API cost per month?+

There is no monthly subscription — you pay only for tokens consumed. Rough estimates with 70% cache hit rate: Light use (10K calls/mo): ~$0.30 on V4-Flash. Medium use (100K calls/mo): ~$4–8 on V4-Flash. Heavy use (1M calls/mo): ~$50–100 on V4-Flash. Compare to ChatGPT API (GPT-5.5) which would cost ~$30, ~$200–400, and ~$3,000+ respectively for the same usage. Use the calculator on this page for your specific token volumes.

How does DeepSeek R1 API pricing compare to OpenAI o1?+

DeepSeek R1 costs $0.55/1M input and $2.19/1M output. OpenAI o1 costs $15/1M input and $60/1M output. That's a 27× difference on input and a 27× difference on output — roughly 96% cheaper for equivalent reasoning tasks. R1 generates more output tokens per query (chain-of-thought), so your effective per-query cost is higher than Flash/V3, but still dramatically cheaper than any OpenAI reasoning model. Use R1 only when deep multi-step reasoning is genuinely needed.

How does context caching work and how much does it save?+

Automatic and free — no configuration needed. When a request starts with the same prefix as a recent request (system prompt, document, etc.), DeepSeek serves those tokens at 10% of the normal input price. For V4-Flash: $0.014/1M instead of $0.14/1M. At 70% cache hit rate, effective input cost drops by ~63%. At 90% cache hit rate (achievable with consistent system prompts), effective input cost drops by ~86%. Structure prompts with static content first (system prompt, documents) and variable content (user queries) last to maximize hit rate.

Is DeepSeek pricing cheaper than ChatGPT?+

Dramatically cheaper on API. V4-Flash ($0.14/$0.28) vs GPT-5.5 ($5/$20): 36× cheaper on input, 71× cheaper on output. Even V4-Pro at regular prices ($1.74/$3.48) is 2.9× cheaper on input and 5.7× cheaper on output than GPT-5.5. For chat (not API), DeepSeek is free while ChatGPT Plus costs $20/month. Reddit consistently confirms the savings are real in production — switch typically takes under 5 minutes since DeepSeek is OpenAI-API-compatible.

What is the DeepSeek coding plan for developers?+

There's no dedicated "coding plan" — all API plans give you access to DeepSeek's coding-capable models. For coding specifically: use deepseek-v4-flash for most tasks (82.6% HumanEval, 79% SWE-bench, $0.14/1M). Use deepseek-v4-pro for complex agentic coding (80.6% SWE-bench, Codeforces #1 at 3,206). V4 is integrated with Claude Code, OpenClaw, and OpenCode. It works with Cursor and Windsurf by pointing the base URL to DeepSeek's API. Self-hosting via Hugging Face (MIT license) eliminates all API costs for enterprise scale.

When do deepseek-chat and deepseek-reasoner retire?+

Both legacy aliases retire on July 24, 2026 at 15:59 UTC — after which they will return errors with no fallback. Currently: deepseek-chat routes to V4-Flash non-thinking mode; deepseek-reasoner routes to V4-Flash thinking mode. Migration is a one-line change: replace the model name with deepseek-v4-flash or deepseek-v4-pro. Base URL, authentication, and request format are all unchanged. No card required, no other code changes.

Get Started Today

Start Free.
Scale Cheap.

Web chat is free forever. API starts at $0.14/1M tokens with 5M free tokens on new accounts. No subscription. No commitment.

Open Free Chat → Get API Key Official Pricing ↗