Last updated: June 2026 · Current pricing

LLM API Cost Calculator

Compare GPT-4.1, Claude 4, Gemini 2.5, and 10+ models. Enter your token volume and request count — see exact monthly costs. Free, no signup, runs in your browser.

Advertisement

🤖 LLM API Cost Calculator

Select model · Enter token volume · Results update live

Estimated cost
$0
— per month
Input / output per 1M tokens
Cost per request
Input cost
Output cost
Total tokens

2026 LLM API Pricing Table

All prices in USD per 1 million tokens, pay-as-you-go, June 2026. Source: official provider pricing pages.

ModelInput / 1MOutput / 1MContextBest for
Gemini 2.5 Flash-LiteCHEAPEST$0.10$0.401MUltra-high volume
GPT-4o mini$0.15$0.60128KHigh-volume, low-cost
Gemini 2.5 Flash$0.30$2.501MBalanced speed/cost
Claude Haiku 4.5$1.00$5.00200KQuality on a budget
GPT-4.1$2.00$8.001MGeneral flagship 2026
o3$2.00$20.00200KHard reasoning
GPT-4o$2.50$10.00128KMultimodal tasks
Claude Sonnet 4.6$3.00$15.001MCoding & reasoning
Gemini 2.5 Pro$1.25$10.001MLong-context tasks
Claude Opus 4.8$5.00$25.001MComplex agentic tasks
Advertisement
💡 50% off with Batch API

OpenAI, Anthropic, and Google all offer batch processing at 50% discount for async workloads (up to 24h latency). Zero quality difference.

Real-World Cost Examples

Monthly cost for common production workloads.

WorkloadVolumeGPT-4o miniGPT-4.1Claude Sonnet 4.6
Customer support bot10K req/day · 1K+300 tok$4.05$66$97.50
Document summarizer1K docs/day · 4K+800 tok$9.36$163$246
Code review assistant500 req/day · 3K+1K tok$9.45$156$247.50
RAG Q&A system5K req/day · 2K+500 tok$22.50$360$562.50

Ready to build? Start with free cloud credits:

Frequently Asked Questions

How much does GPT-4o cost per 1 million tokens?+

GPT-4o costs $2.50 per million input tokens and $10.00 per million output tokens (pay-as-you-go, June 2026). The newer GPT-4.1 is cheaper at $2.00/$8.00 per 1M and often a better choice.

What is the cheapest LLM API in 2026?+

Gemini 2.5 Flash-Lite at $0.10/$0.40 per million tokens is the cheapest capable model. GPT-4o mini ($0.15/$0.60) is the cheapest OpenAI option. Both are strong for high-volume workloads.

How much does Claude Sonnet 4.6 cost?+

Claude Sonnet 4.6 costs $3.00 per million input tokens and $15.00 per million output tokens via the Anthropic API. It offers 1M token context and excels at coding tasks.

How do I reduce LLM API costs?+

Use cheaper models (GPT-4o mini vs GPT-4o is 16× cheaper). Enable Batch API for 50% off async workloads. Cache repeated prompts with prompt caching (Anthropic: 90% off cached tokens). Trim system prompts. Set max_tokens explicitly.

What is a token in LLM pricing?+

A token is roughly 4 characters or 0.75 words. A 1,000-word document is approximately 1,333 tokens. Most LLM APIs charge separately for input (prompt) and output (completion) tokens.

Advertisement

Related Calculators