Estimate GPT-4o, GPT-4o mini, o1, and o3 costs instantly. Enter your token volume and request count — see exact monthly and annual bills. Free, no signup, runs in your browser.
Pay-as-you-go rates in USD per 1 million tokens, June 2026.
| Model | Input / 1M tokens | Output / 1M tokens | Context | Use case |
|---|---|---|---|---|
| GPT-4o mini (cheapest) | $0.15 | $0.60 | 128K | High-volume tasks |
| GPT-4o | $2.50 | $10.00 | 128K | Flagship general use |
| o1-mini | $3.00 | $12.00 | 128K | STEM & coding |
| o1 | $15.00 | $60.00 | 200K | Hard reasoning tasks |
| o3-mini | $1.10 | $4.40 | 200K | Cost-efficient reasoning |
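To illustrate how the table turns into a bill, here is a minimal sketch that multiplies per-request token counts by the listed rates. The names `RATES`, `cost_per_request`, and `monthly_cost`, the dictionary keys, and the 30-day month are assumptions made for this example. They are not part of any OpenAI SDK and may not match how the calculator itself works.

```python
# Rates copied from the table above: (input, output) in USD per 1M tokens.
# The dictionary keys are just lookup labels chosen for this sketch.
RATES = {
    "gpt-4o-mini": (0.15, 0.60),
    "gpt-4o": (2.50, 10.00),
    "o1-mini": (3.00, 12.00),
    "o1": (15.00, 60.00),
    "o3-mini": (1.10, 4.40),
}


def cost_per_request(model, input_tokens, output_tokens):
    """USD cost of a single request with the given token counts."""
    in_rate, out_rate = RATES[model]
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


def monthly_cost(model, input_tokens, output_tokens, requests_per_day, days=30):
    """USD cost of a month of identical requests (30-day month assumed)."""
    return cost_per_request(model, input_tokens, output_tokens) * requests_per_day * days


if __name__ == "__main__":
    # 1,000 requests/day, each with 1,000 input and 500 output tokens.
    monthly = monthly_cost("gpt-4o", 1_000, 500, 1_000)
    print(f"monthly: ${monthly:.2f}, annual: ${monthly * 12:.2f}")
```

With these inputs, GPT-4o comes to $0.0075 per request, or $225 per 30-day month. Real bills depend on actual token counts, which vary from request to request.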
All models support Batch API — async processing (up to 24h) at half price. Zero quality difference. Perfect for document processing, content generation, and any non-real-time workload.
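The half-price claim is easy to check with arithmetic. The sketch below compares standard and batch pricing for one workload; `BATCH_DISCOUNT` and `batch_cost` are hypothetical names for this example, and the 50% figure is taken from the text above rather than from an API call.

```python
BATCH_DISCOUNT = 0.5  # "half price", as stated in the text above


def batch_cost(input_tokens, output_tokens, in_rate, out_rate):
    """Return (standard_cost, batch_cost) in USD; rates are USD per 1M tokens."""
    standard = (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000
    return standard, standard * BATCH_DISCOUNT


if __name__ == "__main__":
    # Example workload: 200M input and 50M output tokens on GPT-4o.
    standard, batch = batch_cost(200_000_000, 50_000_000, 2.50, 10.00)
    print(f"standard: ${standard:.2f}, batch: ${batch:.2f}")
```

For this workload the standard price is $1,000 and the batch price is $500, in exchange for accepting results up to 24 hours later.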
GPT-4o mini: About 16.7× cheaper than GPT-4o. Best for classification, extraction, summarization, simple chat, and intent detection.
GPT-4o: Complex reasoning, code generation, multimodal input, and nuanced analysis. Use when GPT-4o mini falls short.
o3-mini: Best-value reasoning model for STEM problems, coding challenges, and math. Cheaper than o1 with similar quality.
o1: Maximum-capability reasoning. Use sparingly for truly hard tasks where other models consistently fail.
GPT-4o mini: $0.15/$0.60 per million tokens (cheapest). GPT-4o: $2.50/$10.00. o1: $15.00/$60.00. o3-mini: $1.10/$4.40. All are pay-as-you-go with no monthly minimum (June 2026 pricing).
Yes — "ChatGPT API" refers to accessing GPT models via OpenAI's chat completions API. The same models powering ChatGPT Plus are available via API with per-token billing. Your account, billing, and rate limits are managed at platform.openai.com.
No free tier for production API use. New accounts may receive a small one-time credit. ChatGPT.com has a free web interface, but that's separate from API access. You need a paid API account with billing configured.
(1) Route simple tasks to GPT-4o mini, which is about 16.7× cheaper than GPT-4o. (2) Use the Batch API for non-real-time work at 50% off. (3) Trim your system prompt: every token in it is billed on every request. (4) Set max_tokens explicitly. (5) Cache responses to repeated identical requests.
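The effect of strategies (1) and (3) can be estimated with a short calculation. The sketch below is illustrative only: the function names, the 80% routing share, and the request sizes are assumptions for this example, and the rates are the GPT-4o and GPT-4o mini prices quoted on this page.

```python
def request_cost(input_tokens, output_tokens, in_rate, out_rate):
    """USD cost of one request; rates are USD per 1M tokens."""
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


def routed_monthly_cost(requests, share_to_mini, input_tokens, output_tokens):
    """Monthly cost when a share of requests goes to GPT-4o mini, the rest to GPT-4o."""
    mini = request_cost(input_tokens, output_tokens, 0.15, 0.60)
    full = request_cost(input_tokens, output_tokens, 2.50, 10.00)
    mini_requests = requests * share_to_mini
    return mini_requests * mini + (requests - mini_requests) * full


def prompt_trim_savings(tokens_removed, requests, in_rate):
    """USD saved by removing tokens from a system prompt sent on every request."""
    return tokens_removed * requests * in_rate / 1_000_000


if __name__ == "__main__":
    # 100,000 requests/month, each with 1,000 input and 500 output tokens.
    all_gpt4o = routed_monthly_cost(100_000, 0.0, 1_000, 500)
    routed = routed_monthly_cost(100_000, 0.8, 1_000, 500)
    trimmed = prompt_trim_savings(200, 100_000, 2.50)
    print(f"all GPT-4o: ${all_gpt4o:.2f}")
    print(f"80% routed to mini: ${routed:.2f}")
    print(f"saved by trimming 200 prompt tokens: ${trimmed:.2f}")
```

Under these assumptions, routing 80% of traffic to GPT-4o mini cuts the bill from $750 to $186, and trimming 200 tokens from a GPT-4o system prompt saves a further $50 per month.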
GPT-4o ($2.50/$10.00) costs twice as much as Gemini 1.5 Pro ($1.25/$5.00) but is cheaper than Claude 3.5 Sonnet ($3.00/$15.00) on both input and output. For high-volume tasks, Gemini 1.5 Flash ($0.075/$0.30) is far cheaper than all three, and half the price of GPT-4o mini.
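For a specific workload, the comparison can be reproduced by pricing the same token volume against each model. This is a rough sketch using only the rates quoted on this page; `PRICES`, `workload_cost`, and `rank_by_cost` are hypothetical names, and the example workload is an assumption.

```python
# (input, output) rates in USD per 1M tokens, as quoted on this page.
PRICES = {
    "GPT-4o": (2.50, 10.00),
    "GPT-4o mini": (0.15, 0.60),
    "Gemini 1.5 Pro": (1.25, 5.00),
    "Gemini 1.5 Flash": (0.075, 0.30),
    "Claude 3.5 Sonnet": (3.00, 15.00),
}


def workload_cost(model, input_millions, output_millions):
    """USD cost for a workload measured in millions of tokens."""
    in_rate, out_rate = PRICES[model]
    return input_millions * in_rate + output_millions * out_rate


def rank_by_cost(input_millions, output_millions):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {m: workload_cost(m, input_millions, output_millions) for m in PRICES}
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Example workload: 10M input tokens and 2M output tokens.
    for model, cost in rank_by_cost(10, 2):
        print(f"{model:<18} ${cost:.2f}")
```

Price is only one axis of the comparison. The ranking says nothing about output quality, latency, or context limits.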