Question 1

How much does the Anthropic Claude API cost?

Accepted Answer

Anthropic Claude API pricing (June 2026): Claude 3.5 Sonnet costs $3.00/M input and $15.00/M output tokens. Claude 3.5 Haiku costs $0.80/M input and $4.00/M output. Claude 3 Opus costs $15.00/M input and $75.00/M output. Prompt caching reduces input cost to $0.30/M for Sonnet on cached tokens.

Question 2

What is Claude prompt caching and how much does it save?

Accepted Answer

Prompt caching stores repeated context (system prompts, documents, examples) server-side. Cache hits cost 10% of normal input price on Claude 3.5 Sonnet — $0.30/M instead of $3.00/M. For a 2,000-token system prompt on 100,000 requests/month, caching saves approximately $540/month.

Question 3

Is Claude 3.5 Sonnet cheaper than GPT-4o?

Accepted Answer

On input tokens, Claude 3.5 Sonnet ($3.00/M) is slightly more expensive than GPT-4o ($2.50/M). On output tokens, Sonnet ($15.00/M) is 50% more expensive than GPT-4o ($10.00/M). However, Claude's prompt caching can make Sonnet significantly cheaper for workloads with large repeated context.

Question 4

Which Claude model is cheapest?

Accepted Answer

Claude 3.5 Haiku is the cheapest capable Claude model at $0.80/M input and $4.00/M output tokens — ideal for high-volume classification, extraction, and simple tasks. For cost-critical workloads, Haiku is 3.75× cheaper than Sonnet on input and 3.75× cheaper on output.

Model	Input / 1M	Cached Input / 1M	Output / 1M	Context
Claude 3.5 HaikuCHEAPEST	$0.80	$0.08	$4.00	200K
Claude 3.5 Sonnet	$3.00	$0.30	$15.00	200K
Claude 3 Opus	$15.00	$1.50	$75.00	200K

Claude API Pricing Calculator

🧠 Claude API Cost Estimator

Claude's Killer Feature: Prompt Caching

⚡ Up to 90% off input tokens

2026 Claude API Pricing

Frequently Asked Questions

Compare All LLM API Costs