Claude Haiku 4.5 vs Gemini 2.5 Pro— Pricing & Token Cost Comparison
Side-by-side API pricing and tokenizer details for Claude Haiku 4.5 (Anthropic) and Gemini 2.5 Pro (Google).
Side-by-side pricing
| Feature | Claude Haiku 4.5 | Gemini 2.5 Pro |
|---|---|---|
| Provider | Anthropic | |
| Input (per 1M tokens) | $0.800 | $1.25 |
| Output (per 1M tokens) | $4.00 | $10.00 |
| Context caching | Yes — 90% off cached tokens | No |
| Batch API discount | Not available | Not available |
| Context window | 200K tokens | 1M tokens |
| Tokenizer | Anthropic tokenizer | Gemini tokenizer |
Real-world cost example
1,000 API requests per month, each with 500 input tokens and 200 output tokens (500K input + 200K output total).
Claude Haiku 4.5
$1.2000
Input: $0.4000 + Output: $0.8000
Gemini 2.5 Pro
$2.6250
Input: $0.6250 + Output: $2.0000
Claude Haiku 4.5 is 54% cheaper for this workload — saving $1.4250 per month at this volume.
Frequently asked questions
- Is Claude Haiku 4.5 cheaper than Gemini 2.5 Pro?
- Yes, Claude Haiku 4.5 is cheaper for the typical workload above. At $0.800/1M input and $4.00/1M output tokens, it costs $1.2000 versus $2.6250 for Gemini 2.5 Pro — a 54% difference. Costs scale linearly, so larger workloads amplify this gap.
- What is the context window of Claude Haiku 4.5 vs Gemini 2.5 Pro?
- Claude Haiku 4.5 supports a 200K token context window. Gemini 2.5 Pro supports a 1M token context window. A larger context window lets you include more text — documents, conversation history, or code — in a single API call.
- Do Claude Haiku 4.5 or Gemini 2.5 Pro support context caching or batch discounts?
- Claude Haiku 4.5 supports context caching (90% off repeated tokens). It does not offer a batch API discount. Gemini 2.5 Pro does not support context caching. It does not offer a batch API discount.
Calculate costs for your actual prompt
Paste your prompt into the calculator and get exact token counts using each model's real tokenizer — all in your browser.
Open calculator