Coding agents and autonomous workflows consume 10–100× the tokens of a chat. Pick your scenario and see the bill.
Vendor choice matters here: the spread between the cheapest and the priciest model is 706%. Routing the right tasks to the right model can cut your bill significantly.
| Vendor / Model | Cached input | Realtime input | Output | Monthly total |
|---|---|---|---|---|
| DeepSeek / DeepSeek V3.2 (reasoner) | $1 | $13 | $17 | $31 |
| OpenAI / GPT-5.4 Nano | $1 | $9 | $50 | $60 |
| Google / Gemini 3.1 Flash-Lite | $1 | $12 | $59 | $72 |
| Anthropic / Claude Haiku 4.5 | $5 | $46 | $198 | $249 |
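
To make the arithmetic behind the table concrete, here is a minimal Python sketch. It assumes each monthly figure is the sum of the three cost columns (which matches the rows shown) and that "spread" means the percentage by which the priciest total exceeds the cheapest. With the rounded dollar amounts above, that comes to roughly 703%, close to the 706% quoted; the small gap is presumably rounding in the displayed figures. The `blended_monthly` helper and the 70/30 split are hypothetical, and they assume cost scales linearly with the share of workload sent to each model.

```python
# Monthly cost components in USD, copied from the table above (rounded figures).
COSTS = {
    "DeepSeek V3.2 (reasoner)": {"cached_input": 1, "realtime_input": 13, "output": 17},
    "GPT-5.4 Nano": {"cached_input": 1, "realtime_input": 9, "output": 50},
    "Gemini 3.1 Flash-Lite": {"cached_input": 1, "realtime_input": 12, "output": 59},
    "Claude Haiku 4.5": {"cached_input": 5, "realtime_input": 46, "output": 198},
}


def monthly_total(parts: dict[str, float]) -> float:
    """Sum the cached-input, realtime-input, and output costs for one model."""
    return sum(parts.values())


def spread_percent(totals: dict[str, float]) -> float:
    """Percentage by which the priciest total exceeds the cheapest (assumed definition)."""
    lowest, highest = min(totals.values()), max(totals.values())
    return (highest - lowest) / lowest * 100


def blended_monthly(totals: dict[str, float], shares: dict[str, float]) -> float:
    """Hypothetical routing estimate: assumes cost scales linearly with workload share."""
    if abs(sum(shares.values()) - 1.0) > 1e-9:
        raise ValueError("workload shares must sum to 1")
    return sum(totals[model] * share for model, share in shares.items())


totals = {model: monthly_total(parts) for model, parts in COSTS.items()}
spread = spread_percent(totals)

# Illustrative split only: 70% of the workload to the cheapest model, 30% to the priciest.
blended = blended_monthly(
    totals,
    {"DeepSeek V3.2 (reasoner)": 0.7, "Claude Haiku 4.5": 0.3},
)

for model, total in sorted(totals.items(), key=lambda item: item[1]):
    print(f"{model}: ${total}/month")
print(f"Spread: {spread:.0f}%")
print(f"Blended 70/30 estimate: ${blended:.2f}/month")
```

Under these assumptions, the 70/30 split comes out well below running everything on the priciest model, which is the point of routing. Real savings depend on which tasks the cheaper model can actually handle.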