Catalog freshness: ✓ Curated · verified today

Estimated monthly cost

$66/mo

~$791/year · 1,100 tasks/mo

DeepSeek · DeepSeek V3.2 (reasoner) · $31/mo

OpenAI · GPT-5.4 Nano · $60/mo

Google · Gemini 3.1 Flash-Lite · $72/mo

Anthropic · Claude Haiku 4.5 · $249/mo

Vendor choice matters here: the priciest option costs about eight times the cheapest (a 706% spread). Routing each task to the cheapest model that can handle it can cut your bill significantly.
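As a sanity check, the spread can be recomputed from the rounded monthly figures listed above. This is a minimal sketch; the function name `spread_pct` is hypothetical, and the formula (priciest minus cheapest, divided by cheapest) is an assumption about how the page defines "spread". With rounded inputs it gives about 703%, so the 706% shown presumably comes from unrounded costs.

```python
def spread_pct(monthly_costs: dict[str, float]) -> float:
    """Percent by which the priciest option exceeds the cheapest."""
    cheapest = min(monthly_costs.values())
    priciest = max(monthly_costs.values())
    return (priciest - cheapest) / cheapest * 100


# Rounded monthly costs as displayed on the page (USD).
costs = {"DeepSeek": 31, "OpenAI": 60, "Google": 72, "Anthropic": 249}
print(f"{spread_pct(costs):.0f}%")  # prints "703%"
```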

Workflow scale

Tasks per day (across team)

Tokens per task (avg)

Reference: small refactor ~25K · Claude Code session ~100K · multi-file research ~300K · long autonomous run ~800K

Working days per month

Token split - input share (0.1–0.95)

0.7 = 70% input / 30% output (typical code-editing). Lower for output-heavy workflows.
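The scale inputs above combine into monthly token volumes roughly as follows. This is a sketch, not the calculator's actual code: the function name is hypothetical, and the example values (50 tasks/day, 22 working days) are assumptions chosen only because they multiply out to the 1,100 tasks/mo shown in the summary.

```python
def monthly_tokens(tasks_per_day: int, tokens_per_task: int,
                   working_days: int, input_share: float) -> tuple[int, int]:
    """Return (input_tokens, output_tokens) per month."""
    if not 0.1 <= input_share <= 0.95:
        raise ValueError("input_share must be between 0.1 and 0.95")
    total = tasks_per_day * tokens_per_task * working_days
    input_tokens = round(total * input_share)
    return input_tokens, total - input_tokens


# Assumed inputs: 50 tasks/day, ~100K tokens per task, 22 days, 70% input.
inp, out = monthly_tokens(50, 100_000, 22, 0.7)
print(f"{inp:,} input / {out:,} output")  # 77,000,000 input / 33,000,000 output
```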

Model & optimization

Model tier

Cache hit rate (0–1)

Fraction of input tokens served from cache (system prompts, repeated context). Vendors that publish no cached-input rate are treated as a 0 hit rate regardless of this setting.
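One way to read the cache rule is sketched below: input tokens are split into a cached and a realtime portion, and a vendor with no published cached rate falls back to a hit rate of 0. The function name and the prices in the example are hypothetical placeholders, not figures from the catalog.

```python
from typing import Optional


def input_cost(input_tokens: float, price_per_m: float,
               cached_price_per_m: Optional[float],
               cache_hit_rate: float) -> tuple[float, float]:
    """Return (cached_cost, realtime_cost) in USD for the input tokens."""
    if cached_price_per_m is None:
        # No published cache rate: every input token is billed at full price.
        cache_hit_rate = 0.0
        cached_price_per_m = 0.0
    cached_tokens = input_tokens * cache_hit_rate
    realtime_tokens = input_tokens - cached_tokens
    cached_cost = cached_tokens / 1e6 * cached_price_per_m
    realtime_cost = realtime_tokens / 1e6 * price_per_m
    return cached_cost, realtime_cost


# Hypothetical prices: $1.00/M input, $0.10/M cached input, 10M input tokens.
print(input_cost(10_000_000, 1.00, 0.10, 0.5))
# Same workload, but the vendor publishes no cached rate.
print(input_cost(10_000_000, 1.00, None, 0.5))
```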

Batch-eligible share (0–1)

Fraction of the workload that can run asynchronously as batch jobs (50% discount). For most realtime agents this is 0.
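The batch setting can be modeled as a single blended multiplier on cost. The sketch below assumes the 50% discount applies uniformly to the batch-eligible share of all token costs; whether the calculator applies it to cached tokens as well is not stated, and the names here are hypothetical.

```python
BATCH_DISCOUNT = 0.5  # the 50% discount quoted for async workloads


def apply_batch_discount(cost: float, batch_share: float,
                         discount: float = BATCH_DISCOUNT) -> float:
    """Blend full-price and discounted traffic into one monthly cost."""
    if not 0.0 <= batch_share <= 1.0:
        raise ValueError("batch_share must be between 0 and 1")
    return cost * (1 - batch_share * discount)


for share in (0.0, 0.4, 1.0):
    print(f"share={share}: ${apply_batch_discount(100.0, share):.2f}")
# share=0.0: $100.00
# share=0.4: $80.00
# share=1.0: $50.00
```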

Runaway-loop safety buffer (1.0–3.0)

Multiplier for unexpected agent loops. 1.2 = 20% buffer (recommended).

Per-vendor breakdown

Vendor / Model	Cached input	Realtime input	Output	Monthly
DeepSeek DeepSeek V3.2 (reasoner)	$1	$13	$17	$31
OpenAI GPT-5.4 Nano	$1	$9	$50	$60
Google Gemini 3.1 Flash-Lite	$1	$12	$59	$72
Anthropic Claude Haiku 4.5	$5	$46	$198	$249
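To show how the three cost columns might add up to the monthly total, here is a sketch of the whole estimate. Everything in it is an assumption: the page states neither its default inputs nor its per-token prices, so the values below (50 tasks/day, 100K tokens per task, 22 days, 0.7 input share, 0.5 cache hit rate, no batch, 1.2 buffer, and $1.00 / $0.10 / $5.00 per million tokens) were picked only because they reproduce the Anthropic row to the nearest dollar. The buffer is assumed to scale all tokens equally.

```python
from dataclasses import dataclass

BATCH_DISCOUNT = 0.5  # the 50% async discount described on the page


@dataclass
class Prices:
    """USD per million tokens. All three values are assumptions."""
    realtime_input: float
    cached_input: float
    output: float


def estimate_monthly(prices: Prices, tasks_per_day: int, tokens_per_task: int,
                     working_days: int, input_share: float,
                     cache_hit_rate: float, batch_share: float,
                     buffer: float) -> dict[str, float]:
    # Total monthly tokens in millions, inflated by the safety buffer.
    total_m = tasks_per_day * tokens_per_task * working_days * buffer / 1e6
    input_m = total_m * input_share
    output_m = total_m - input_m
    # Blend full-price and batch-price traffic into one multiplier.
    batch_factor = 1 - BATCH_DISCOUNT * batch_share
    cached = input_m * cache_hit_rate * prices.cached_input * batch_factor
    realtime = (input_m * (1 - cache_hit_rate)
                * prices.realtime_input * batch_factor)
    output = output_m * prices.output * batch_factor
    return {"cached": cached, "realtime": realtime, "output": output,
            "monthly": cached + realtime + output}


# Assumed prices and inputs (see the note above).
result = estimate_monthly(Prices(1.00, 0.10, 5.00), 50, 100_000, 22,
                          input_share=0.7, cache_hit_rate=0.5,
                          batch_share=0.0, buffer=1.2)
print({name: round(usd) for name, usd in result.items()})
# {'cached': 5, 'realtime': 46, 'output': 198, 'monthly': 249}
```

Keeping the buffer and batch factor as plain multipliers makes it easy to see how each setting moves the total when you change one at a time.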

Want to model this as an architect? Try the Agent Loop Cost calculator for per-task breakdown, or Agentic AI Stack for planner+executor stacks. For full TCO, run the TCO Wizard. See methodology →

Estimate your monthly agentic-AI burn
