AICost Clarity

Where your AI money actually goes.

AICost offers 42 free AI cost calculators →, 42 expert guides →, and TCO & ROI playbooks → for self-service. But when you need expert help quickly to gain visibility into your AI & cloud spend for your workload, choose one of the offers below.

You get a clear, honest diagnosis of your current AI spend. We tell you where the waste is. You decide what to do.

Three sizes — pick the one that fits

Pay via Stripe, then schedule your kickoff call.

AICost Clarity Personal

Indie devs · vibecoders · freelancers

$49 one-time

⏱ 24-hour async turnaround

Send us 1 invoice or screenshot
We recommend the cheapest equivalent for your use case
1-page email summary with specific savings dollars
Money-back if we don’t show ≥$50/yr in savings

Best for: Spending $50–$500/mo on AI subscriptions or API

Most popular

AICost Clarity SMB

5–50 employee businesses · $500–$10K/mo AI spend

$199 one-time

⏱ 3-day delivery

Send us 3–5 invoices (Anthropic, OpenAI, AWS Bedrock, Cursor, etc.)
Multi-vendor concentration risk assessment
Where your money goes (line-item analysis)
30-min Zoom call walking you through findings
2-page branded PDF you can share with your team
Email Q&A follow-up for 7 days

Best for: You see vendor invoices but don’t know if they’re reasonable

AICost Clarity Enterprise

$10K+/mo AI spend · regulated industries

$2,500 one-time

⏱ 7-day delivery

Full AI/cloud spend audit across all vendors
TCO model with your numbers
Vendor concentration risk + lock-in scoring
Industry-specific custom workloads (5+) built into our catalog
6-page branded PDF (your logo) with executive summary
60-min strategy call
Live access to your private workloads in our catalog

Best for: Board-level AI spend reviews · regulated industries · multi-vendor audits

Try free first

Most of what you need is in our free calculators and guides. Browse the ones we use most for each size — if they answer your question, no need to hire us.

Personal 6 free tools

⭐

AI Subscription Picker - ChatGPT vs Claude vs Gemini vs Cursor

Open calculator → 📖 Read guide

🆓

Free Tier Checker - What You Can Actually Get for $0/Month

Open calculator → 📖 Read guide

⚙️

Developer AI Stack - Cursor + Copilot + Claude + ChatGPT

Open calculator → 📖 Read guide

🎨

Creator AI Bundle - Midjourney + Suno + ElevenLabs + Writer

Open calculator → 📖 Read guide

👨‍👩‍👧

AI Family Plan - ChatGPT Team, Claude Team, and Family Bundles

Open calculator → 📖 Read guide

🧮

AI Cost Calculator - A First-Principles Guide to LLM Pricing

Open calculator → 📖 Read guide

SMB 6 free tools

🧮

AI Cost Calculator - A First-Principles Guide to LLM Pricing

Open calculator → 📖 Read guide

🤖

Agentic Workflow Cost - A Guide for Engineering Leaders

Open calculator → 📖 Read guide

🛡️

Vendor Concentration Risk - How Exposed Is Your AI Portfolio?

Open calculator → 📖 Read guide

🎯

AI ROI Quick Check - Will Your AI Investment Pay Back?

Open calculator → 📖 Read guide

⚠️

Overage Forecaster - When Will You Breach Your AI Budget?

Open calculator → 📖 Read guide

💼

AI Budget Planner - Allocate Spend Across Use Cases

Open calculator → 📖 Read guide

Enterprise 6 free tools

🏦

TCO Quick - 5-Question Wizard for AI Total Cost of Ownership

Open calculator → 📖 Read guide

📊

Scale Projection - What Happens to Your Bill at 10×, 100×?

Open calculator → 📖 Read guide

🛡️

Vendor Concentration Risk - How Exposed Is Your AI Portfolio?

Open calculator → 📖 Read guide

🧠

Agentic AI Stack - Full Cost from Tools to Memory

Open calculator → 📖 Read guide

🎓

Fine-Tuning Cost - Training + Inference Break-Even

Open calculator → 📖 Read guide

📈

AI Pricing History Explorer - Track Provider Price Changes Over Time

Open calculator → 📖 Read guide

Used these and still want help? Book free 15-min discovery →

More than just AI vendor optimization

On Optimize and Forecast tiers, we pair AI cost analysis with ToolsInfo's 115K+ tool catalog (operated by the same team). You don't just save AI cost — you save labor cost too.

Example: "Switch OpenAI → Claude (save $400/mo) and automate invoice reconciliation via Zapier + GoHighLevel (save $1,200/mo in labor). Net: $1,600/mo."

Ready for AICost Clarity?

Pick a size above, or talk to us first — free.

Book a free 15-min discovery →

📖 Data sources & methodology 161 text models · 9 embeddings · 24 vision · 41 audio · 8 vector DBs across 10 vendor pages · last verified 2026-06-05

Methodology

All prices are USD per 1 million tokens, current as of 2026-06-05.
Vendor-published values have no mark. Inferred/extrapolated values are marked with * and listed below.
Batch API discounts are 50% off standard rates across providers that offer Batch mode.
Prompt caching discounts vary by provider (typically 80-90% off cached input tokens).
Regional data-residency surcharges (Anthropic 1.1x, OpenAI 1.1x, Google regional tiers) are NOT included in base rates.
Long-context pricing tiers apply when input exceeds model threshold.
Embedding prices are input-only (no output tokens generated).

Primary sources

Last-verified date is the most recent successful daily snapshot (aicost_pricing_snapshots) or, when no snapshot exists yet, the latest successful crawler run (aicost_crawler_runs). 10 of 10 vendors are currently verified. Aggregator services (TokenCost, AI Pricing Guru, etc.) are not listed.

Anthropic

2026-06-05

https://www.anthropic.com/pricing

Daily snapshot since Sep 2023 · 578 days captured

Anthropic Docs

2026-06-05

https://platform.claude.com/docs/en/about-claude/pricing

Daily snapshot since Sep 2023 · 578 days captured

OpenAI

2026-06-05

https://openai.com/api/pricing/

Daily snapshot since Sep 2023 · 579 days captured

Google AI

2026-06-05

https://ai.google.dev/gemini-api/docs/pricing

Daily snapshot since Dec 2023 · 554 days captured

Google Vertex

2026-06-05

https://cloud.google.com/vertex-ai/generative-ai/pricing

Daily snapshot since Dec 2023 · 554 days captured

DeepSeek

2026-06-05

https://api-docs.deepseek.com/quick_start/pricing

Daily snapshot since May 2024 · 493 days captured

xAI

2026-06-05

https://x.ai/api

Daily snapshot since Nov 2024 · 411 days captured

Mistral

2026-06-05

https://mistral.ai/pricing

Daily snapshot since Dec 2023 · 552 days captured

Cohere

2026-06-05

https://cohere.com/pricing

Daily snapshot since Sep 2023 · 578 days captured

Voyage AI

2026-06-05

https://docs.voyageai.com/docs/pricing

Inferred values (marked with * in calculator tables)

Derived from industry conventions, not directly published by the vendor. Typical conventions: cached input = 10% of base (90% off), Batch API = 50% of base (50% off).

Vendor / Model	Field	Why it’s inferred
Anthropic — Claude Sonnet 4.6	`cachedInput`	Derived at 10% of input rate — Anthropic publishes 90% cache-hit discount on this tier.
Anthropic — Claude Sonnet 4.5	`cachedInput`	Derived at 10% of input rate; same 90% cache-hit convention as Sonnet 4.6.
Anthropic — Claude Sonnet 4.5	`batchInput`	Derived at 50% of standard input — Anthropic documents uniform 50% Batch discount.
Anthropic — Claude Sonnet 4.5	`batchOutput`	Derived at 50% of standard output — Anthropic documents uniform 50% Batch discount.
Anthropic — Claude Haiku 4.5	`cachedInput`	Derived at 10% of input rate — Anthropic 90% cache-hit discount convention.
OpenAI — GPT-5.4 Mini	`cachedInput`	Derived at 10% of input — OpenAI documents automatic 90% discount on cache hits across GPT-5.x tier.
OpenAI — GPT-5.4 Nano	`cachedInput`	Derived at 10% of input — OpenAI 90% cache-hit convention.
OpenAI — GPT-5.4 Nano	`batchInput`	Derived at 50% of input — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.4 Nano	`batchOutput`	Derived at 50% of output — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.4 Pro	`cachedInput`	Derived at 10% of input — OpenAI 90% cache-hit convention.
OpenAI — GPT-5.4 Pro	`batchInput`	Derived at 50% of input — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.4 Pro	`batchOutput`	Derived at 50% of output — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.2	`cachedInput`	Derived at 10% of input; no residency uplift.
OpenAI — GPT-5.2	`batchInput`	Derived at 50% of input.
OpenAI — GPT-5.2	`batchOutput`	Derived at 50% of output.
OpenAI — GPT-5	`cachedInput`	Derived at 10% of input.
OpenAI — GPT-5	`batchInput`	Derived at 50% of input.
OpenAI — GPT-5	`batchOutput`	Derived at 50% of output.
OpenAI — GPT-5.5 Pro	`cachedInput`	Derived at 10% of input — OpenAI does not publish a cached rate for *-pro models; using the family convention.
OpenAI — GPT-5.5 Pro	`batchInput`	Derived at 50% of input.
OpenAI — GPT-5.5 Pro	`batchOutput`	Derived at 50% of output.
OpenAI — GPT-5.2 Pro	`cachedInput`	Derived at 10% of input — pro-tier convention.
OpenAI — GPT-5.2 Pro	`batchInput`	Derived at 50% of input.
OpenAI — GPT-5.2 Pro	`batchOutput`	Derived at 50% of output.
OpenAI — GPT-5.1	`batchInput`	Derived at 50% of input.
OpenAI — GPT-5.1	`batchOutput`	Derived at 50% of output.
OpenAI — GPT-5 Pro	`batchInput`	Derived at 50% of input.
OpenAI — GPT-5 Pro	`batchOutput`	Derived at 50% of output.
OpenAI — GPT-5 Nano	`cachedInput`	Derived at 10% of input.
OpenAI — GPT-5 Nano	`batchInput`	Derived at 50% of input.
OpenAI — GPT-5 Nano	`batchOutput`	Derived at 50% of output.
Google — Gemini 3 Flash	`cachedInput`	Derived at 10% of input — Google caching discount convention ~90%.
Google — Gemini 3.1 Flash-Lite	`cachedInput`	Derived at 10% of input — Google caching convention.
Google — Gemini 3.1 Flash-Lite	`batchInput`	Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 3.1 Flash-Lite	`batchOutput`	Derived at 50% of output — Google Batch API uniform 50% discount.
Google — Gemini 2.5 Pro	`cachedInput`	Derived at 10% of input.
Google — Gemini 2.5 Flash	`cachedInput`	Derived at 10% of input.
Google — Gemini 2.5 Flash-Lite	`cachedInput`	Derived at 10% of input — Google caching convention.
Google — Gemini 2.5 Flash-Lite	`batchInput`	Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 2.5 Flash-Lite	`batchOutput`	Derived at 50% of output — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash	`cachedInput`	Derived at 25% of input per Google 2.0 family caching rates.
Google — Gemini 2.0 Flash	`batchInput`	Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash	`batchOutput`	Derived at 50% of output — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash-Lite	`cachedInput`	Derived at 10% of input — Google caching convention.
Google — Gemini 2.0 Flash-Lite	`batchInput`	Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash-Lite	`batchOutput`	Derived at 50% of output — Google Batch API uniform 50% discount.
xAI — Grok 4 (legacy)	`cachedInput`	Extrapolated at 25% of base.

Pricing is cross-verified against the LiteLLM community registry when available. Daily snapshots are kept in aicost_pricing_snapshots; every change is logged to aicost_price_changelog with old & new values for full audit trail. Read the full methodology →