100% free · No signup required

What's your AI cost issue?

Pick one or more situations that match yours, or browse the full catalog below.

46 calculators · 2 playbooks · verified pricing across 158+ models · last updated 2026-06-05


🗂️ All 46 calculators, organized by what they solve.

Popular tools first, then grouped by problem, then alphabetical for power users.

🔥 Popular this week

What other people are running through aicost.ai right now.

Popular
🧮

AI Cost Calculator

The foundational "what does this workload cost?" tool.

Try the calculator →
📖 Guide
Popular
🔤

Token Estimator

Paste text → instant cost across every model.

Try the calculator →
📖 Guide
Popular
🔍

AI Model Finder

Compare every model side-by-side. Filter by context, price, modality.

Try the calculator →
📖 Guide
Popular
✂️

Token Reduction Stack

Stack 6 techniques. See exact savings on YOUR workload.

Try the calculator →
📖 Guide
Popular
📈

ROI Quick Check

Hours saved × labor cost vs spend. Payback in 30 seconds.

Try the calculator →
📖 Guide
Popular
🤖

Agentic AI Stack

Planner + executor loop + verifier. Full agent cost on one screen.

Try the calculator →
📖 Guide

🚨 My bill is spiking (7)

Audit a runaway bill, find the biggest line items, model what one bad day costs.

Popular
🔤

Token Estimator

Paste text → instant cost across every model.

Try the calculator →
📖 Guide
💰

Cheapest Model Finder

Lowest-cost model that meets your quality bar.

Try the calculator →
📖 Guide
Popular
✂️

Token Reduction Stack

Stack 6 techniques. See exact savings on YOUR workload.

Try the calculator →
📖 Guide
💹

Margin Calculator

Is your AI feature profitable? Gross margin + grade.

Try the calculator →
📖 Guide
📆

Quarterly Spend Forecaster

The agentic shift. Your CFO is about to be surprised.

Try the calculator →
📖 Guide
⚠️

Overage Forecaster

True monthly cost on hybrid (seat + overage) plans.

Try the calculator →
📖 Guide
Popular
📈

Pricing History Explorer

2.5+ years of vendor pricing. 62K+ historical price points.

Try the calculator →
📖 Guide

📉 Cut what I'm spending (17)

Caching, batching, model routing, distillation — stack the savings on what you already run.

Popular
🔤

Token Estimator

Paste text → instant cost across every model.

Try the calculator →
📖 Guide
📏

Context Window Cost

At what doc size does your cost double?

Try the calculator →
📖 Guide
💰

Cheapest Model Finder

Lowest-cost model that meets your quality bar.

Try the calculator →
📖 Guide
Popular
✂️

Token Reduction Stack

Stack 6 techniques. See exact savings on YOUR workload.

Try the calculator →
📖 Guide
💾

Prompt Cache ROI

Will caching save money or cost money? Break-even hit-rate by vendor.

Try the calculator →
📖 Guide
🔀

Multi-Model Router

Route easy queries to cheap, hard ones to flagship. Tier savings.

Try the calculator →
📖 Guide

Batch vs Realtime

~50% off async. What fraction of your workload tolerates 24h SLA?

Try the calculator →
📖 Guide
⚖️

Buy vs Build

Outcome-priced vendor or build it yourself? Engineering cost included.

Try the calculator →
📖 Guide
🧠

RAG vs Fine-Tuning

Head-to-head cost across both approaches at YOUR query volume.

Try the calculator →
📖 Guide
🧩

Chunking Optimizer

Chunk size vs retrieval quality vs token cost. Find the sweet spot.

Try the calculator →
📖 Guide
🔎

Hybrid Search Cost

Vector + BM25. Twice the cost, often 30%+ better recall.

Try the calculator →
📖 Guide
🎓

Fine-Tuning Cost

Training + inference + break-even vs base. 4 providers.

Try the calculator →
📖 Guide
🖥️

Self-Host Breakeven

Llama / Qwen on rented GPUs vs API. Real ops costs included.

Try the calculator →
📖 Guide
🗓️

Annual vs Monthly

Lock-in cost vs commit discount. Quantified.

Try the calculator →
📖 Guide
🛠️

Dev Stack Calculator

Find overlap in your AI dev tools. Stop paying for redundancy.

Try the calculator →
📖 Guide
⚖️

API vs Pro+SDK Breakeven

Direct API or a Claude Pro/Max/Team plan? Models the June-15 Agent SDK carve-out.

Try the calculator →
📖 Guide
🏗️

Subscription Picker for Builders

Best AI subscription for your team by persona — founder, CTO, FP&A, consultant.

Try the calculator →
📖 Guide

🧮 Plan a new workload (16)

Model the workload before you build it. Forecast, scale, and break-even analysis.

Popular
🧮

AI Cost Calculator

The foundational "what does this workload cost?" tool.

Try the calculator →
📖 Guide
📏

Context Window Cost

At what doc size does your cost double?

Try the calculator →
📖 Guide
Popular
🔍

AI Model Finder

Compare every model side-by-side. Filter by context, price, modality.

Try the calculator →
📖 Guide
⚖️

Buy vs Build

Outcome-priced vendor or build it yourself? Engineering cost included.

Try the calculator →
📖 Guide
Popular
📈

ROI Quick Check

Hours saved × labor cost vs spend. Payback in 30 seconds.

Try the calculator →
📖 Guide
💹

Margin Calculator

Is your AI feature profitable? Gross margin + grade.

Try the calculator →
📖 Guide
💼

Budget Planner

Plan AI spend across every use case. Where the budget goes.

Try the calculator →
📖 Guide
📅

Annual Cost Forecaster

Project monthly AI spend for 12 months with growth assumptions.

Try the calculator →
📖 Guide
📆

Quarterly Spend Forecaster

The agentic shift. Your CFO is about to be surprised.

Try the calculator →
📖 Guide
📊

Scale Projection

Show CFOs what scaling 10× actually costs. Tier-transition warnings.

Try the calculator →
📖 Guide
⚠️

Vendor Concentration Risk

What if your #1 AI vendor goes down? HHI score + diversification plan.

Try the calculator →
📖 Guide
🌍

Region Cost Map

US vs EU vs APAC pricing. Data-residency premiums up to 25%.

Try the calculator →
📖 Guide
⚖️

API vs Pro+SDK Breakeven

Direct API or a Claude Pro/Max/Team plan? Models the June-15 Agent SDK carve-out.

Try the calculator →
📖 Guide
🏗️

Subscription Picker for Builders

Best AI subscription for your team by persona — founder, CTO, FP&A, consultant.

Try the calculator →
📖 Guide

TCO Quick Estimate

Fast total-cost-of-ownership read in under a minute.

Try the calculator →
📖 Guide
📊

TCO Complete Wizard

Full AI total cost of ownership — workload × vertical × cloud, 6 pillars.

Try the calculator →
📖 Guide

🤖 Cost an agent / RAG / voice stack (17)

Multi-step agents, retrieval pipelines, voice bots — full integrated stack costs.

🧠

RAG vs Fine-Tuning

Head-to-head cost across both approaches at YOUR query volume.

Try the calculator →
📖 Guide
🧩

Chunking Optimizer

Chunk size vs retrieval quality vs token cost. Find the sweet spot.

Try the calculator →
📖 Guide
🔎

Hybrid Search Cost

Vector + BM25. Twice the cost, often 30%+ better recall.

Try the calculator →
📖 Guide
🧬

Embedding Cost

RAG indexing + re-indexing + query cost across 9 embedding models.

Try the calculator →
📖 Guide
🗄️

Vector DB Cost

Pinecone vs Qdrant vs Weaviate vs pgvector. 8 options compared.

Try the calculator →
📖 Guide
📚

RAG Pipeline Cost

Full RAG stack on one screen — ingest, storage, query, generation.

Try the calculator →
📖 Guide
📰

Multimodal RAG Stack

Doc + vision + audio retrieval. Composed cost across all stages.

Try the calculator →
📖 Guide
🔁

Agent Loop Cost

Multi-turn agent cost with context growth + runaway warnings.

Try the calculator →
📖 Guide
Popular
🤖

Agentic AI Stack

Planner + executor loop + verifier. Full agent cost on one screen.

Try the calculator →
📖 Guide
Popular
⚙️

Agentic Workflow Cost

Claude Code · Cursor · Copilot. Monthly burn across 4 vendors.

Try the calculator →
📖 Guide
🎙️

Voice Agent Stack

STT + LLM + TTS with interrupt-waste modeling.

Try the calculator →
📖 Guide
🎧

Audio Cost

Speech-to-text + TTS pricing. Voice agent per-call breakdown.

Try the calculator →
📖 Guide
🖼️

Vision Cost

Image analysis pricing. Per-image, per-tile, with detail levels.

Try the calculator →
📖 Guide
🎓

Fine-Tuning Cost

Training + inference + break-even vs base. 4 providers.

Try the calculator →
📖 Guide
🖥️

Self-Host Breakeven

Llama / Qwen on rented GPUs vs API. Real ops costs included.

Try the calculator →
📖 Guide
Playbook
📖

Agentic AI Playbook

Step-by-step guided journey. Learn the cost levers in ~10 min.

Try the calculator →
📖 Guide
Playbook
📖

Multimodal RAG Playbook

Build a RAG system. Understand cost levers step by step.

Try the calculator →
📖 Guide

📊 Track price trends (4)

2.5 years of vendor pricing history. Free-tier checker. Currency conversion.

Popular
🔍

AI Model Finder

Compare every model side-by-side. Filter by context, price, modality.

Try the calculator →
📖 Guide
💱

Currency Converter

AI spend in 12 currencies — EUR, GBP, INR, JPY, more.

Try the calculator →
📖 Guide
Popular
📈

Pricing History Explorer

2.5+ years of vendor pricing. 62K+ historical price points.

Try the calculator →
📖 Guide
🆓

Free Tier Checker

Do you actually need to pay? Free-tier limits across major vendors.

Try the calculator →
📖 Guide

👤 Personal / consumer AI (6)

Subscription decisions for individuals, families, creators, and developers.

🆓

Free Tier Checker

Do you actually need to pay? Free-tier limits across major vendors.

Try the calculator →
📖 Guide
Popular

Subscription Picker

Which AI subscription should I pay for? 6 quick questions.

Try the calculator →
📖 Guide
🗓️

Annual vs Monthly

Lock-in cost vs commit discount. Quantified.

Try the calculator →
📖 Guide
🎬

Creator Bundle

YouTuber, podcaster, blogger? Optimal AI stack + redundancy report.

Try the calculator →
📖 Guide
👨‍👩‍👧

Family Plan Comparator

Cheapest AI plan for a household. Per-person cost ranked.

Try the calculator →
📖 Guide
🛠️

Dev Stack Calculator

Find overlap in your AI dev tools. Stop paying for redundancy.

Try the calculator →
📖 Guide

📚 All 46 calculators · alphabetical · For power users & search engines
🚀 New · Public Beta · For AI agents · MCP server

Ask AI cost questions inside Claude, ChatGPT, Cursor & Perplexity — natively.

aicost shipped a Model Context Protocol (MCP) server. Plug it into your favorite AI assistant and it can call our 48+ calculators mid-conversation. No more switching tabs to look up pricing — your AI just answers with verified numbers and cites the source.

✓ Working today
  • Claude.ai Pro/Max/Enterprise
  • ChatGPT Plus/Pro/Team (OAuth)
  • Perplexity Pro/Max (OAuth)
  • Cursor · Continue · Zed · Cody · Goose
🔮 Coming next
  • Hybrid pricing (subscription vs API)
  • TCO + ROI playbooks for enterprise
  • Domain calcs (healthcare, finance, dev)
  • Self-serve API keys at aicost.ai/account
📨
Want a beta invite?
Email [email protected] with the AI client you'd use (Claude / ChatGPT / Cursor / Perplexity / etc.) and we'll send setup instructions. Free during beta.

Open standard · Model Context Protocol · live at https://mcp.aicost.ai

Go deeper

Our playbooks on cutting your AI bill.

📚
All Cost Topics
12 playbooks for cost problems
📋
Vendor Pricing Guides
15 vendors · weekly refresh
🎯
Book a Quickscan
$1.5K · 5-day cost review

The calculator's an estimate. Want the real number?

A 5-day Quickscan ($1,500) reviews your actual usage across every pillar — financial, reliability, governance, privacy, MLOps, observability — and returns a concrete savings plan.

Book a Quickscan →
📖 Data sources & methodology

161 text models · 9 embeddings · 24 vision · 41 audio · 8 vector DBs across 10 vendor pages · last verified 2026-06-05

Methodology

  • All prices are USD per 1 million tokens, current as of 2026-06-05.
  • Vendor-published values have no mark. Inferred/extrapolated values are marked with * and listed below.
  • Batch API discounts are 50% off standard rates across providers that offer Batch mode.
  • Prompt caching discounts vary by provider (typically 80-90% off cached input tokens).
  • Regional data-residency surcharges (Anthropic 1.1x, OpenAI 1.1x, Google regional tiers) are NOT included in base rates.
  • Long-context pricing tiers apply when the input exceeds a model's long-context threshold.
  • Embedding prices are input-only (no output tokens generated).
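
As a rough illustration of how the rules above combine, here is a minimal Python sketch of a per-request estimate. The `Rates` class, the `request_cost` function, and the example prices are hypothetical placeholders, not aicost.ai code or any vendor's published rates. How Batch and caching discounts stack differs by vendor, so the sketch assumes cached tokens are not discounted further.

```python
from dataclasses import dataclass

TOKENS_PER_UNIT = 1_000_000  # prices are quoted in USD per 1 million tokens


@dataclass(frozen=True)
class Rates:
    """USD per 1M tokens. The figures used below are placeholders, not vendor prices."""
    input: float
    output: float
    cached_input: float  # rate for input tokens served from the prompt cache


def request_cost(rates, input_tokens, output_tokens, cached_tokens=0,
                 batch=False, residency_multiplier=1.0):
    """Estimate the USD cost of one request.

    cached_tokens is the part of input_tokens billed at the cached rate.
    batch applies a 50% discount to uncached input and to output; cached
    tokens are left untouched (an assumption, since vendors differ).
    residency_multiplier models a surcharge such as 1.1x, which base rates exclude.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    batch_factor = 0.5 if batch else 1.0
    uncached = input_tokens - cached_tokens
    usd = (
        uncached * rates.input * batch_factor
        + cached_tokens * rates.cached_input
        + output_tokens * rates.output * batch_factor
    ) / TOKENS_PER_UNIT
    return usd * residency_multiplier


example = Rates(input=3.00, output=15.00, cached_input=0.30)  # placeholder prices
standard = request_cost(example, input_tokens=20_000, output_tokens=1_000)
cached = request_cost(example, 20_000, 1_000, cached_tokens=16_000)
batched = request_cost(example, 20_000, 1_000, batch=True)
resident = request_cost(example, 20_000, 1_000, residency_multiplier=1.1)
print(f"standard ${standard:.4f} · cached ${cached:.4f} · "
      f"batch ${batched:.4f} · residency ${resident:.4f}")
```

Embedding workloads fit the same function with `output_tokens=0`, since embedding prices are input-only.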

Primary sources

Last-verified date is the most recent successful daily snapshot (aicost_pricing_snapshots) or, when no snapshot exists yet, the latest successful crawler run (aicost_crawler_runs). 10 of 10 vendors are currently verified. Aggregator services (TokenCost, AI Pricing Guru, etc.) are not listed.

Anthropic
2026-06-05
https://www.anthropic.com/pricing
Daily snapshot since Sep 2023 · 578 days captured
Anthropic Docs
2026-06-05
https://platform.claude.com/docs/en/about-claude/pricing
Daily snapshot since Sep 2023 · 578 days captured
OpenAI
2026-06-05
https://openai.com/api/pricing/
Daily snapshot since Sep 2023 · 579 days captured
Google AI
2026-06-05
https://ai.google.dev/gemini-api/docs/pricing
Daily snapshot since Dec 2023 · 554 days captured
Google Vertex
2026-06-05
https://cloud.google.com/vertex-ai/generative-ai/pricing
Daily snapshot since Dec 2023 · 554 days captured
DeepSeek
2026-06-05
https://api-docs.deepseek.com/quick_start/pricing
Daily snapshot since May 2024 · 493 days captured
xAI
2026-06-05
https://x.ai/api
Daily snapshot since Nov 2024 · 411 days captured
Mistral
2026-06-05
https://mistral.ai/pricing
Daily snapshot since Dec 2023 · 552 days captured
Cohere
2026-06-05
https://cohere.com/pricing
Daily snapshot since Sep 2023 · 578 days captured
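
The last-verified rule described above (newest snapshot first, crawler run as fallback) could be sketched as follows. This is an illustration only. The function name and the input shapes are assumptions, and the real schemas of aicost_pricing_snapshots and aicost_crawler_runs are not documented here.

```python
from datetime import date


def last_verified(snapshot_days, crawler_runs):
    """Return the last-verified date for one vendor, or None if never verified.

    snapshot_days: dates of successful daily snapshots (hypothetical shape).
    crawler_runs: (date, succeeded) pairs for crawler runs (hypothetical shape).
    """
    if snapshot_days:
        # A snapshot exists, so the most recent one wins.
        return max(snapshot_days)
    # No snapshot yet: fall back to the latest successful crawler run.
    successful = [day for day, ok in crawler_runs if ok]
    return max(successful) if successful else None


runs = [(date(2026, 6, 4), True), (date(2026, 6, 5), False)]
with_snapshots = last_verified([date(2026, 6, 3), date(2026, 6, 5)], runs)
without_snapshots = last_verified([], runs)
print(with_snapshots, without_snapshots)
```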

Inferred values (marked with * in calculator tables)

Derived from industry conventions, not directly published by the vendor. Typical conventions: cached input = 10% of base (90% off), Batch API = 50% of base (50% off).

Vendor | Model | Field | Why it’s inferred
Anthropic | Claude Sonnet 4.6 | cachedInput | Derived at 10% of input rate; Anthropic publishes 90% cache-hit discount on this tier.
Anthropic | Claude Sonnet 4.5 | cachedInput | Derived at 10% of input rate; same 90% cache-hit convention as Sonnet 4.6.
Anthropic | Claude Sonnet 4.5 | batchInput | Derived at 50% of standard input; Anthropic documents uniform 50% Batch discount.
Anthropic | Claude Sonnet 4.5 | batchOutput | Derived at 50% of standard output; Anthropic documents uniform 50% Batch discount.
Anthropic | Claude Haiku 4.5 | cachedInput | Derived at 10% of input rate; Anthropic 90% cache-hit discount convention.
OpenAI | GPT-5.4 Mini | cachedInput | Derived at 10% of input; OpenAI documents automatic 90% discount on cache hits across GPT-5.x tier.
OpenAI | GPT-5.4 Nano | cachedInput | Derived at 10% of input; OpenAI 90% cache-hit convention.
OpenAI | GPT-5.4 Nano | batchInput | Derived at 50% of input; OpenAI Batch API uniform 50% discount.
OpenAI | GPT-5.4 Nano | batchOutput | Derived at 50% of output; OpenAI Batch API uniform 50% discount.
OpenAI | GPT-5.4 Pro | cachedInput | Derived at 10% of input; OpenAI 90% cache-hit convention.
OpenAI | GPT-5.4 Pro | batchInput | Derived at 50% of input; OpenAI Batch API uniform 50% discount.
OpenAI | GPT-5.4 Pro | batchOutput | Derived at 50% of output; OpenAI Batch API uniform 50% discount.
OpenAI | GPT-5.2 | cachedInput | Derived at 10% of input; no residency uplift.
OpenAI | GPT-5.2 | batchInput | Derived at 50% of input.
OpenAI | GPT-5.2 | batchOutput | Derived at 50% of output.
OpenAI | GPT-5 | cachedInput | Derived at 10% of input.
OpenAI | GPT-5 | batchInput | Derived at 50% of input.
OpenAI | GPT-5 | batchOutput | Derived at 50% of output.
OpenAI | GPT-5.5 Pro | cachedInput | Derived at 10% of input. OpenAI does not publish a cached rate for *-pro models; using the family convention.
OpenAI | GPT-5.5 Pro | batchInput | Derived at 50% of input.
OpenAI | GPT-5.5 Pro | batchOutput | Derived at 50% of output.
OpenAI | GPT-5.2 Pro | cachedInput | Derived at 10% of input; pro-tier convention.
OpenAI | GPT-5.2 Pro | batchInput | Derived at 50% of input.
OpenAI | GPT-5.2 Pro | batchOutput | Derived at 50% of output.
OpenAI | GPT-5.1 | batchInput | Derived at 50% of input.
OpenAI | GPT-5.1 | batchOutput | Derived at 50% of output.
OpenAI | GPT-5 Pro | batchInput | Derived at 50% of input.
OpenAI | GPT-5 Pro | batchOutput | Derived at 50% of output.
OpenAI | GPT-5 Nano | cachedInput | Derived at 10% of input.
OpenAI | GPT-5 Nano | batchInput | Derived at 50% of input.
OpenAI | GPT-5 Nano | batchOutput | Derived at 50% of output.
Google | Gemini 3 Flash | cachedInput | Derived at 10% of input; Google caching discount convention ~90%.
Google | Gemini 3.1 Flash-Lite | cachedInput | Derived at 10% of input; Google caching convention.
Google | Gemini 3.1 Flash-Lite | batchInput | Derived at 50% of input; Google Batch API uniform 50% discount.
Google | Gemini 3.1 Flash-Lite | batchOutput | Derived at 50% of output; Google Batch API uniform 50% discount.
Google | Gemini 2.5 Pro | cachedInput | Derived at 10% of input.
Google | Gemini 2.5 Flash | cachedInput | Derived at 10% of input.
Google | Gemini 2.5 Flash-Lite | cachedInput | Derived at 10% of input; Google caching convention.
Google | Gemini 2.5 Flash-Lite | batchInput | Derived at 50% of input; Google Batch API uniform 50% discount.
Google | Gemini 2.5 Flash-Lite | batchOutput | Derived at 50% of output; Google Batch API uniform 50% discount.
Google | Gemini 2.0 Flash | cachedInput | Derived at 25% of input per Google 2.0 family caching rates.
Google | Gemini 2.0 Flash | batchInput | Derived at 50% of input; Google Batch API uniform 50% discount.
Google | Gemini 2.0 Flash | batchOutput | Derived at 50% of output; Google Batch API uniform 50% discount.
Google | Gemini 2.0 Flash-Lite | cachedInput | Derived at 10% of input; Google caching convention.
Google | Gemini 2.0 Flash-Lite | batchInput | Derived at 50% of input; Google Batch API uniform 50% discount.
Google | Gemini 2.0 Flash-Lite | batchOutput | Derived at 50% of output; Google Batch API uniform 50% discount.
xAI | Grok 4 (legacy) | cachedInput | Extrapolated at 25% of base.
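
The derivations in this table follow one pattern, which a short Python sketch can make concrete. The field names `cachedInput`, `batchInput`, and `batchOutput` come from the table. The functions `fill_inferred` and `display`, the dictionary layout, and the sample prices are hypothetical and only illustrate the convention.

```python
def fill_inferred(published, cache_ratio=0.10, batch_ratio=0.50):
    """Fill missing fields from base rates and report which ones were derived.

    published: dict with "input" and "output" rates (USD per 1M tokens) plus
    any of cachedInput / batchInput / batchOutput that the vendor publishes.
    Returns (rates, inferred), where inferred is the set of derived field names.
    """
    rates = dict(published)
    inferred = set()
    derived = {
        "cachedInput": ("input", cache_ratio),
        "batchInput": ("input", batch_ratio),
        "batchOutput": ("output", batch_ratio),
    }
    for field, (base, ratio) in derived.items():
        if rates.get(field) is None:
            rates[field] = round(published[base] * ratio, 6)
            inferred.add(field)
    return rates, inferred


def display(rates, inferred, field):
    """Render a price, appending * when the value was inferred."""
    mark = "*" if field in inferred else ""
    return f"{rates[field]:g}{mark}"


# Placeholder prices: the vendor publishes a cached rate but no Batch rates.
rates, inferred = fill_inferred({"input": 2.0, "output": 8.0, "cachedInput": 0.5})
shown = [display(rates, inferred, f) for f in ("cachedInput", "batchInput", "batchOutput")]
print(shown)
```

Rows that use a different ratio, such as the 25% cached rates listed above, would pass `cache_ratio=0.25` instead of relying on the default.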

Pricing is cross-verified against the LiteLLM community registry when available. Daily snapshots are kept in aicost_pricing_snapshots; every change is logged to aicost_price_changelog with old & new values for full audit trail. Read the full methodology →
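
For readers curious what an old-and-new changelog entry might look like, here is a minimal Python sketch that compares two daily snapshots. The snapshot shape (`{model: {field: price}}`), the function name, and the model names are assumptions for illustration. The actual layout of aicost_price_changelog is not shown on this page.

```python
def diff_snapshots(old, new):
    """Compare two snapshots shaped {model: {field: price}} and list every change.

    A model or field missing on one side is recorded with None on that side,
    so additions and removals are logged alongside price changes.
    """
    changes = []
    for model in sorted(set(old) | set(new)):
        before, after = old.get(model, {}), new.get(model, {})
        for field in sorted(set(before) | set(after)):
            if before.get(field) != after.get(field):
                changes.append({
                    "model": model,
                    "field": field,
                    "old": before.get(field),
                    "new": after.get(field),
                })
    return changes


# Hypothetical models and prices.
yesterday = {"model-a": {"input": 3.0, "output": 15.0}}
today = {"model-a": {"input": 2.5, "output": 15.0}, "model-b": {"input": 1.0}}
changelog = diff_snapshots(yesterday, today)
for entry in changelog:
    print(entry)
```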