Guides → Playground & Guide → Consumer AI Bill Diagnose - Where Your $20-200/mo AI Spend Is Going
Meet Lena Park. Freelance designer paying for 6 AI subscriptions. "I'm paying $130/month across ChatGPT Plus, Claude Pro, Cursor, Midjourney, Perplexity Pro, and ElevenLabs. Half of these I barely use - which to keep?"
🔥 Yearly cost: $1,560. Estimated waste: probably $600+. Need to see it clearly.
Consumer AI sprawl is the new SaaS sprawl. Average AI-active user has 4-6 paid subscriptions. ChatGPT Plus ($20), Claude Pro ($20), Cursor ($20), Midjourney ($10-30), Perplexity Pro ($20), ElevenLabs ($5-22), GitHub Copilot ($10), v0 ($20), and so on. $100-200/month is common - most goes unused.
Lena's $130/month adds up to $1,560/year. Auditing her 30-day usage: ChatGPT 80 messages (heavy), Claude 8 messages (light), Cursor daily (heavy), Midjourney 4 images (light), Perplexity 12 searches (light), ElevenLabs 2 audio renders (very light). Cut Claude, Midjourney, ElevenLabs → save $720/year. Use Claude free tier when needed, Midjourney pay-per-image, ElevenLabs free tier.
Three patterns of waste. (1) Forgotten subscriptions - auto-renew, never use. (2) Tier overshoot - paying $20 when $0 free tier covers usage. (3) Duplicate capabilities - paying for ChatGPT Plus AND Claude Pro AND Cursor when one would do. The diagnose calc surfaces all three.
Upload your ChatGPT, Claude, Cursor, Perplexity, or Midjourney receipt. See where your subscription dollars actually go and get cheaper-tier or shared-plan recommendations.
Below: live sliders. Move them to see numbers change in real time.
Each input shapes your cost. Move the slider — see the impact.
Open the full calculator — pick a model, enter your tokens, see per-call, daily, monthly, and annual cost.
🚀 Open the full calculator →Estimated waste = monthly spend × (1 - heavy use ratio). Lena: $130 × 50% = $65/mo waste = $780/year just on subscriptions she barely opens.
Heavy-use subs to KEEP. 3-5 messages/week minimum. If you hit message limits sometimes, paid tier is justified.
Light-use subs to CUT. Less than 5 messages/week - free tier or pay-per-use will cover. Cancel and use sparingly via free.
Duplicate capabilities - pick one. ChatGPT + Claude + Gemini paid all → keep one. Pick by which has the features you actually use (memory, vision, voice, code execution, etc.).
Same calculator, three different team sizes. Click a tab to see how the numbers shift.
ChatGPT Plus only, used daily. Tier is right-sized. Maybe consider Plus → Free if usage drops, but otherwise good.
Healthy range: Healthy - paid tier justified
6 subs, 3 actively used. Cancel Claude (use free tier), Midjourney (pay per image), ElevenLabs (free tier). Save $720-780/year. Feature-test free tiers before canceling.
Healthy range: $65/mo waste - cancel 3, save $780/year
10 subs, only 3 used heavily. Likely paying for ChatGPT + Claude + Gemini + Cursor + Copilot all. Pick 1-2 chat AIs (one for general, one for code), cancel the rest. Save $1,800/year.
Healthy range: $155/mo waste - major audit needed
Cost isn't the only dimension. Click any constraint — see how recommendations change.
The cheapest tier you'll actually use is the right tier. Free → annual upgrade later beats annual → cancel mid-term.
Most consumer free tiers limit to non-frontier models. Quality is fine for daily use; matters less than people think for casual queries.
ChatGPT Plus, Claude Pro, etc. consumer tiers train on user content unless you explicitly opt out. Don't paste customer data, code from work, etc.
Most providers let you opt out of training in settings. Do this on day 1 for every consumer subscription.
If you hit 'try again later' messages on free tier 3+ times/day, paid tier earns its keep on responsiveness alone.
ChatGPT memory + conversation history make canceling psychologically hard. Export important threads before deciding.
For most users, ONE paid subscription handles 80% of needs. Pick by feature fit (vision, voice, code, memory). Free-tier the rest.
Tradeoff analysis is where most AI projects go sideways. Talk to a CFO-grade AI cost analyst →
Pre-loaded scenarios for the most common applications. Click a tab to see realistic numbers — then the "Try this scenario" button to load it into the calculator above.
Standard freelancer setup. ChatGPT + Claude + Cursor + 2 specialty (Midjourney/v0/etc). Audit: which specialty subs do you ship work with? Keep those. Cancel the rest.
Healthy range: $40/mo waste - cancel 2
Cursor + ChatGPT + Copilot + 1 image AI. Code subs (Cursor + Copilot) overlap - pick one. ChatGPT for product thinking. Total: $50-60/mo healthy.
Healthy range: $20/mo waste - minor cuts
Midjourney + Runway + ElevenLabs + ChatGPT + Suno + Krea + 1 more. Half barely used. Pick 1-2 image AIs that fit your style, 1 video AI, 1 audio AI. Cancel the rest. Save $1,200/year.
Healthy range: $100/mo waste - heavy audit
After audit + canceling. 3 subs, all used. $60/mo. This is the post-optimization steady state. Re-audit quarterly.
Healthy range: Healthy - keep all
Honest limitations — every model is wrong; some are useful. Where this one falls short:
For these, use: Subscription Picker for replacement choice. Family Plan for shared. Free Tier Checker to verify.
Author: Subu Vdaygiri, Founder & CEO of CloudIntelligence.ai. 17 years Fortune 100 (Ingram Micro, Siemens). Wharton CTO program · Kellogg CPO program · 10× AWS+Azure certified.
Why this matters: pricing for major vendors has dropped 40-90% in the last 24 months. A budget set 12 months ago is probably wrong by 30%+.
View 3-year history for →
Last-verified date is the most recent successful daily snapshot
(aicost_pricing_snapshots) or, when no snapshot exists yet,
the latest successful crawler run (aicost_crawler_runs).
10 of 10
vendors are currently verified. Aggregator services (TokenCost, AI Pricing Guru, etc.)
are not listed.
Derived from industry conventions, not directly published by the vendor. Typical conventions: cached input = 10% of base (90% off), Batch API = 50% of base (50% off).
| Vendor / Model | Field | Why it’s inferred |
|---|---|---|
| Anthropic — Claude Sonnet 4.6 | cachedInput |
Derived at 10% of input rate — Anthropic publishes 90% cache-hit discount on this tier. |
| Anthropic — Claude Sonnet 4.5 | cachedInput |
Derived at 10% of input rate; same 90% cache-hit convention as Sonnet 4.6. |
| Anthropic — Claude Sonnet 4.5 | batchInput |
Derived at 50% of standard input — Anthropic documents uniform 50% Batch discount. |
| Anthropic — Claude Sonnet 4.5 | batchOutput |
Derived at 50% of standard output — Anthropic documents uniform 50% Batch discount. |
| Anthropic — Claude Haiku 4.5 | cachedInput |
Derived at 10% of input rate — Anthropic 90% cache-hit discount convention. |
| OpenAI — GPT-5.4 Mini | cachedInput |
Derived at 10% of input — OpenAI documents automatic 90% discount on cache hits across GPT-5.x tier. |
| OpenAI — GPT-5.4 Nano | cachedInput |
Derived at 10% of input — OpenAI 90% cache-hit convention. |
| OpenAI — GPT-5.4 Nano | batchInput |
Derived at 50% of input — OpenAI Batch API uniform 50% discount. |
| OpenAI — GPT-5.4 Nano | batchOutput |
Derived at 50% of output — OpenAI Batch API uniform 50% discount. |
| OpenAI — GPT-5.4 Pro | cachedInput |
Derived at 10% of input — OpenAI 90% cache-hit convention. |
| OpenAI — GPT-5.4 Pro | batchInput |
Derived at 50% of input — OpenAI Batch API uniform 50% discount. |
| OpenAI — GPT-5.4 Pro | batchOutput |
Derived at 50% of output — OpenAI Batch API uniform 50% discount. |
| OpenAI — GPT-5.2 | cachedInput |
Derived at 10% of input; no residency uplift. |
| OpenAI — GPT-5.2 | batchInput |
Derived at 50% of input. |
| OpenAI — GPT-5.2 | batchOutput |
Derived at 50% of output. |
| OpenAI — GPT-5 | cachedInput |
Derived at 10% of input. |
| OpenAI — GPT-5 | batchInput |
Derived at 50% of input. |
| OpenAI — GPT-5 | batchOutput |
Derived at 50% of output. |
| OpenAI — GPT-5.5 Pro | cachedInput |
Derived at 10% of input — OpenAI does not publish a cached rate for *-pro models; using the family convention. |
| OpenAI — GPT-5.5 Pro | batchInput |
Derived at 50% of input. |
| OpenAI — GPT-5.5 Pro | batchOutput |
Derived at 50% of output. |
| OpenAI — GPT-5.2 Pro | cachedInput |
Derived at 10% of input — pro-tier convention. |
| OpenAI — GPT-5.2 Pro | batchInput |
Derived at 50% of input. |
| OpenAI — GPT-5.2 Pro | batchOutput |
Derived at 50% of output. |
| OpenAI — GPT-5.1 | batchInput |
Derived at 50% of input. |
| OpenAI — GPT-5.1 | batchOutput |
Derived at 50% of output. |
| OpenAI — GPT-5 Pro | batchInput |
Derived at 50% of input. |
| OpenAI — GPT-5 Pro | batchOutput |
Derived at 50% of output. |
| OpenAI — GPT-5 Nano | cachedInput |
Derived at 10% of input. |
| OpenAI — GPT-5 Nano | batchInput |
Derived at 50% of input. |
| OpenAI — GPT-5 Nano | batchOutput |
Derived at 50% of output. |
| Google — Gemini 3 Flash | cachedInput |
Derived at 10% of input — Google caching discount convention ~90%. |
| Google — Gemini 3.1 Flash-Lite | cachedInput |
Derived at 10% of input — Google caching convention. |
| Google — Gemini 3.1 Flash-Lite | batchInput |
Derived at 50% of input — Google Batch API uniform 50% discount. |
| Google — Gemini 3.1 Flash-Lite | batchOutput |
Derived at 50% of output — Google Batch API uniform 50% discount. |
| Google — Gemini 2.5 Pro | cachedInput |
Derived at 10% of input. |
| Google — Gemini 2.5 Flash | cachedInput |
Derived at 10% of input. |
| Google — Gemini 2.5 Flash-Lite | cachedInput |
Derived at 10% of input — Google caching convention. |
| Google — Gemini 2.5 Flash-Lite | batchInput |
Derived at 50% of input — Google Batch API uniform 50% discount. |
| Google — Gemini 2.5 Flash-Lite | batchOutput |
Derived at 50% of output — Google Batch API uniform 50% discount. |
| Google — Gemini 2.0 Flash | cachedInput |
Derived at 25% of input per Google 2.0 family caching rates. |
| Google — Gemini 2.0 Flash | batchInput |
Derived at 50% of input — Google Batch API uniform 50% discount. |
| Google — Gemini 2.0 Flash | batchOutput |
Derived at 50% of output — Google Batch API uniform 50% discount. |
| Google — Gemini 2.0 Flash-Lite | cachedInput |
Derived at 10% of input — Google caching convention. |
| Google — Gemini 2.0 Flash-Lite | batchInput |
Derived at 50% of input — Google Batch API uniform 50% discount. |
| Google — Gemini 2.0 Flash-Lite | batchOutput |
Derived at 50% of output — Google Batch API uniform 50% discount. |
| xAI — Grok 4 (legacy) | cachedInput |
Extrapolated at 25% of base. |
Pricing is cross-verified against the
LiteLLM community registry
when available. Daily snapshots are kept in aicost_pricing_snapshots;
every change is logged to aicost_price_changelog with old & new
values for full audit trail. Read the full methodology →