Find the cheapest AI model for your workload
Compare every major LLM side-by-side. Sorted by price. Filter by context, modality, or compliance.
Filter the full catalog of 150+ AI models by your role, workload, capability needs, and price ceiling — output is a ranked shortlist you can ship to other calculators.
- Stop reading vendor blog posts — every model is in one table, freshness-stamped
- Filter by what you actually need (vision, long-context, tier-1 only) instead of skimming marketing pages
- Price slider + min-context slider narrow 150+ models to the 5-10 worth comparing
- Compare checkbox lets you hold 2-4 candidates side-by-side before exporting to Cost Calculator
These are the inputs, outputs, and how you can use this calculator for your AI workloads.
- Your roleTunes filter defaults to your priorities
- Your workload scenarioPre-tunes capability filters
- Capability chipsAND-filters by feature
- Max input price per 1M tokensCuts everything above your ceiling
- Minimum context windowCuts small-window models
- Provider filtersVendor allowlist
- Ranked model listModels matching all filters
- Side-by-side compareCheck 2-4 to compare in detail
- Pricing freshnessDays since last verification
- Capability tagsWhat this model can do
Cut 150+ models to 5-10 candidates without reading any vendor blog post
New providers launch monthly — this surfaces them instead of you finding out on Twitter
Compare table is screenshot-ready for finance and security review
Export shortlist directly to Cost Calculator or Multi-Model Router for the actual decision
👇 Now try the calculator below with your own AI workloads
| Model | Provider | Input $/M ↓ | Output $/M | Cached $/M | Batch | Context | Modalities | Tags |
|---|
- Shortlist your exact-fit models — filter by role, workload, price, and context
- Compare 2–3 finalists — side-by-side price, context, and capabilities
- Price your real workload — hit "Calc →" on any row to drop it into the cost calculator
What this means + what to do next
- Real workload cost — sticker price tells you nothing about your token shape
- Quality at YOUR task — same model is great on one workload, weak on another
- Cache-aware effective pricing — caching changes the effective $/M dramatically for repeat-context workloads
- Vendor lock-in cost — switching prompts between vendors often takes weeks of eval
- Get exact $/month at your token shape per candidate model Cost Calculator
- Quantifies lock-in risk before you commit Vendor Concentration Risk
- Some models support prompt caching — drops effective $/M by 50-80% above ~22% hit rate Prompt Cache Roi
This is a discovery tool. ROI conversations happen downstream:
- Is the cheapest model in the shortlist quality-acceptable for my task? (Run eval.)
- Is the most-capable model in my shortlist 2× better, or 10× better? (The price gap usually reflects 2×, the value gap often reflects 1.3×.)
- How locked in am I to my current vendor — what would migration cost?
- Mixed workloads benefit from routing — different models for different query types Multi Model Router
- For some open-weight models, self-hosting becomes cheaper above a usage threshold Self Host Breakeven
- Translate $/request into $/customer or $/feature margin Margin Calculator
If you already have specific candidates in mind, skip discovery:
- You know which model you want; you just need the dollar number Cost Calculator
- You want the cheapest model meeting a quality floor for a specific workload Cheapest Model
- You've decided you want multiple models for different traffic patterns Multi Model Router