Anthropic pricing, complete breakdown
Verified 2026-05-27, cross-checked against Anthropic pricing page, litellm, openrouter
Anthropic's frontier model lineup is led by Claude Opus 4.7 and 4.6, both priced at $5.00 per million input tokens and $25.00 per million output tokens. For high-throughput applications, Claude Sonnet 4.6 offers a balanced rate of $3.00 per million input and $15.00 per million output tokens. The most efficient model, Claude Haiku 4.5, reduces costs further to $1.00 per million input and $5.00 per million output tokens. All current models support prompt caching to lower costs on repeated context. This guide helps you navigate these API rates alongside Anthropic's various subscription and enterprise tiers.
How Anthropic's pricing universe works
Anthropic employs a multi-channel pricing strategy to balance high-margin developer usage with predictable recurring revenue from consumers and enterprises. The API track serves builders who need metered, programmatic access to models for custom applications. Subscription tiers like Claude Pro and the high-usage Max plans target individual power users, while Team and Enterprise plans provide the administrative controls and security required for organizational deployment. By offering models through cloud marketplaces like AWS and Google Vertex, Anthropic also captures enterprise spend from companies that prefer to consolidate billing within their existing cloud infrastructure.
API (per-token, metered)
- Pay only for tokens consumed
- Full model lineup including batch, caching, long context
- Programmatic via SDKs
Consumer subscriptions (Pro, Max tiers)
- Fixed monthly fee
- Generous usage caps
- Web/desktop/mobile apps
- Often includes newer models first
Business/Team plans
- Per-seat billing
- Centralized billing
- Admin & audit controls
- Sometimes shared usage pools
Enterprise (custom contract)
- Custom pricing and limits
- SLAs
- DPAs and BAAs
- Dedicated support
- Sometimes private cloud / VPC
Cloud marketplaces (AWS Bedrock, Google Vertex, Azure)
- Same models, slightly different pricing (often parity or small premium)
- Counts toward existing cloud spend commits
- Stays within cloud's data-protection boundary
⭐ Most popular Anthropic products by user type
🎁 Current promos and time-sensitive deals
📅 What changed in the last 30 days
Every Anthropic product, profiled
For each product, what it's for, who picks it, what to watch out for, pros and cons, and what we tell our consulting clients.
Claude Free
- Basic text summarization and drafting
- Casual coding assistance and debugging
- Testing Claude's reasoning capabilities
- Occasional image analysis and OCR
- Access to Claude 3.5 Sonnet
- Web, iOS, and Android access
- Basic Claude Code CLI access
- Standard response speeds
- Limited daily message caps
- No-cost access to industry-leading reasoning
- Clean, distraction-free interface
- Strong privacy reputation compared to competitors
- Strict and unpredictable usage limits
- No access to Claude Projects or Artifacts sharing
- Data may be used for training (unless opted out)
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| Standard casual use | varies | varies | No cost, but limited by daily caps. |
Claude Pro
- Complex coding with Claude Code Pro
- Managing long-form documents in Projects
- High-volume research and analysis
- Early access to new model releases
- Access to Claude 3.5 Opus and Sonnet
- Claude Projects for organizing docs/code
- Priority access during high-traffic periods
- Claude Code Pro CLI integration
- Artifacts for live code/UI previews
- Best-in-class coding and reasoning performance
- Projects feature allows for 200k context management
- No data training on your prompts by default
- Still subject to usage caps (not unlimited)
- No native web browsing or image generation
- Mobile app can be laggy with very long Project contexts
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| Monthly subscription | $20 | $240 | Standard monthly billing. |
| Annual sticker price | $16.67 | $200 | Paid as $200 upfront. |
Claude Max 5x
- Full-day coding sessions with Claude Code
- Processing massive document sets (1M+ tokens)
- Building complex applications via CLI
- High-frequency iterative research
- 5x the message capacity of Claude Pro
- Highest priority for new model features
- Claude Code Max 5x access
- Advanced Project management tools
- Direct support channel access
- Virtually eliminates 'usage exhausted' interruptions
- Ideal for professional developers using Claude Code
- Highest possible individual rate limits
- Significant price jump from Pro ($20 to $100)
- No annual discount currently available for Max tiers
- Overkill for 95% of standard professional users
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| Power user monthly | $100 | $1,200 | No annual discount available. |
Claude Team Standard
- Collaborative project management
- Sharing custom AI instructions across a team
- Centralized billing for AI tools
- Internal knowledge base grounding
- Higher usage limits than Claude Pro
- Centralized admin console and billing
- Shared Projects for team collaboration
- Early access to collaboration features
- Claude Code Team access
- Shared Projects allow teams to work from the same context
- Administrative control over user access
- Higher usage caps than individual Pro accounts
- Expensive for very small teams (under 5 people)
- Lacks the advanced security of the Enterprise tier
- No SSO (Single Sign-On) in the Standard tier
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| Minimum 5-seat team (Monthly) | $125 | $1,500 | 5 seats @ $25/mo. |
| Minimum 5-seat team (Annual) | $100 | $1,200 | 5 seats @ $20/mo. |
Claude Enterprise
- Company-wide AI deployment with SSO
- Analyzing massive internal datasets (500k context)
- Audit-ready AI interactions
- Custom integration with internal GitHub/GitLab
- 500,000 token context window
- SSO (SAML) and SCIM provisioning
- Audit logs and activity monitoring
- GitHub/GitLab integrations
- Dedicated account management
- Maximum security and compliance features
- Massive context window for large-scale analysis
- Consolidated billing and usage insights
- High entry cost and long procurement cycles
- May include features that smaller enterprises don't need
- Pricing varies wildly based on negotiation
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| 100-seat Enterprise deployment | varies | varies | Pricing is custom; typically ranges from $30-$60 per seat depending on volume. |
Anthropic API
- Integrating Claude into custom applications
- Automated batch processing of documents
- Building autonomous agents
- Fine-tuning and prompt engineering experiments
- Access to all models (Opus, Sonnet, Haiku)
- Prompt Caching (90% discount on repeats)
- Batch API (50% discount for 24h turnaround)
- Computer Use (beta) capabilities
- Client SDKs for Python and TypeScript
- Lowest possible cost for high-volume tasks
- Prompt caching significantly reduces costs for long contexts
- No monthly subscription fee; pay only for what you use
- Requires technical expertise to implement
- Rate limits start low for new accounts (Tier 1)
- No built-in UI; must use a third-party client or build your own
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| Light developer use (1M Sonnet tokens/mo) | $18 | $216 | Assumes $3 input / $15 output mix. |
| Heavy Batch processing (100M Haiku tokens/mo) | $300 | $3,600 | Assumes 50% Batch discount on Haiku rates. |
Claude Max 20x
- Core Extended Thinking
- Notes: Top consumer tier; 20x Pro usage; same model lineup as Max 5x, higher quotas
- Tools Projects: unlimited
- Tools Artifacts
- Tools Claude Code
- Tools File Creation
- Tools Research Mode
- Tools Code Execution
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| Standard annual cost | $200 | $2,400 | Monthly subscription |
Claude Team Standard Seat
- All Claude features
- More usage than Pro
- Includes Claude Code and Claude Cowork
- Connect Microsoft 365, Slack, and more
- Enterprise search across your organization
- Central billing and administration
- Single sign-on (SSO)
- Admin controls for remote and local connectors
| Scenario | Monthly | Annual | Notes |
|---|---|---|---|
| Standard annual cost | $25 | $240 | Lower rate with annual commitment ($20/mo) |
All Anthropic products at a glance
Scroll up to the product profile for full detail
| Product | Price | Best for | Headline feature | Yearly estimate |
|---|---|---|---|---|
| Claude Free | $0 | Occasional chat | Access to latest Sonnet model | $0 |
| Claude Pro | $20/mo | Power users | Claude Code & 5x usage limits | $240 |
| Claude Team Standard | $25/mo/seat | Small teams (5+) | Shared Projects & Admin tools | $1,500 (5 seats) |
| Claude Max 20x | $200/mo | Heavy automation | 20x standard Pro limits | $2,400 |
| Anthropic API | Usage-based | App development | Prompt Caching & Batch API | Variable |
Anthropic vs the field
Same-tier comparison across top 5 vendors
| Comparison tier | Anthropic | OpenAI | xAI | Verdict | |
|---|---|---|---|---|---|
| Flagship Consumer Chat | Claude Pro $20/mo |
GPT-5.4 Pro $30/mo |
Gemini 3.1 Pro $20/mo |
Grok 4.20 $16/mo |
Anthropic and Google maintain the $20 price point while OpenAI's flagship has moved to $30. |
| Developer (Reasoning API) | Claude 3.5 Sonnet $3.00 / $15.00 |
GPT-5.4 $2.50 / $15.00 |
Gemini 3.1 Pro $2.00 / $12.00 |
Grok 4.20 (reasoning) $2.00 / $6.00 |
xAI and Google offer lower output pricing for reasoning-capable models compared to Anthropic. |
| Developer (Fast/Mini API) | Claude 3.5 Haiku $0.25 / $1.25 |
GPT-5.4 Mini $0.75 / $4.50 |
Gemini 3 Flash $0.50 / $3.00 |
Grok 4.1 Fast $0.20 / $0.50 |
Anthropic's Haiku pricing is highly competitive in the 'Mini' tier, undercutting OpenAI and Google on input costs. |
🌳 Which Anthropic product fits you?
How Anthropic pricing has moved
Tracking the transition from the Claude 3 series to the 4.7 generation.
API or subscription: which is cheaper for you?
Cross-over math at current rates
At a 2:1 input-to-output ratio (400 tokens in, 200 tokens out), each message costs approximately $0.0042 via API.
Using the flagship Opus model via API costs roughly $0.007 per average message.
The Max 5x tier provides significantly higher limits for power users who require consistent Opus access.
Current pricing (all production models)
| Model | Input $/M | Output $/M | Cached $/M | Context |
|---|---|---|---|---|
Claude Opus 4.7claude-opus-4-7 |
$5 | $25 | $0.50 | 1,000,000 |
Claude Opus 4.6claude-opus-4-6 |
$5 | $25 | $0.50 | 1,000,000 |
Claude Sonnet 4.6claude-sonnet-4-6 |
$3 | $15 | $0.30 | 1,000,000 |
Claude Haiku 4.5claude-haiku-4-5 |
$1 | $5 | $0.10 | 200,000 |
Prices are in USD per 1M tokens. Caching discounts apply to input tokens only. Verified 2026-05-27.
Full rate breakdown (all variants)
Variants beyond standard API: batch (async, 50% off), cached read (0.1x), cache writes (1.25x or 2x base), long-context tier (~2x above threshold).
Claude Opus 4.7 claude-opus-4-7
Claude Opus 4.7 claude-opus-4-7
| Variant | Input $/M | Output $/M | Notes |
|---|---|---|---|
| Standard | $5 | $25 | Default per-token API rate |
| Batch API | $2.5 | $12.5 | Async batch processing, results within 24 hours, typically 50% off |
| Cached read | $0.50 | $25 | Cached prompt input (~0.1x base); output rate unchanged |
| Cache write (5-min TTL) | $6.25 | — | Premium for caching prompt; ~1.25x base input |
| Cache write (1-hour TTL) | $10 | — | Premium for longer cache lifetime; ~2x base input |
Claude Opus 4.6 claude-opus-4-6
Claude Opus 4.6 claude-opus-4-6
| Variant | Input $/M | Output $/M | Notes |
|---|---|---|---|
| Standard | $5 | $25 | Default per-token API rate |
| Batch API | $2.5 | $12.5 | Async batch processing, results within 24 hours, typically 50% off |
| Cached read | $0.50 | $25 | Cached prompt input (~0.1x base); output rate unchanged |
Claude Sonnet 4.6 claude-sonnet-4-6
Claude Sonnet 4.6 claude-sonnet-4-6
| Variant | Input $/M | Output $/M | Notes |
|---|---|---|---|
| Standard | $3 | $15 | Default per-token API rate |
| Batch API | $1.5 | $7.5 | Async batch processing, results within 24 hours, typically 50% off |
| Cached read | $0.30 | $15 | Cached prompt input (~0.1x base); output rate unchanged |
Claude Haiku 4.5 claude-haiku-4-5
Claude Haiku 4.5 claude-haiku-4-5
| Variant | Input $/M | Output $/M | Notes |
|---|---|---|---|
| Standard | $1 | $5 | Default per-token API rate |
| Batch API | $0.50 | $2.5 | Async batch processing, results within 24 hours, typically 50% off |
| Cached read | $0.10 | $5 | Cached prompt input (~0.1x base); output rate unchanged |
Subscription plans (consumer + business)
| Plan | Audience | Monthly | Annual | Per seat | What's included |
|---|---|---|---|---|---|
|
Claude Team Premium
Claude Team |
team | $125 | $100/mo billed annually ($1,200/yr total) |
$1 |
All Team Standard features · 5x more usage than standard seats · Includes Claude Code and Claude Cowork · Connect Microsoft 365, Slack, and more · Enterprise search across your organization · Central billing and administration · Single sign-on (SSO) · Admin controls for remote and local connectors · Enterprise deployment for the Claude desktop app · No model training on your content by default · Mix and match seat types
Limits: usage: 5x more than Team Standard claude.com ↗ |
| Claude Enterprise Custom | enterprise | $20 | $20 | $1 |
All Team plan features · Admins set user and org spend limits · Google Docs cataloging · Role-based access with fine grained permissioning · System for Cross-domain Identity Management (SCIM) · Audit logs · Compliance API for observability and monitoring · Custom data retention controls · Network-level access control · IP allowlisting · HIPAA-ready offering available · Claude Security (beta)
Limits: usage: scales with model and task · seat price: 20 claude.com ↗ |
|
Claude Team Standard
Claude Team |
team | $25 | $20/mo billed annually ($240/yr total) |
$1 |
All Claude features · More usage than Pro · (plus all shared Team features) · Includes Claude Code and Claude Cowork · Connect Microsoft 365, Slack, and more · Enterprise search across your organization · Central billing and administration · Single sign-on (SSO) · Admin controls for remote and local connectors · Enterprise deployment for the Claude desktop app · No model training on your content by default · Mix and match seat types
Limits: max seats: 150 · min seats: 5 · limits source: transcript: 'All Claude features, plus more usage than Pro' claude.com ↗ |
|
Claude Free
Claude Free |
consumer | $0 | — | — |
Notes: Daily usage limits; no credit card required
Limits: opus access: none · default model: sonnet-4-6 · usage multiplier: baseline anthropic.com ↗ |
|
Claude Pro
Claude |
consumer | $20 | $200/yr (≈ $17/mo) |
— |
Notes: Annual = $200/yr = $17/mo (~17% discount); Claude Code inclusion contested in some changelogs ? verify on anthropic.com/pricing
Limits: opus access: limited · default model: sonnet-4-6 · usage multiplier: baseline_paid · context window tokens: 1000000 · annual effective monthly: 17 claude.com ↗ |
|
Claude Max 5x
Claude |
consumer | $100 | — | — |
Notes: Annual not publicly disclosed; designed for users running multiple Claude Code instances concurrently · Early Access · Priority Access · Priority Routing
Limits: opus access: full · default model: opus-4-7 · usage multiplier: 5x_pro · context window tokens: 1000000 anthropic.com ↗ |
|
Claude Max 20x
Claude |
consumer | $200 | — | — |
Notes: Top consumer tier; 20x Pro usage; same model lineup as Max 5x, higher quotas · Early Access · Priority Access · Priority Routing
Limits: opus access: full · default model: opus-4-7 · usage multiplier: 20x_pro · context window tokens: 1000000 anthropic.com ↗ |
|
Claude Free
Claude Free |
consumer | $0 | $0 | — |
Chat on web, iOS, Android, and on your desktop · Generate code and visualize data · Write, edit, and create content · Ability to search the web · Memory across conversations · Create files and execute code · Unlock more from Claude with desktop extensions · Connect Slack and Google Workspace services · Integrate any context or tool through connectors with remote MCP · Extended thinking for complex work
Limits: projects: unlimited · context tokens: unlimited · messages per 5h: unlimited anthropic.com ↗ |
|
Claude Pro
Claude |
consumer | $20 | $200/yr (≈ $17/mo) |
— |
Everything in Free · More usage · Includes Claude Code · Includes Claude Cowork · Access to unlimited projects to organize chats and documents · Access to Research · Ability to use more Claude models · Claude for Microsoft 365 · Claude for Microsoft Outlook
Limits: usage: more than Free tier anthropic.com ↗ |
|
Claude Max 5x
Claude |
consumer | $100 | — | — |
Everything in Pro · Choose 5x more usage than Pro · Higher output limits for all tasks · Early access to advanced Claude features · Priority access at high traffic times
Limits: usage: 5x more than Pro tier · output limits: higher anthropic.com ↗ |
|
Claude Max 20x
Claude |
consumer | $200 | — | — |
Everything in Pro · Choose 20x more usage than Pro · Higher output limits for all tasks · Early access to advanced Claude features · Priority access at high traffic times
Limits: usage: 20x more than Pro tier · output limits: higher anthropic.com ↗ |
|
Claude Team Standard Seat
Claude |
team | $25 | $20/mo billed annually ($240/yr total) |
$1 |
All Claude features · More usage than Pro · Includes Claude Code and Claude Cowork · Connect Microsoft 365, Slack, and more · Enterprise search across your organization · Central billing and administration · Single sign-on (SSO) · Admin controls for remote and local connectors · Enterprise deployment for the Claude desktop app · No model training on your content by default
Limits: usage: more than Pro tier anthropic.com ↗ |
|
Claude Team Premium Seat
Claude |
team | $125 | $100/mo billed annually ($1,200/yr total) |
$1 |
All Team Standard Seat features · 5x more usage than standard seats · Includes Claude Code and Claude Cowork · Connect Microsoft 365, Slack, and more · Enterprise search across your organization · Central billing and administration · Single sign-on (SSO) · Admin controls for remote and local connectors · Enterprise deployment for the Claude desktop app · No model training on your content by default
Limits: usage: 5x more than Team Standard Seat anthropic.com ↗ |
|
Claude Enterprise
Claude |
enterprise | — | — | $1 |
All Team plan features · Admins set user and org spend limits · Role-based access with fine grained permissioning · System for Cross-domain Identity Management (SCIM) · Audit logs · Compliance API for observability and monitoring · Custom data retention controls · Network-level access control · IP allowlisting · HIPAA-ready offering available · Claude Security (beta)
Limits: seats: 100+ · usage: scales with model and task anthropic.com ↗ |
|
Claude Code Free
Claude Code Free |
developer | $0 | $0 | — |
Included with Claude Free tier
Limits: usage: included with Claude Free tier anthropic.com ↗ |
|
Claude Code Pro
Claude Code |
developer | $20 | $200/yr (≈ $17/mo) |
— |
Included with Claude Pro tier
Limits: usage: included with Claude Pro tier anthropic.com ↗ |
|
Claude Code Max 5x
Claude Code |
developer | $100 | — | — |
Included with Claude Max 5x tier
Limits: usage: 5x more than Pro tier anthropic.com ↗ |
|
Claude Code Max 20x
Claude Code |
developer | $200 | — | — |
Included with Claude Max 20x tier
Limits: usage: 20x more than Pro tier anthropic.com ↗ |
|
Claude Code Team Standard Seat
Claude Code |
team | $25 | $20/mo billed annually ($240/yr total) |
$1 |
Included with Team Standard Seat
Limits: usage: included with Team Standard Seat anthropic.com ↗ |
|
Claude Code Team Premium Seat
Claude Code |
team | $125 | $100/mo billed annually ($1,200/yr total) |
$1 |
Included with Team Premium Seat
Limits: usage: included with Team Premium Seat anthropic.com ↗ |
|
Claude Code Enterprise
Claude Code |
enterprise | — | — | $1 |
All Team plan features · Admins set user and org spend limits · Role-based access with fine grained permissioning · System for Cross-domain Identity Management (SCIM) · Audit logs · Compliance API for observability and monitoring · Custom data retention controls · Network-level access control · IP allowlisting · HIPAA-ready offering available
Limits: seats: 100+ · usage: scales with model and task anthropic.com ↗ |
Subscription pricing is separate from per-token API rates above.
How buyers think about Anthropic pricing
Each scenario below is interactive — tweak the inputs to see how the math changes for your workload.
Cheapest Claude for high-volume classification
The problem: You need to process hundreds of thousands of short classification tasks daily without the premium cost of reasoning models. Managing these workloads at scale requires a balance between speed and per-token efficiency.
What to do: Claude Haiku 4.5 is the recommended model for high-frequency, low-latency tasks.
→ Using Haiku 4.5 with prompt caching can reduce daily operating costs for classification by up to 80 percent compared to base rates.
Strongest Claude for hard reasoning problems
The problem: Complex architectural decisions or deep code audits require the highest level of cognitive performance, where accuracy is more critical than the cost of a single inference.
What to do: Claude Opus 4.7 provides the most advanced reasoning capabilities in the Anthropic suite.
→ Opus 4.7 maintains a premium price point of $5 per million input tokens to deliver maximum accuracy on complex logic.
Is Claude Pro at the listed monthly rate worth it for solo use
The problem: You are deciding between a flat monthly subscription and paying for API usage. You need to know the break-even point where the subscription becomes more economical than per-token billing.
What to do: Claude Sonnet 4.6 is the balanced model used for comparison against the standard Pro subscription.
→ The break-even point for a solo user is approximately 132,000 total tokens processed per day at balanced tier rates.
When does prompt caching pay off
The problem: You are sending the same large context or system prompt repeatedly. You want to know how many repeat requests are needed to justify the engineering effort of implementing caching.
What to do: Anthropic's prompt caching offers a 90 percent discount on input tokens for cached content.
→ Prompt caching reduces input costs by 90 percent for any workload that reuses context at least three times in a five-minute window.
Anthropic API direct vs AWS Bedrock vs Google Vertex
The problem: You need to choose a hosting provider for Claude models. You are looking for the lowest price while maintaining enterprise security and compliance.
What to do: While base per-token rates are identical across providers, regional surcharges and commitment programs create significant total cost differences.
→ Base prices are identical, but regional configurations on Vertex and Bedrock can increase effective costs by 10 percent.
How does Claude compare on cost to GPT and Gemini
The problem: You are comparing the balanced tier of Claude against other major providers to ensure you are not overpaying for similar performance.
What to do: Claude Sonnet 4.6 serves as the benchmark for balanced performance and cost efficiency.
→ Claude Sonnet 4.6 maintains a competitive $3 per million input token rate for balanced enterprise workloads.
Volume discounts & partner programs
Claude Partner Network
Threshold: reportedly 10 team members must complete Anthropic Academy certifications for consulting partners
Typical discount (reported): varies; includes $100 million in total program funding for training and market development
Benefits:
- Early access to new products (typically 60-90 days before general availability)
- Quarterly product roadmap briefings under NDA
- Technical certification (Claude Certified Architect program)
- Joint market development and co-marketing support
- Dedicated Applied AI engineers for live customer deals
How to engage: Apply through the Anthropic Partner Portal; requires 4-8 weeks for review and a technical interview
Source: anthropic.comvendor_official · cited 2026-03-12
Claude Marketplace
Threshold: requires an existing Anthropic enterprise spend commitment
Typical discount (reported): allows spend consolidation; no commission taken by Anthropic from partner transactions
Benefits:
- Consolidate AI spend by applying Anthropic commitments to third-party apps
- Simplified procurement with managed invoicing through Anthropic
- Access to enterprise-ready partners like GitLab, Snowflake, Harvey, and Replit
- Reduces risk of cost overruns on large spending commitments
How to engage: Enterprises with existing commitments should contact their Anthropic account team
Source: claude.comvendor_official · cited 2026-03-06
Claude for Startups
Threshold: varies by stage (Pre-seed to Series B); often requires VC or accelerator partnership
Typical discount (reported): reportedly $25,000 to $100,000+ in API credits
Benefits:
- Free API credits valid for 12 months
- Premium rate limit boosts
- Direct access to Anthropic technical teams and office hours
- Access to the Anthology Fund for direct investment opportunities
How to engage: Apply via partner VC firms or directly through the Anthropic Startup Program portal
Source: anthropic.comvendor_official · cited 2025-05-22
AWS Bedrock Provisioned Throughput
Threshold: minimum 1-month or 6-month commitment
Typical discount (reported): fixed hourly rates; monthly costs range from approximately $15,000 to $37,000+ per Model Unit
Benefits:
- Guaranteed capacity and consistent throughput for high-volume workloads
- No per-token charges for reserved capacity
- Counts toward AWS Enterprise Discount Program (EDP) commitments
- 50% discount on batch inference compared to on-demand rates
How to engage: Purchase through the AWS Bedrock console or via AWS Marketplace
Source: aws.amazon.comvendor_official · cited 2026-05-14
Google Vertex AI Committed Use Discounts (CUDs)
Threshold: 1-year or 3-year spend commitment
Typical discount (reported): approximately 30% for 1-year and 50% for 3-year commitments
Benefits:
- Claude usage counts toward Google Cloud enterprise spend commitments
- Eligible for GCP startup credits and free trial credits
- Up to 90% discount on context caching
- 50% discount for non-urgent Batch API workloads
How to engage: Enable through the Google Cloud Console under Vertex AI Model Garden
Source: cloud.google.comvendor_official · cited 2026-05-04
Microsoft Foundry (Azure) Integration
Threshold: requires an active Azure subscription (Enterprise or MCA-E)
Typical discount (reported): uses standard API pricing; eligible for Microsoft Azure Consumption Commitment (MACC)
Benefits:
- Consolidated billing through existing Azure agreements
- Eliminates separate vendor approvals for procurement
- Integration with Microsoft 365 Copilot and Excel Agent Mode
- Enterprise-grade governance and Microsoft Entra authentication
How to engage: Deploy Claude models as serverless APIs through the Azure AI Foundry Model Catalog
Source: anthropic.comvendor_official · cited 2025-11-18
Multi-cloud availability
| Cloud | Model availability | Price vs vendor-direct | Reasons to pick |
|---|---|---|---|
| AWS Bedrock | Claude Opus (4.7, 4.6, 4.5, 4.1), Claude Sonnet (4.6, 4.5, 4), Claude Haiku 4.5, and Claude Mythos Preview (gated research preview). | Identical per-token rates in standard regions ($5/$25 for Opus 4.6, $3/$15 for Sonnet 4.6, $1/$5 for Haiku 4.5); cross-region inference adds approximately 10%. |
pecollective.com ↗ |
| Google Vertex AI | Claude Opus (4.7, 4.6, 4.5), Claude Sonnet (4.6, 4.5, 4), Claude Haiku 4.5, and Claude Mythos Preview (private preview). | Base per-token pricing matches direct API, but regional endpoints reportedly carry a 10% surcharge over global endpoint pricing. |
cloud.google.com ↗ |
| Microsoft Foundry (Azure) | Claude Opus (4.7, 4.6, 4.5, 4.1), Claude Sonnet (4.6, 4.5), and Claude Haiku 4.5. | Reportedly identical to direct API rates ($5/$25 per 1M tokens for Opus 4.6/4.7). |
microsoft.com ↗ |
| Claude Platform on AWS (Marketplace) | Full feature parity with direct Anthropic API, including beta features like Managed Agents and MCP connectors. | Identical per-token rates but billed in 'Claude Consumption Units' (CCUs) at a fixed rate of $0.01 per CCU. |
cloudzero.com ↗ |
Free credits & startup programs
Anthropic Startup Program
Reported value: $25,000 to $100,000+ in API credits
Eligibility: Early-stage startups (Pre-seed to Series A) building AI-powered products; primarily available through Anthropic's VC partner network or direct application for incorporated companies.
How to apply: Apply via the official startup program page or through a referral from an approved Anthropic VC partner.
Anthology Fund (Menlo Ventures partnership)
Reported value: $25,000 in free credits
Eligibility: Startups selected for investment by the $100 million Anthology Fund; focuses on AI infrastructure, frontier applications, and social impact.
How to apply: Submit an investment application through the Menlo Ventures Anthology Fund portal.
Claude for Open Source
Reported value: approximately $1,200 value (6 months of Claude Max 20x)
Eligibility: Maintainers of public GitHub repositories with 5,000+ stars or 1M+ monthly npm downloads; must have been active in the last 3 months.
How to apply: Apply through the dedicated Claude for OSS contact form (applications close June 30, 2026).
Anthropic AI for Science Program
Reported value: up to $20,000 in API credits
Eligibility: Academics, researchers, and nonprofits working on high-impact scientific projects, particularly in biology and life sciences.
How to apply: Complete the AI for Science Program application form; submissions are evaluated on the first Monday of each month.
External Researcher Access Program
Reported value: $1,000 in API credits
Eligibility: Researchers working on high-priority AI safety and alignment topics; credits apply to standard model suite API use.
How to apply: Submit the External Researcher Access Program application form with details on the research team and topic.
Google for Startups Cloud Program (AI Track)
Reported value: $10,000 specific to partner models (including Anthropic)
Eligibility: Qualifying startups in the Scale or Scale AI Tier; must be less than 5 years old and pre-Series A.
How to apply: Join the Google for Startups Cloud Program and contact a Google Cloud Account Executive to request partner model credits.
AWS Activate (Portfolio Package)
Reported value: up to $100,000 in AWS credits
Eligibility: Startups associated with an AWS Activate Provider (VC, accelerator, or incubator); credits can be used for Claude models via Amazon Bedrock.
How to apply: Apply through the AWS Activate console using an Organization ID from a partner provider.
Pricing gotchas to watch
Silent Prompt Cache TTL Expiration and Traffic Gaps
While Anthropic's documentation specifies a 5-minute default TTL for ephemeral prompt caching, users have reportedly observed silent reductions in effective cache life to approximately 3 minutes in practice. Additionally, for users on the 1-hour TTL tier, community reports suggest a quiet shift in default behavior back to 5 minutes for certain session types, leading to unexpected full-price re-processing of large contexts (up to 1M tokens) if a user pauses for more than 5 minutes.
Workaround: Implement a 'heartbeat' request or 'Claude Pulse' style monitoring to track live cache countdowns and ensure follow-up replies occur within the 3-5 minute window to maintain the 90% read discount.
Source: github.comgithub_issue · cited 2026-05-27
Minimum Token Thresholds Vary by Model Generation
Prompt caching only activates if the prefix meets a minimum token threshold, which varies significantly by model. As of May 2026, Claude Sonnet 4.6 and 4.5 require 1,024 tokens, while higher-tier models like Claude Opus 4.7, 4.6, 4.5, and the Claude Mythos Preview require a much higher 4,096 tokens. Claude Haiku 4.5 also reportedly requires 4,096 tokens due to its longer attention window architecture.
Workaround: Verify system prompt length using the count_tokens API; if below the threshold, 'pad' the prompt with additional few-shot examples or domain context to trigger the 90% read discount.
Source: claude.comvendor_docs · cited 2026-05-27
Dynamic Content and Whitespace Cache Invalidation
Anthropic's prompt caching requires an exact byte-for-byte prefix match. Common 'gotchas' that invalidate the cache include injecting dynamic timestamps or session IDs into system prompts, changing the order of tool definitions, or even inconsistent trailing whitespace (e.g., concatenating with '\n' in one process but not another).
Workaround: Move dynamic elements like timestamps to the user message block rather than the system prompt, and ensure tool definitions are sorted or stored in a stable-order Map.
Source: dev.toblog_post · cited 2026-05-27
Google Vertex AI Regional Endpoint Surcharge
While base per-token pricing for Claude models is generally identical across the Anthropic API, AWS Bedrock, and Google Vertex AI, Vertex AI reportedly applies a 10% surcharge specifically for regional endpoints compared to its global endpoint pricing.
Workaround: Use global endpoints where data residency requirements allow to avoid the 10% regional premium.
Source: hikari-dev.comblog_post · cited 2026-05-27
Managed vs. Unmanaged Agent Billing Disparity
Anthropic's beta 'Managed Agents' (Tier 3) provides a hosted runtime with optimized prompt caching and pay-as-you-go billing. In contrast, users of the 'Agent SDK' (Tier 2) reportedly face a 'hard-capped' credit system and must manually implement caching logic, which can lead to significantly higher costs if third-party tools reprocess context inefficiently.
Workaround: Migrate agentic workloads to Managed Agents or Routines to leverage Anthropic's native runtime optimizations and avoid the 'unmanaged' credit squeeze.
Source: reddit.comreddit · cited 2026-05-27
Hidden costs (25-40% beyond per-token rates)
- Retry overhead from rate limits and network errors adds 5-15% to effective costs.
- Cache invalidation from inconsistent whitespace or dynamic timestamps in system prompts.
- Minimum token thresholds (up to 4,096 tokens) prevent caching on shorter prompts.
- Regional endpoint surcharges of 10% on specific cloud provider platforms.
- Silent cache TTL expiration if requests are not sent within the 3-5 minute window.
- Unmanaged agent billing disparities where manual caching logic is not optimized.
- Input padding required to trigger caching discounts for prompts just below the threshold.
Typical overhead: 25-40% beyond raw per-token rates.
What it costs to leave Anthropic
Leaving Anthropic requires rewriting prompts to remove XML-style tagging and adjusting tool-calling schemas. Lock-in is highest for users of Managed Agents or the Claude Partner Network due to proprietary runtime optimizations.
- small project (1-5 prompts): 1-2 engineer-days
- mid-size (10-50 prompts): 1-2 engineer-weeks
- large agentic system: 1-3 engineer-months
Who is this for?
For vibe coders & solo devs
Focus on maximizing the value of Claude Pro or the Claude for Open Source program if you maintain popular repositories. Use prompt caching for long coding sessions to keep costs low while maintaining context. Leverage the $25,000 to $100,000 in startup credits if you are building a new product through a VC partner.* Apply for the Claude for Open Source program for approximately $1,200 in value.
* Use the count_tokens API to ensure prompts meet the 1,024 token caching threshold.
* Move dynamic timestamps to user messages to avoid cache invalidation.
* Implement a heartbeat request to maintain the 5-minute cache TTL during long pauses.
For SMBs and growing teams
Consolidate your spend through the Claude Marketplace if you already have an enterprise commitment. Consider the Claude Partner Network to access training and $100 million in total program funding for market development. Use the Team Standard or Premium plans to manage user access without the complexity of direct API integration.* Apply for the Claude Partner Network to get dedicated Applied AI engineer support.
* Use the Claude Marketplace to consolidate third-party app spend into one invoice.
* Monitor the 10 percent regional surcharge on Google Vertex AI if using regional endpoints.
* Evaluate the $20,000 AI for Science program if your business conducts academic research.
For enterprise buyers
Prioritize AWS Bedrock or Google Vertex AI to use existing cloud credits and meet strict security requirements like VPC PrivateLink. Use Provisioned Throughput on AWS for guaranteed capacity at fixed hourly rates, which can range from $15,000 to $37,000 per Model Unit. Leverage Committed Use Discounts (CUDs) on Google Cloud for up to 50 percent savings on three-year terms.* Negotiate AWS Bedrock Provisioned Throughput for high-volume, consistent workloads.
* Use Google Vertex AI global endpoints to avoid the 10 percent regional premium.
* Apply AWS Activate or Google Cloud startup credits to offset initial Claude usage.
* Utilize the Claude Marketplace to reduce the risk of cost overruns on large commitments.
Sources verified for this page
Primary: Anthropic pricing page
View all 22 cited insider sources across 13 domains
- Claude Partner Network (vendor_official, verified 2026-03-12)
- Claude Marketplace (vendor_official, verified 2026-03-06)
- Claude for Startups (vendor_official, verified 2025-05-22)
- AWS Bedrock Provisioned Throughput (vendor_official, verified 2026-05-14)
- Google Vertex AI Committed Use Discounts (CUDs) (vendor_official, verified 2026-05-04)
- Microsoft Foundry (Azure) Integration (vendor_official, verified 2025-11-18)
- Silent Prompt Cache TTL Expiration and Traffic Gaps (github_issue, verified 2026-05-27)
- Minimum Token Thresholds Vary by Model Generation (vendor_docs, verified 2026-05-27)
- Dynamic Content and Whitespace Cache Invalidation (blog_post, verified 2026-05-27)
- Google Vertex AI Regional Endpoint Surcharge (blog_post, verified 2026-05-27)
- Managed vs. Unmanaged Agent Billing Disparity (reddit, verified 2026-05-27)
- AWS Bedrock (grounded_research, verified 2026-05-27)
- Google Vertex AI (grounded_research, verified 2026-05-27)
- Microsoft Foundry (Azure) (grounded_research, verified 2026-05-27)
- Claude Platform on AWS (Marketplace) (grounded_research, verified 2026-05-27)
- Anthropic Startup Program (grounded_research, verified 2026-05-27)
- Anthology Fund (Menlo Ventures partnership) (grounded_research, verified 2026-05-27)
- Claude for Open Source (grounded_research, verified 2026-05-27)
- Anthropic AI for Science Program (grounded_research, verified 2026-05-27)
- External Researcher Access Program (grounded_research, verified 2026-05-27)
- Google for Startups Cloud Program (AI Track) (grounded_research, verified 2026-05-27)
- AWS Activate (Portfolio Package) (grounded_research, verified 2026-05-27)
Generator: gen-v5.0.8-2026-05-25 · Last refreshed: Tue May 26 2026 22:46:16 GMT-0400 (Eastern Daylight Time) · Pricing snapshot: Tue May 26 2026 00:00:00 GMT-0400 (Eastern Daylight Time)