Contact Us

Last updated: April 18, 2026

Get in Touch

Have questions about AI cost, vendor selection, compliance, or your specific architecture? We'd love to help. Whether you want to chat with our AI Cost Genie, book a consulting call, or ask a general question — we typically respond within 1-2 business days.

How Can We Help?

Choose the best way to reach us based on your needs:

Free AI Cost Therapy

Chat live with the AI Cost Genie on our homepage. Instant answers from 115K+ tools. No email required.

Ask the Genie →
⏱️ Response: Instant
Book a 30-Min Consultation

Talk to a human about your specific AI cost situation. Free initial call. No obligation. Three concrete actions by Monday morning.

Book a call →
⏱️ Response: Next available slot
General Inquiries

Questions about the platform, partnerships, media, or anything else

⏱️ Response: 1-2 business days
Privacy & Data Requests

Data deletion, access requests, or privacy concerns

⏱️ Response: Within 45 days per CCPA/GDPR
Report an Error

Found incorrect pricing, outdated info, or wrong guidance? We'll fix it

⏱️ Response: 3-5 business days

Important: We Are Not Your Advisor

aicost.ai provides general cost intelligence, frameworks, and tool recommendations based on publicly available information and AI-assisted research. We are NOT your financial, legal, compliance, or security advisor. All advice is provided as-is. You are responsible for validating any recommendation against your own organization's constraints — including but not limited to budget, infrastructure, security posture, privacy requirements, compliance obligations (HIPAA, SOC2, EU AI Act, CCPA, etc.), vendor contracts, and architectural realities. Always cross-check pricing, features, and terms directly with vendors before making decisions.

Company Information

CloudIntelligence.ai LLC
Founders: Subu Vdaygiri & Hanvish Vdaygiri
Irvine, California 92618
United States

Connect With Us

Follow our founder on LinkedIn: https://www.linkedin.com/in/subu-vdaygiri/

Before You Reach Out

You might find answers to common questions here:

Is aicost.ai free to use?

Yes. The AI Cost Genie chat, the 4-min cost assessment, the pricing watch, and all the guides are free. We offer paid consulting for complex situations that need a human — but the self-serve tools are always free.

Do you sell my data?

No. We never sell personal information. When you chat with the Genie or use our tools, we log anonymized usage for product improvement. When you provide an email for a download, we use it only to deliver what you requested and to send relevant follow-ups (which you can unsubscribe from anytime).

How accurate is the pricing you show?

Pricing is sourced from vendor websites and our tools database of 115K+ AI tools. We make reasonable efforts to keep it current, but pricing changes frequently in AI/cloud. Always verify pricing directly with the vendor before making a purchase decision. We disclaim liability for outdated pricing — see our terms of service.

Can the AI Cost Genie give me bad advice?

The Genie is an AI system. It can be wrong, miss context about your specific organization, or be based on information that has become outdated. Treat its guidance as a starting point for your own research — not as final advice. Cross-check with your team, your architecture, your compliance officer, and the actual vendors before implementing.

Do you have affiliate relationships with vendors?

The ToolsInfo network (our parent data source) may have affiliate links in certain categories. Affiliate relationships never influence our tool rankings or recommendations. All advice is based on features, value, and fit — not commissions.

How do I delete my data?

Email [email protected] with 'Data Deletion Request' in the subject line. We'll process within 45 days per CCPA/GDPR requirements.

NVIDIA Inception Program Member

CloudIntelligence.ai LLC, the parent company of aicost.ai, has been accepted into the NVIDIA Inception Program — a free global accelerator for AI startups building on NVIDIA’s platform.

NVIDIA Inception Program Member Badge

Proud member of the NVIDIA Inception Program

CloudIntelligence.ai LLC — the company behind aicost.ai, ToolsInfo.com, AIPapers.ai, AINewsCycle.com, and AvatarVA — is an NVIDIA Inception startup accelerating innovation on NVIDIA’s platform.

📖 Data sources & methodology 161 text models · 9 embeddings · 24 vision · 41 audio · 8 vector DBs across 10 vendor pages · last verified 2026-06-05

Methodology

  • All prices are USD per 1 million tokens, current as of 2026-06-05.
  • Vendor-published values have no mark. Inferred/extrapolated values are marked with * and listed below.
  • Batch API discounts are 50% off standard rates across providers that offer Batch mode.
  • Prompt caching discounts vary by provider (typically 80-90% off cached input tokens).
  • Regional data-residency surcharges (Anthropic 1.1x, OpenAI 1.1x, Google regional tiers) are NOT included in base rates.
  • Long-context pricing tiers apply when input exceeds model threshold.
  • Embedding prices are input-only (no output tokens generated).

Primary sources

Last-verified date is the most recent successful daily snapshot (aicost_pricing_snapshots) or, when no snapshot exists yet, the latest successful crawler run (aicost_crawler_runs). 10 of 10 vendors are currently verified. Aggregator services (TokenCost, AI Pricing Guru, etc.) are not listed.

Anthropic
2026-06-05
https://www.anthropic.com/pricing
Daily snapshot since Sep 2023 · 578 days captured
Anthropic Docs
2026-06-05
https://platform.claude.com/docs/en/about-claude/pricing
Daily snapshot since Sep 2023 · 578 days captured
OpenAI
2026-06-05
https://openai.com/api/pricing/
Daily snapshot since Sep 2023 · 579 days captured
Google AI
2026-06-05
https://ai.google.dev/gemini-api/docs/pricing
Daily snapshot since Dec 2023 · 554 days captured
Google Vertex
2026-06-05
https://cloud.google.com/vertex-ai/generative-ai/pricing
Daily snapshot since Dec 2023 · 554 days captured
DeepSeek
2026-06-05
https://api-docs.deepseek.com/quick_start/pricing
Daily snapshot since May 2024 · 493 days captured
xAI
2026-06-05
https://x.ai/api
Daily snapshot since Nov 2024 · 411 days captured
Mistral
2026-06-05
https://mistral.ai/pricing
Daily snapshot since Dec 2023 · 552 days captured
Cohere
2026-06-05
https://cohere.com/pricing
Daily snapshot since Sep 2023 · 578 days captured

Inferred values (marked with * in calculator tables)

Derived from industry conventions, not directly published by the vendor. Typical conventions: cached input = 10% of base (90% off), Batch API = 50% of base (50% off).

Vendor / Model Field Why it’s inferred
Anthropic — Claude Sonnet 4.6 cachedInput Derived at 10% of input rate — Anthropic publishes 90% cache-hit discount on this tier.
Anthropic — Claude Sonnet 4.5 cachedInput Derived at 10% of input rate; same 90% cache-hit convention as Sonnet 4.6.
Anthropic — Claude Sonnet 4.5 batchInput Derived at 50% of standard input — Anthropic documents uniform 50% Batch discount.
Anthropic — Claude Sonnet 4.5 batchOutput Derived at 50% of standard output — Anthropic documents uniform 50% Batch discount.
Anthropic — Claude Haiku 4.5 cachedInput Derived at 10% of input rate — Anthropic 90% cache-hit discount convention.
OpenAI — GPT-5.4 Mini cachedInput Derived at 10% of input — OpenAI documents automatic 90% discount on cache hits across GPT-5.x tier.
OpenAI — GPT-5.4 Nano cachedInput Derived at 10% of input — OpenAI 90% cache-hit convention.
OpenAI — GPT-5.4 Nano batchInput Derived at 50% of input — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.4 Nano batchOutput Derived at 50% of output — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.4 Pro cachedInput Derived at 10% of input — OpenAI 90% cache-hit convention.
OpenAI — GPT-5.4 Pro batchInput Derived at 50% of input — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.4 Pro batchOutput Derived at 50% of output — OpenAI Batch API uniform 50% discount.
OpenAI — GPT-5.2 cachedInput Derived at 10% of input; no residency uplift.
OpenAI — GPT-5.2 batchInput Derived at 50% of input.
OpenAI — GPT-5.2 batchOutput Derived at 50% of output.
OpenAI — GPT-5 cachedInput Derived at 10% of input.
OpenAI — GPT-5 batchInput Derived at 50% of input.
OpenAI — GPT-5 batchOutput Derived at 50% of output.
OpenAI — GPT-5.5 Pro cachedInput Derived at 10% of input — OpenAI does not publish a cached rate for *-pro models; using the family convention.
OpenAI — GPT-5.5 Pro batchInput Derived at 50% of input.
OpenAI — GPT-5.5 Pro batchOutput Derived at 50% of output.
OpenAI — GPT-5.2 Pro cachedInput Derived at 10% of input — pro-tier convention.
OpenAI — GPT-5.2 Pro batchInput Derived at 50% of input.
OpenAI — GPT-5.2 Pro batchOutput Derived at 50% of output.
OpenAI — GPT-5.1 batchInput Derived at 50% of input.
OpenAI — GPT-5.1 batchOutput Derived at 50% of output.
OpenAI — GPT-5 Pro batchInput Derived at 50% of input.
OpenAI — GPT-5 Pro batchOutput Derived at 50% of output.
OpenAI — GPT-5 Nano cachedInput Derived at 10% of input.
OpenAI — GPT-5 Nano batchInput Derived at 50% of input.
OpenAI — GPT-5 Nano batchOutput Derived at 50% of output.
Google — Gemini 3 Flash cachedInput Derived at 10% of input — Google caching discount convention ~90%.
Google — Gemini 3.1 Flash-Lite cachedInput Derived at 10% of input — Google caching convention.
Google — Gemini 3.1 Flash-Lite batchInput Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 3.1 Flash-Lite batchOutput Derived at 50% of output — Google Batch API uniform 50% discount.
Google — Gemini 2.5 Pro cachedInput Derived at 10% of input.
Google — Gemini 2.5 Flash cachedInput Derived at 10% of input.
Google — Gemini 2.5 Flash-Lite cachedInput Derived at 10% of input — Google caching convention.
Google — Gemini 2.5 Flash-Lite batchInput Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 2.5 Flash-Lite batchOutput Derived at 50% of output — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash cachedInput Derived at 25% of input per Google 2.0 family caching rates.
Google — Gemini 2.0 Flash batchInput Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash batchOutput Derived at 50% of output — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash-Lite cachedInput Derived at 10% of input — Google caching convention.
Google — Gemini 2.0 Flash-Lite batchInput Derived at 50% of input — Google Batch API uniform 50% discount.
Google — Gemini 2.0 Flash-Lite batchOutput Derived at 50% of output — Google Batch API uniform 50% discount.
xAI — Grok 4 (legacy) cachedInput Extrapolated at 25% of base.

Pricing is cross-verified against the LiteLLM community registry when available. Daily snapshots are kept in aicost_pricing_snapshots; every change is logged to aicost_price_changelog with old & new values for full audit trail. Read the full methodology →