Embedding Cost · for RAG builders

What does your RAG setup cost to build and run?

Indexing, re-indexing, query-side embeddings, vector storage. Compare 9 embedding models side-by-side.

Pricing verified: 2026-06-05 · 9 embedding models
What this calculator does

Compare 9 embedding models on indexing and query-time cost for your RAG corpus.

Why use it
  • Embedding choice affects retrieval quality as well as cost, so compare both at once
  • Separate indexing cost (one-time) from query cost (recurring); the two are often conflated
  • Compare OpenAI, Voyage, Cohere, Gemini, BGE, and Nomic side by side
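The split between one-time and recurring cost can be sketched in a few lines. This is a minimal illustration, not the calculator's own code: the `Workload` fields, the function name, and the price used in the example are all assumptions, and it ignores chunk overlap, so corpus tokens are simply documents times tokens per document.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical inputs; field names are illustrative only."""
    num_docs: int
    tokens_per_doc: int
    monthly_change_rate: float  # fraction of the corpus re-embedded each month
    queries_per_month: int
    tokens_per_query: int


def embedding_costs(w: Workload, price_per_million_tokens: float) -> dict[str, float]:
    """Split embedding spend (USD) into one-time and recurring parts."""
    per_token = price_per_million_tokens / 1_000_000
    corpus_tokens = w.num_docs * w.tokens_per_doc

    indexing = corpus_tokens * per_token                            # paid once
    reindexing = corpus_tokens * w.monthly_change_rate * per_token  # paid monthly
    queries = w.queries_per_month * w.tokens_per_query * per_token  # paid monthly

    return {
        "one_time_indexing": indexing,
        "monthly_reindexing": reindexing,
        "monthly_queries": queries,
        "monthly_total": reindexing + queries,
    }


if __name__ == "__main__":
    example = Workload(
        num_docs=10_000,
        tokens_per_doc=2_000,
        monthly_change_rate=0.10,
        queries_per_month=50_000,
        tokens_per_query=30,
    )
    # 0.10 USD per million tokens is a placeholder, not a quoted provider price.
    for name, usd in embedding_costs(example, price_per_million_tokens=0.10).items():
        print(f"{name}: ${usd:.2f}")
```

Keeping the one-time figure out of the monthly total is the point of the sketch: for many workloads the initial indexing bill is larger than any single month of re-indexing and queries, but it is paid only once.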
🎛 CALCULATOR
📚 Your document corpus

What goes into the vector database.

Each document is split into smaller chunks for embedding.
How often content changes and needs re-embedding.
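As a rough sketch of how chunking and change rate turn into vector counts, the snippet below estimates chunks per document for a fixed-size sliding window and the number of chunks re-embedded each month. The function names, the overlap parameter, and the fixed-window strategy are assumptions made for illustration; real chunkers that split on sentences or headings will produce different counts.

```python
import math


def chunks_per_doc(tokens_per_doc: int, chunk_size: int, overlap: int = 0) -> int:
    """Chunks produced by a window of chunk_size tokens advancing by chunk_size - overlap."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    if tokens_per_doc <= 0:
        return 0
    if tokens_per_doc <= chunk_size:
        return 1
    stride = chunk_size - overlap
    # One chunk for the first window, then one per stride needed to cover the rest.
    return 1 + math.ceil((tokens_per_doc - chunk_size) / stride)


def monthly_reembedded_chunks(num_docs: int, chunks_each: int, monthly_change_rate: float) -> int:
    """Chunks re-embedded per month if a fixed fraction of the corpus changes."""
    return round(num_docs * chunks_each * monthly_change_rate)


if __name__ == "__main__":
    per_doc = chunks_per_doc(tokens_per_doc=2_000, chunk_size=512, overlap=64)
    total = 10_000 * per_doc
    changed = monthly_reembedded_chunks(10_000, per_doc, monthly_change_rate=0.10)
    print(per_doc, total, changed)
```

Overlap raises both the vector count and the number of tokens embedded, so it is worth including in any estimate that uses it.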
🔍 Query patterns
🧬 Your pick
📈 RESULTS
📋 Example Workload - change any field to see your actual cost
Total monthly embedding cost
🏗 One-time indexing
🔄 Monthly re-indexing
🔍 Monthly query cost
💾 Vector storage estimate
Total vectors (chunks)
Dimensions
Raw vector size
With index overhead (~1.3x)

Storage note: pricing varies widely across Pinecone, Weaviate, Qdrant, and pgvector. A typical range is $0.025-$0.30 per GB per month. Our vector DB cost guide breaks down each option.
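The storage rows above reduce to simple arithmetic. The sketch below assumes 4 bytes per dimension (float32) and decimal gigabytes, neither of which is stated on this page; the 1.3x index overhead and the $0.025-$0.30 per GB per month range are the figures quoted above. Function names are illustrative.

```python
def vector_storage_gb(
    num_vectors: int,
    dimensions: int,
    bytes_per_value: int = 4,     # assumption: float32 components
    index_overhead: float = 1.3,  # the ~1.3x figure quoted above
) -> tuple[float, float]:
    """Return (raw_gb, indexed_gb), using 1 GB = 1e9 bytes."""
    raw_gb = num_vectors * dimensions * bytes_per_value / 1e9
    return raw_gb, raw_gb * index_overhead


def monthly_storage_cost_range(
    indexed_gb: float,
    low_per_gb: float = 0.025,   # low end of the typical range quoted above
    high_per_gb: float = 0.30,   # high end of the typical range quoted above
) -> tuple[float, float]:
    """Return (low, high) monthly storage cost in USD."""
    return indexed_gb * low_per_gb, indexed_gb * high_per_gb


if __name__ == "__main__":
    raw, indexed = vector_storage_gb(num_vectors=1_000_000, dimensions=1536)
    low, high = monthly_storage_cost_range(indexed)
    print(f"raw {raw:.2f} GB, indexed {indexed:.2f} GB, ${low:.2f}-${high:.2f} per month")
```

Quantized or reduced-dimension vectors shrink `bytes_per_value` or `dimensions` directly, which is why the dimensions column in the model comparison matters for storage as well as for embedding price.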

💡 Recommendations
    📊 Cost across every embedding model

    Same corpus + queries, different embedding provider. Current selection highlighted.

    Model · Dimensions · Max input · Indexing cost · Monthly cost · Annual cost
    Query-side LLM cost → Margin calculator → Get a RAG architecture review →
    🎯 Use this result to
    📅 Schedule a call to apply this to your workload

    Go deeper

    Our playbooks on cutting this number.

    📚 RAG Cost Optimization
    Cut RAG bills 60%+ with the right patterns
    🧮 Full Cost Calculator
    Model the query-side LLM costs
    🗄 Vector DB Costs
    Pinecone vs pgvector vs Qdrant
    🔍 AI Model Finder
    Pick the RAG-side LLM

    The calculator's an estimate. Want the real number?

    A 5-day Quickscan ($1,500) reviews your actual usage across every pillar — financial, reliability, governance, privacy, MLOps, observability — and returns a concrete savings plan.

    Book a Quickscan →