RAG & embeddings cost

A retrieval pipeline has two costs: a one-time charge to embed your documents, then a per-query charge to feed retrieved context through an LLM. Embeddings are cheap; the generation step usually dominates. Here's the split.

Your pipeline

Embedding price examples: OpenAI small ~$0.02, large ~$0.13; Voyage/Cohere ~$0.10–0.12 per 1M. Set yours above.

Estimated cost

One-time indexing
Per query
Monthly (queries)
Monthly with caching