Caching & batch savings

Two of the biggest levers on an LLM bill are prompt caching (reused context is billed at roughly 10% of the standard input price) and the Batch API (50% off for asynchronous work). Enter your usage to see the difference.

Your usage

Caching helps when a large part of the prompt (system instructions, context, examples) repeats across requests. The Batch API suits work that can wait minutes to hours for results.

Monthly cost

Standard
With prompt caching
With Batch API
Caching + Batch
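
For readers who want to reproduce the four figures by hand, here is a minimal sketch of the arithmetic in Python. It uses only the two rates quoted above (cached input at about 10% of the input price, batch at 50% off). Everything else is an assumption made for illustration: the `Usage` fields, the per-million-token prices, the idea that every repeated token is a cache hit, the omission of any cache-write surcharge, and the choice to stack the two discounts multiplicatively. Check your provider's pricing page before relying on the numbers.

```python
from dataclasses import dataclass

# Rates quoted in the text above.
CACHE_READ_MULTIPLIER = 0.10  # cached input billed at ~10% of the input price
BATCH_DISCOUNT = 0.50         # Batch API: 50% off


@dataclass
class Usage:
    """Hypothetical calculator inputs; field names are illustrative only."""
    requests_per_month: int
    input_tokens_per_request: int
    output_tokens_per_request: int
    cached_fraction: float        # share of input tokens that repeat (0.0 to 1.0)
    input_price_per_mtok: float   # USD per million input tokens (assumed)
    output_price_per_mtok: float  # USD per million output tokens (assumed)


def monthly_costs(u: Usage) -> dict[str, float]:
    """Estimate monthly cost in USD for the four scenarios listed above."""
    input_mtok = u.requests_per_month * u.input_tokens_per_request / 1_000_000
    output_mtok = u.requests_per_month * u.output_tokens_per_request / 1_000_000

    input_cost = input_mtok * u.input_price_per_mtok
    output_cost = output_mtok * u.output_price_per_mtok

    # Assumption: every repeated token is a cache hit, and cache writes cost
    # nothing extra. Real bills are usually somewhat higher than this.
    cached_input_cost = input_cost * (
        (1 - u.cached_fraction) + u.cached_fraction * CACHE_READ_MULTIPLIER
    )

    standard = input_cost + output_cost
    with_caching = cached_input_cost + output_cost

    return {
        "Standard": standard,
        "With prompt caching": with_caching,
        "With Batch API": standard * (1 - BATCH_DISCOUNT),
        # Assumption: the batch discount applies on top of the cached price.
        "Caching + Batch": with_caching * (1 - BATCH_DISCOUNT),
    }


# Made-up example workload: 100,000 requests per month, 10,000 input and
# 500 output tokens each, 80% of the prompt repeated, $3 / $15 per MTok.
example = Usage(100_000, 10_000, 500, 0.8, 3.0, 15.0)
for label, cost in monthly_costs(example).items():
    print(f"{label}: ${cost:,.2f}")
```

With that made-up workload the sketch gives about $3,750 standard, $1,590 with caching, $1,875 with batch, and $795 with both. Caching only touches the input side, so its effect shrinks as output tokens take up more of the bill, while the batch discount applies to the whole total.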