Caching & batch savings
Two of the biggest levers on an LLM bill: prompt caching (re-used context billed at ~10% of input) and the Batch API (50% off for async work). Enter your usage to see the difference.
Monthly cost
Standard—
With prompt caching—
With Batch API—
Caching + Batch—