Cost & billing
Cached tokens
Input tokens the provider has already processed and can reuse at a steep discount, so repeating the same context costs much less the second time.
In practice
Reusing a long system prompt across calls bills the cached portion at a fraction of the normal input price.
Related terms
See what your tokens really cost
Track usage and spend across every model and platform, free.
Image: RDNE Stock project on Pexels. Definition free to reuse under CC BY 4.0.