Cost & billing

Cached tokens

Input tokens that the provider has already processed in an earlier request and can reuse. They are billed at a steep discount, so resending the same context costs much less on later calls than on the first.

In practice

Reusing a long system prompt across calls bills the cached portion at a fraction of the normal input price.
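As a rough illustration of the arithmetic, the sketch below compares the input cost of a first call with that of a repeat call where most of the prompt is served from cache. The function name `input_cost`, the price of $2.00 per million tokens, and the 75% cached discount are all hypothetical values chosen for the example. Actual rates and caching rules vary by provider and model.

```python
def input_cost(total_tokens, cached_tokens, price_per_million, cached_discount):
    """Return the input cost in dollars for one call.

    All parameters are illustrative assumptions, not any provider's real pricing:
    - price_per_million: normal input price in dollars per 1,000,000 tokens
    - cached_discount: fraction taken off that price for cached tokens (0.75 = 75% off)
    """
    if not 0 <= cached_tokens <= total_tokens:
        raise ValueError("cached_tokens must be between 0 and total_tokens")
    if not 0 <= cached_discount <= 1:
        raise ValueError("cached_discount must be between 0 and 1")

    uncached_tokens = total_tokens - cached_tokens
    cached_price = price_per_million * (1 - cached_discount)
    total = uncached_tokens * price_per_million + cached_tokens * cached_price
    return total / 1_000_000


# Hypothetical scenario: a 10,000-token prompt whose first 8,000 tokens
# are a system prompt reused on every call.
first_call = input_cost(10_000, 0, price_per_million=2.00, cached_discount=0.75)
repeat_call = input_cost(10_000, 8_000, price_per_million=2.00, cached_discount=0.75)

print(f"first call:  ${first_call:.4f}")
print(f"repeat call: ${repeat_call:.4f}")
```

With these assumed numbers, the repeat call pays full price only for the 2,000 new tokens, and the 8,000 cached tokens are charged at a quarter of the normal rate.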

Image: RDNE Stock project on Pexels. Definition free to reuse under CC BY 4.0.