Cost & billing concept illustration

Cost & billing

Rate limit

A cap a provider sets on how much you can use in a window of time, measured in requests or tokens per minute. Hit it and calls get rejected until the window resets.

In practice

A burst of traffic trips your rate limit and the API returns 429 errors.

Related terms

See what your tokens really cost

Track usage and spend across every model and platform, free.

Image: RDNE Stock project on Pexels. Definition free to reuse under CC BY 4.0.