Cost & billing
Rate limit
A cap a provider sets on how much you can use in a window of time, measured in requests or tokens per minute. Hit it and calls get rejected until the window resets.
In practice
A burst of traffic trips your rate limit and the API returns 429 errors.
Related terms
See what your tokens really cost
Track usage and spend across every model and platform, free.
Image: RDNE Stock project on Pexels. Definition free to reuse under CC BY 4.0.