Models & architecture
Attention
The mechanism that lets a model decide which other tokens to focus on when processing each token. It is why models can track context across a long passage.
In practice
When reading "the dog chased it", attention links "it" back to "the dog".
Related terms
See what your tokens really cost
Track usage and spend across every model and platform, free.
Image: Jakub Pabis on Pexels. Definition free to reuse under CC BY 4.0.