Transformer
The neural network design behind nearly every modern language model. Its attention mechanism lets the model weigh how much each token matters to every other token.
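The weighing described above can be sketched as scaled dot-product attention. This is a minimal illustration, not how any particular model implements it: the three 2-dimensional token vectors are made up, and the learned query, key, and value projections that a real Transformer applies are skipped, so the raw vectors play all three roles.

```python
import math

def softmax(scores):
    # Subtract the max before exponentiating for numerical stability.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    scale = math.sqrt(len(keys[0]))
    dim = len(values[0])
    weights, outputs = [], []
    for q in queries:
        # One weight per token: how much this query attends to each key.
        row = softmax([dot(q, k) / scale for k in keys])
        weights.append(row)
        # The output is the weighted average of the value vectors.
        outputs.append(
            [sum(w * v[i] for w, v in zip(row, values)) for i in range(dim)]
        )
    return outputs, weights

# Hypothetical embeddings for three tokens (assumption, for illustration only).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = attention(tokens, tokens, tokens)

for row in weights:
    print([round(w, 3) for w in row])
```

Each printed row sums to 1 and shows how strongly one token attends to every token, including itself. In a trained model these weights come from learned projections and are computed across many attention heads and layers.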
In practice
GPT stands for Generative Pre-trained Transformer: the "T" names this architecture.
Image: Jakub Pabis on Pexels. Definition free to reuse under CC BY 4.0.