Models & architecture concept illustration

Models & architecture

Transformer

The neural network design behind nearly every modern language model. Its attention mechanism lets the model weigh how much each token matters to every other token.

In practice

The "T" in GPT stands for Transformer.

Related terms

See what your tokens really cost

Track usage and spend across every model and platform, free.

Image: Jakub Pabis on Pexels. Definition free to reuse under CC BY 4.0.