Models & architecture concept illustration

Models & architecture

Multimodal

A model that handles more than one type of input or output, such as text plus images, audio, or video.

In practice

You upload a screenshot and ask a multimodal model to explain the error in it.

Related terms

See what your tokens really cost

Track usage and spend across every model and platform, free.

Image: Jakub Pabis on Pexels. Definition free to reuse under CC BY 4.0.