Models & architecture
Multimodal
A model that handles more than one type of input or output, such as text plus images, audio, or video.
In practice
You upload a screenshot and ask a multimodal model to explain the error in it.
Related terms
See what your tokens really cost
Track usage and spend across every model and platform, free.
Image: Jakub Pabis on Pexels. Definition free to reuse under CC BY 4.0.