OpenAIs flagship multimodal model. Fast, highly capable across text, vision, and audio. The most widely used frontier model.
Context Window
128K tokens
≈ 96K words
Input Price
$2.5
per 1M tokens
Output Price
$10
per 1M tokens
Released
May 2024
Fast multimodal tasks — text, vision, and audio
API identifier
gpt-4oCompare GPT-4o side-by-side with any other model
Compare models →