← All models
OpenAICurrent

GPT-4o

OpenAIs flagship multimodal model. Fast, highly capable across text, vision, and audio. The most widely used frontier model.

Context Window

128K tokens

≈ 96K words

Input Price

$2.5

per 1M tokens

Output Price

$10

per 1M tokens

Released

May 2024

Capabilities

Vision / Image inputTool / Function callingVoice / Audio

Best for

Fast multimodal tasks — text, vision, and audio

Strengths

  • Multimodal
  • Speed
  • Vision
  • Voice
  • Broad capability

API identifier

gpt-4o

Compare GPT-4o side-by-side with any other model

Compare models →