DeepSeek's frontier open-source model. 1.6T total / 49B active params with 1M context. Rivals top closed-source models in coding, math, and STEM reasoning.

1049K ctx · $0.435/M in · $0.87/M out

Google

20 models

Gemini 1.5 Flash

👁🔧

Google's fast, efficient multimodal model with a 1M-token context window. Excellent price-to-performance for most tasks.

1000K ctx · $0.075/M in · $0.3/M out

Gemini 1.5 Pro

👁🔧

Google's most capable Gemini 1.5 model. Its 2M-token context window is among the largest of any commercially available model. Excellent for long documents and video.

2000K ctx · $1.25/M in · $5/M out

Gemini 2.0 Flash

👁🔧

Next-generation fast model from Google. Improved performance over 1.5 Flash with added agentic capabilities and real-time information access.

1000K ctx · $0.1/M in · $0.4/M out

Gemini 2.0 Flash-Lite

👁🔧

1049K ctx · $0.075/M in · $0.3/M out

Gemini 2.0 Pro

👁🔧

Google's most capable Gemini 2.0 model. Strongest coding and world knowledge of the Gemini family. Designed for complex agentic tasks.

1000K ctx · $0/M in · $0/M out

Gemini 2.5 Flash

👁🔧

Low-latency, high-volume tasks that require reasoning with best price-performance.

1049K ctx · $0.3/M in · $2.5/M out

Gemini 2.5 Flash-Lite

👁🔧

The fastest and most budget-friendly multimodal model in the 2.5 family.

1049K ctx · $0.1/M in · $0.4/M out

Gemini 2.5 Pro

👁🔧

Complex tasks featuring deep reasoning and coding capabilities.

1049K ctx · $1.25/M in · $10/M out

Gemini 2.5 Pro Flash

👁🔧

Efficient processing of multimodal content with fast inference.

1049K ctx · $1.25/M in · $10/M out

Gemini 3 Flash

👁🔧

Frontier-class performance rivaling larger models at a fraction of the cost.

1049K ctx · $0.3/M in · $2.5/M out

Gemini 3 Pro

👁🔧

Complex multimodal tasks and advanced reasoning

1049K ctx

Gemini 3.1 Flash

👁🔧

Fast multimodal processing with image generation capabilities

1049K ctx · $0.3/M in · $2.5/M out

Gemini 3.1 Pro

👁🔧

Advanced intelligence, complex problem-solving, and powerful agentic and vibe coding capabilities.

1049K ctx · $2/M in · $12/M out

Gemini 3.5 Flash

👁🔧

Most intelligent model for sustained frontier performance on agentic and coding tasks.

1049K ctx · $1.5/M in · $9/M out

Gemini Deep Research

🔧

Autonomous multi-step research across hundreds of sources to produce cited, interactive reports.

1000K ctx · $3.5/M in · $10.5/M out

Gemini Deep Research Max

🔧

Maximum comprehensiveness for automated context gathering and synthesis across hundreds of sources.

1000K ctx · $5/M in · $15/M out

Gemini Robotics-ER 1.6

👁🔧

Advanced embodied reasoning model for robotic agents with spatial and physical reasoning.

Imagen 4

High-quality image generation and editing

Lyria 3

High-quality music generation from text prompts

Veo 3.1

Advanced video generation from text and image prompts

Microsoft

8 models

GPT-5.1 Chat

🔧

Preview reasoning-enabled chat model with built-in problem-solving for conversational interfaces.

128K ctx · $1.25/M in · $10/M out

GPT-5.1 Codex Max

👁🔧

Maximum reasoning effort code model with extended capabilities.

400K ctx · $1.25/M in · $10/M out

GPT-5.1 Codex Mini

👁🔧

Lightweight code-focused model for developer tooling.

400K ctx · $0.25/M in · $2/M out

GPT-5.3 Chat

🔧

Conversational AI with structured outputs and tool calling support.

128K ctx · $1.75/M in · $14/M out

GPT-chat-latest

👁🔧

Latest preview chat model with reasoning capabilities and structured outputs for general conversational AI tasks.

128K ctx · $5/M in · $30/M out

Microsoft Copilot

👁🔧

Microsoft's consumer and enterprise AI assistant powered by GPT-4o. Deeply integrated into Windows, Microsoft 365, Edge, and Bing.

128K ctx · $0/M in · $0/M out

Phi-4

🔧

Microsoft's small but mighty language model. Designed for on-device and edge deployment. Punches well above its weight class on reasoning tasks.

16K ctx · $0.07/M in · $0.14/M out

Sora 2

Video generation from text instructions.

OpenAI

27 models

GPT-3.5-TURBO

🔧

General-purpose language tasks

16K ctx · $0.5/M in · $1.5/M out

GPT-4

🔧

General-purpose language tasks

8K ctx · $30/M in · $60/M out

GPT-4 Turbo

👁🔧

Previous-generation flagship from OpenAI. Largely superseded by GPT-4o but still available. Solid performance on complex reasoning tasks.

128K ctx · $10/M in · $30/M out

GPT-4.1

👁🔧

Instruction-following and agentic coding workflows

1048K ctx · $2/M in · $8/M out

GPT-4.1 mini

👁🔧

Instruction-following and agentic coding workflows

1048K ctx · $0.4/M in · $1.6/M out

GPT-4.1 nano

👁🔧

Instruction-following and agentic coding workflows

1048K ctx · $0.1/M in · $0.4/M out

GPT-4o

👁🔧

OpenAI's flagship multimodal model. Fast, highly capable across text, vision, and audio. The most widely used frontier model.

128K ctx · $2.5/M in · $10/M out

GPT-4o mini

👁🔧

A smaller, cheaper version of GPT-4o. Surprisingly capable for its price point. Ideal for high-volume, cost-sensitive applications.

128K ctx · $0.15/M in · $0.6/M out

GPT-5

🔧

General-purpose language tasks

400K ctx · $1.25/M in · $10/M out

GPT-5-MINI

🔧

General-purpose language tasks

400K ctx · $0.25/M in · $2/M out

GPT-5-NANO

🔧

General-purpose language tasks

400K ctx · $0.05/M in · $0.4/M out

GPT-5-PRO

🔧

General-purpose language tasks

400K ctx · $15/M in · $120/M out

GPT-5.1

🔧

General-purpose language tasks

400K ctx · $1.25/M in · $10/M out

GPT-5.2

🔧

General-purpose language tasks

400K ctx · $1.75/M in · $14/M out

GPT-5.2-PRO

🔧

General-purpose language tasks

400K ctx · $21/M in · $168/M out

GPT-5.4

🔧

General-purpose language tasks

1050K ctx · $2.5/M in · $15/M out

GPT-5.4-MINI

🔧

General-purpose language tasks

400K ctx · $0.75/M in · $4.5/M out

GPT-5.4-NANO

🔧

General-purpose language tasks

400K ctx · $0.2/M in · $1.25/M out

GPT-5.4-PRO

🔧

General-purpose language tasks

1050K ctx · $30/M in · $180/M out

GPT-5.5

🔧

General-purpose language tasks

1050K ctx · $5/M in · $30/M out

GPT-5.5-PRO

🔧

General-purpose language tasks

1050K ctx · $30/M in · $180/M out

GPT-IMAGE-1

🔧

Image generation and editing from text and image prompts

$5/M in · $15/M out

o1

🔧

OpenAI's first reasoning model. Thinks before answering using chain-of-thought internally. Excels at math, science, and complex coding problems.

200K ctx · $15/M in · $60/M out

o1 Pro

🔧

Deep reasoning tasks requiring careful step-by-step thinking

200K ctx · $150/M in · $600/M out

o3

👁🔧

Advanced reasoning and complex problem-solving

200K ctx · $2/M in · $8/M out

o3 mini

👁🔧

Advanced reasoning and complex problem-solving

200K ctx · $1.1/M in · $4.4/M out

o4 mini

👁🔧

Advanced reasoning and complex problem-solving

200K ctx · $1.1/M in · $4.4/M out

Perplexity

2 models

Sonar

Perplexity's lightweight search-augmented model. Every response is grounded in live web search results with citations. Fast and cost-effective.

127K ctx · $1/M in · $1/M out

Sonar Pro

Perplexity's most powerful search model. Deeper research capability with more search queries per response and stronger reasoning over retrieved information.

127K ctx · $3/M in · $15/M out

xAI

6 models

Grok 2

👁🔧

xAI's capable conversational model with real-time internet access built in by default. Deep integration with X (Twitter) data and current events.

131K ctx · $2/M in · $10/M out

Grok 3

👁🔧

xAI's most advanced model. Significant leap in reasoning and intelligence. Trained on a massive compute cluster and competitive with frontier models.

131K ctx · $3/M in · $15/M out

Grok 4.20

🔧

Agentic tool calling tasks requiring low hallucination rates, strict prompt adherence, and fast response times.

2000K ctx · $2/M in · $6/M out

Grok 4.20 Non-Reasoning

Recommended for non-reasoning workloads as a replacement for deprecated models.

Grok 4.3

🔧

Focused on truthful, insightful answers.

1000K ctx · $1.25/M in · $2.5/M out

Grok Build 0.1

🔧

Fast coding model trained specifically for agentic coding workflows.

256K ctx · $1/M in · $2/M out
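
The "/M in" and "/M out" figures in the listings above are prices per million input and output tokens. As a rough sketch of how they translate into the cost of a request, the snippet below multiplies token counts by those rates. The `Pricing` class and `estimate_cost` function are hypothetical names introduced for illustration, the two price pairs are copied from the Gemini 2.5 Flash and GPT-4o mini entries above, and the token counts are made up. Actual bills may differ, since this ignores anything the listing does not state (caching discounts, tiered rates, and so on).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD, as shown in the catalog entries."""
    input_per_million: float
    output_per_million: float


def estimate_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a request from token counts (illustrative only)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# Prices copied from the listing: "$0.3/M in, $2.5/M out" and "$0.15/M in, $0.6/M out".
GEMINI_2_5_FLASH = Pricing(input_per_million=0.30, output_per_million=2.50)
GPT_4O_MINI = Pricing(input_per_million=0.15, output_per_million=0.60)

if __name__ == "__main__":
    # Hypothetical workload: 200K input tokens and 50K output tokens.
    for name, pricing in [("Gemini 2.5 Flash", GEMINI_2_5_FLASH),
                          ("GPT-4o mini", GPT_4O_MINI)]:
        cost = estimate_cost(pricing, input_tokens=200_000, output_tokens=50_000)
        print(f"{name}: ${cost:.4f}")
```

Because output tokens are usually priced several times higher than input tokens, the output side tends to dominate the estimate for generation-heavy workloads.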

AI Model Encyclopedia
