xAI Models — Pricing, Benchmarks & Capabilities

Metric	Input	Output
Cheapest	$0.20	$0.50
Average	$1.21	$5.82
Most Expensive	$5.00	$25.00

xAIGrok 4

#30

Grok 4 1 Fast Reasoning

A frontier multimodal model optimized specifically for high-performance agentic tool calling.

Context

2.0M

Max Output

—

Input/1M

$0.20

👁 Vision🔧 Tools⚡ Cache🖥 Computer

Pricing (per 1M tokens)

Requesty★$0.20 / $0.50

Vercel AI$0.20 / $0.50

Martian$0.20 / $0.50

2025-07-09View details →

xAIGrok 4

#39

Grok 4 Fast

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's [news post](http://x.ai/news/grok-4-fast). Reasoning can be enabled/disabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens)

Context

2.0M

Max Output

30K

Input/1M

$0.20

🔧 Tools⚡ Cache

Pricing (per 1M tokens)

Requesty★$0.20 / $0.50

OpenRouter$0.20 / $0.50

2025-09-19View details →

xAIGrok 3

#54

Grok 3

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science.

Context

131K

Max Output

—

Input/1M

$3.00

🔧 Tools

Pricing (per 1M tokens)

Requesty★$5.00 / $25.00

OpenRouter$3.00 / $15.00

Vercel AI$3.00 / $15.00

Martian$3.00 / $15.00

2025-06-10View details →

xAIGrok 4

#58

Grok 4

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not exposed, reasoning cannot be disabled, and the reasoning effort cannot be specified. Pricing increases once the total tokens in a given request is greater than 128k tokens. See more details on the [xAI docs](https://docs.x.ai/docs/models/grok-4-0709)

Context

256K

Max Output

—

Input/1M

$3.00

👁 Vision🔧 Tools⚡ Cache🖥 Computer

Pricing (per 1M tokens)

Requesty★$3.00 / $15.00

OpenRouter$3.00 / $15.00

Vercel AI$3.00 / $15.00

Martian$3.00 / $15.00

2025-07-09View details →

xAIGrok 4

#61

Grok 4 Fast Reasoning

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning.

Context

2.0M

Max Output

256K

Input/1M

$0.20

🧠 Reasoning🔧 Tools

Pricing (per 1M tokens)

Vercel AI$0.20 / $0.50

Martian$0.20 / $0.50

2025-07-09View details →

xAIGrok 3

#105

Grok 3 Mini

A lightweight model that thinks before responding. Fast, smart, and great for logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible.

Context

131K

Max Output

—

Input/1M

$0.30

🧠 Reasoning🔧 Tools

Pricing (per 1M tokens)

Requesty★$0.30 / $0.50

OpenRouter$0.30 / $0.50

Vercel AI$0.30 / $0.50

Martian$0.30 / $0.50

2025-06-10View details →

xAI

Grok Imagine Image

Generate high-quality images from text prompts with xAI's imagine API.

Context

0

Max Output

—

Input/1M

Free

Pricing (per 1M tokens)

Vercel AIFree / Free

2026-01-28View details →

xAI

Grok Imagine Image Pro

Generate high-quality images from text prompts with xAI's imagine API.

Context

0

Max Output

—

Input/1M

Free

Pricing (per 1M tokens)

Vercel AIFree / Free

2026-01-28View details →

xAIGrok 4.1

xAI: Grok 4.1 Fast

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens)

Context

2.0M

Max Output

30K

Input/1M

$0.20

👁 Vision🧠 Reasoning🔧 Tools⚡ Cache

Pricing (per 1M tokens)

OpenRouter$0.20 / $0.50

2025-11-19View details →

xAIGrok 4

Grok 4 Fast Non Reasoning

xAI's latest advancement in cost-efficient reasoning models

Context

2.0M

Max Output

—

Input/1M

$0.20

🔧 Tools⚡ Cache🖥 Computer

Pricing (per 1M tokens)

Requesty★$0.20 / $0.50

Vercel AI$0.20 / $0.50

Martian$0.20 / $0.50

2025-09-19View details →

xAI

Grok Code Fast 1

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality work flows.

Context

256K

Max Output

10K

Input/1M

$0.20

👁 Vision🔧 Tools⚡ Cache

Pricing (per 1M tokens)

Requesty★$0.20 / $1.50

OpenRouter$0.20 / $1.50

Vercel AI$0.20 / $1.50

Martian$0.20 / $1.50

2025-08-26View details →

xAIGrok 4

Grok 4 1 Fast Non Reasoning

A frontier multimodal model optimized specifically for high-performance agentic tool calling.

Context

2.0M

Max Output

—

Input/1M

$0.20

👁 Vision🔧 Tools⚡ Cache🖥 Computer

Pricing (per 1M tokens)

Requesty★$0.20 / $0.50

Vercel AI$0.20 / $0.50

Martian$0.20 / $0.50

2025-07-09View details →

xAIGrok 3

xAI: Grok 3 Mini Beta

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that generate answers immediately, Grok 3 Mini thinks before responding. It’s ideal for reasoning-heavy tasks that don’t demand extensive domain knowledge, and shines in math-specific and quantitative use cases, such as solving challenging puzzles or math problems. Transparent "thinking" traces accessible. Defaults to low reasoning, can boost with setting `reasoning: { effort: "high" }` Note: That there are two xAI endpoints for this model. By default when using this model we will always route you to the base endpoint. If you want the fast endpoint you can add `provider: { sort: throughput}`, to sort by throughput instead.

Context

131K

Max Output

—

Input/1M

$0.30

🧠 Reasoning🔧 Tools⚡ Cache

Pricing (per 1M tokens)

OpenRouter$0.30 / $0.50

2025-04-09View details →

xAIGrok 3

xAI: Grok 3 Beta

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science. Excels in structured tasks and benchmarks like GPQA, LCB, and MMLU-Pro where it outperforms Grok 3 Mini even on high thinking. Note: That there are two xAI endpoints for this model. By default when using this model we will always route you to the base endpoint. If you want the fast endpoint you can add `provider: { sort: throughput}`, to sort by throughput instead.

Context

131K

Max Output

—

Input/1M

$3.00

🔧 Tools⚡ Cache

Pricing (per 1M tokens)

OpenRouter$3.00 / $15.00

2025-04-09View details →

xAIGrok 3

Grok 3 Fast Beta

xAI's flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science. The fast model variant is served on faster infrastructure, offering response times that are significantly faster than the standard. The increased speed comes at a higher cost per output token.

Context

131K

Max Output

131K

Input/1M

$5.00

🔧 Tools

Pricing (per 1M tokens)

Vercel AI$5.00 / $25.00

2025-02-17View details →

xAIGrok 3

Grok 3 Mini Fast Beta

xAI's lightweight model that thinks before responding. Great for simple or logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible. The fast model variant is served on faster infrastructure, offering response times that are significantly faster than the standard. The increased speed comes at a higher cost per output token.

Context

131K

Max Output

131K

Input/1M

$0.60

🔧 Tools

Pricing (per 1M tokens)

Vercel AI$0.60 / $4.00

2025-02-17View details →

xAIGrok 2

Grok 2 Vision

Grok 2 vision model excels in vision-based tasks, delivering state-of-the-art performance in visual math reasoning (MathVista) and document-based question answering (DocVQA). It can process a wide variety of visual information including documents, diagrams, charts, screenshots, and photographs.

Context

33K

Max Output

33K

Input/1M

$2.00

🔧 Tools

Pricing (per 1M tokens)

Vercel AI$2.00 / $10.00

2024-08-20View details →

xAIGrok 2

Grok 2 1212

x AI's Our previous generation chat model.

Context

131K

Max Output

—

Input/1M

$2.00

🔧 Tools

Pricing (per 1M tokens)

Requesty★$2.00 / $10.00

View details →

xAI

$ Pricing Summary(per 1M tokens)

⚙ Capabilities

🤖 All xAI Models(18)

Grok 4 1 Fast Reasoning

Grok 4 Fast

Grok 3

Grok 4

Grok 4 Fast Reasoning

Grok 3 Mini

Grok Imagine Image

Grok Imagine Image Pro

xAI: Grok 4.1 Fast

Grok 4 Fast Non Reasoning

Grok Code Fast 1

Grok 4 1 Fast Non Reasoning

xAI: Grok 3 Mini Beta

xAI: Grok 3 Beta

Grok 3 Fast Beta

Grok 3 Mini Fast Beta

Grok 2 Vision

Grok 2 1212

xAI

$ Pricing Summary(per 1M tokens)

⚙ Capabilities

🤖 All xAI Models(18)

Grok 4 1 Fast Reasoning

Grok 4 Fast

Grok 3

Grok 4

Grok 4 Fast Reasoning

Grok 3 Mini

Grok Imagine Image

Grok Imagine Image Pro

xAI: Grok 4.1 Fast

Grok 4 Fast Non Reasoning

Grok Code Fast 1

Grok 4 1 Fast Non Reasoning

xAI: Grok 3 Mini Beta

xAI: Grok 3 Beta

Grok 3 Fast Beta

Grok 3 Mini Fast Beta

Grok 2 Vision

Grok 2 1212