OpenAIo3Arena #29Apr 16, 2025
o3
o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.
Context Window
200K
tokens
Max Output
100K
tokens
Released
Apr 16, 2025
Arena Rank
#29
of 305 models
Output Speed
118
tokens/sec
Time to First Token
18.2s
TTFT
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
Reasoning
Extended thinking
Response Format
JSON mode / structured output
Seed
Deterministic outputs
structured_outputs
Tool Choice
Control tool usage
Tool Calling
Function calling support
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| Requesty★ | $1.00 | $4.00 | $0.25 |
| OpenRouter | $2.00 | $8.00 | $0.50 |
| Vercel AI | $2.00 | $8.00 | — |
| Martian | $2.00 | $8.00 | $0.50 |
Benchmarks
Artificial Analysis
Intelligence IndexArtificial Analysis
78/100Coding IndexArtificial Analysis
72/100Math IndexArtificial Analysis
96/100MMLU-PROArtificial Analysis
0.818/1GPQA DiamondArtificial Analysis
0.838/1MATH-500Artificial Analysis
0.978/1AIME 2024Artificial Analysis
0.917/1Humanity's Last ExamArtificial Analysis
0.208/1LiveCodeBenchArtificial Analysis
0.772/1SciCodeArtificial Analysis
0.462/1Aider Polyglot
Aider PolyglotAider Polyglot
40.9/100Model IDs
Requesty
openai/o3:flexOpenRouter
openai/o3Tags
visionreasoningtool-callingcaching