AnthropicClaude 4.1Arena #19Aug 5, 2025

Claude Opus 4.1

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains in multi-file code refactoring, debugging precision, and detail-oriented reasoning. The model supports extended thinking up to 64K tokens and is optimized for tasks involving research, data analysis, and tool-assisted reasoning.

Context Window
200K
tokens
Max Output
32K
tokens
Released
Aug 5, 2025
Arena Rank
#19
of 305 models
Output Speed
68
tokens/sec
Time to First Token
1.8s
TTFT

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
Reasoning
Extended thinking
Response Format
JSON mode / structured output
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
Requesty$15.00$75.00$1.50
OpenRouter$15.00$75.00$1.50
Vercel AI$15.00$75.00
Martian$15.00$75.00$1.50

Benchmarks

Artificial Analysis
Intelligence IndexArtificial Analysis
72/100
Coding IndexArtificial Analysis
68/100
Math IndexArtificial Analysis
80/100
MMLU-PROArtificial Analysis
0.795/1
GPQA DiamondArtificial Analysis
0.715/1
MATH-500Artificial Analysis
0.905/1
AIME 2024Artificial Analysis
0.55/1
Humanity's Last ExamArtificial Analysis
0.112/1
LiveCodeBenchArtificial Analysis
0.685/1
SciCodeArtificial Analysis
0.395/1

Model IDs

Requestyanthropic/claude-opus-4-1
OpenRouteranthropic/claude-opus-4.1

Tags

visionreasoningtool-callingcachingcomputer-use

Available Regions

EUUS
Compare with another model

Related Models