AnthropicClaude 4Arena #77May 22, 2025

Claude Sonnet 4

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%), Sonnet 4 balances capability and computational efficiency, making it suitable for a broad range of applications from routine coding tasks to complex software development projects. Key enhancements include improved autonomous codebase navigation, reduced error rates in agent-driven workflows, and increased reliability in following intricate instructions. Sonnet 4 is optimized for practical everyday use, providing advanced reasoning capabilities while maintaining efficiency and responsiveness in diverse internal and external scenarios. Read more at the blog post here

Context Window
1.0M
tokens
Max Output
64K
tokens
Released
May 22, 2025
Arena Rank
#77
of 305 models
Output Speed
82
tokens/sec
Time to First Token
1.2s
TTFT

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
Reasoning
Extended thinking
Stop Sequences
Custom stop tokens
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
Requesty$3.00$15.00$0.30
OpenRouter$3.00$15.00$0.30
Vercel AI$3.00$15.00
Martian$3.00$15.00$0.30

Benchmarks

Artificial Analysis
Intelligence IndexArtificial Analysis
67/100
Coding IndexArtificial Analysis
65/100
Math IndexArtificial Analysis
72/100
MMLU-PROArtificial Analysis
0.775/1
GPQA DiamondArtificial Analysis
0.685/1
MATH-500Artificial Analysis
0.872/1
AIME 2024Artificial Analysis
0.42/1
Humanity's Last ExamArtificial Analysis
0.075/1
LiveCodeBenchArtificial Analysis
0.648/1
SciCodeArtificial Analysis
0.355/1

Model IDs

Requestybedrock/claude-sonnet-4@us-east-2
OpenRouteranthropic/claude-sonnet-4

Tags

visionreasoningtool-callingcachingcomputer-use

Available Regions

EUUS
Compare with another model

Related Models