GoogleGemini 2.5Arena #59Jun 17, 2025

Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Context Window
1.0M
tokens
Max Output
66K
tokens
Released
Jun 17, 2025
Arena Rank
#59
of 305 models
Output Speed
312
tokens/sec
Time to First Token
450ms
TTFT

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
Reasoning
Extended thinking
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
Requesty$0.30$2.50$0.07
OpenRouter$0.30$2.50$0.03
Vercel AI$0.30$2.50
Martian$0.30$2.50
DeepInfra$0.30$2.50

Benchmarks

Artificial Analysis
Intelligence IndexArtificial Analysis
65/100
Coding IndexArtificial Analysis
58/100
Math IndexArtificial Analysis
82/100
MMLU-PROArtificial Analysis
0.775/1
GPQA DiamondArtificial Analysis
0.658/1
MATH-500Artificial Analysis
0.928/1
AIME 2024Artificial Analysis
0.62/1
Humanity's Last ExamArtificial Analysis
0.065/1
LiveCodeBenchArtificial Analysis
0.598/1
SciCodeArtificial Analysis
0.315/1

Model IDs

Requestygoogle/gemini-2.5-flash
OpenRoutergoogle/gemini-2.5-flash

Tags

visionreasoningtool-callingcaching

Available Regions

EUUS
Compare with another model

Related Models