GoogleGemini 2.0Feb 5, 2025
Gemini 2.0 Flash 001
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
Context Window
1.0M
tokens
Max Output
8K
tokens
Released
Feb 5, 2025
Arena Rank
—
Output Speed
285
tokens/sec
Time to First Token
320ms
TTFT
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Max Tokens
Output length limit
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top P
Nucleus sampling
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| Requesty★ | $0.10 | $0.40 | $0.10 |
| OpenRouter | $0.10 | $0.40 | $0.02 |
| Martian | $0.10 | $0.40 | — |
Benchmarks
Artificial Analysis
Intelligence IndexArtificial Analysis
53/100Coding IndexArtificial Analysis
44/100Math IndexArtificial Analysis
58/100MMLU-PROArtificial Analysis
0.708/1GPQA DiamondArtificial Analysis
0.518/1MATH-500Artificial Analysis
0.748/1AIME 2024Artificial Analysis
0.08/1LiveCodeBenchArtificial Analysis
0.378/1SciCodeArtificial Analysis
0.148/1Model IDs
Requesty
google/gemini-2.0-flash-001OpenRouter
google/gemini-2.0-flash-001Tags
visiontool-calling