GoogleGemini 2.0Feb 25, 2025
Google: Gemini 2.0 Flash Lite
Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5, all at extremely economical token prices.
Context Window
1.0M
tokens
Max Output
8K
tokens
Released
Feb 25, 2025
Arena Rank
—
Output Speed
285
tokens/sec
Time to First Token
320ms
TTFT
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Max Tokens
Output length limit
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top P
Nucleus sampling
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| OpenRouter | $0.07 | $0.30 | — |
| Martian | $0.07 | $0.30 | — |
Benchmarks
Artificial Analysis
Intelligence IndexArtificial Analysis
53/100Coding IndexArtificial Analysis
44/100Math IndexArtificial Analysis
58/100MMLU-PROArtificial Analysis
0.708/1GPQA DiamondArtificial Analysis
0.518/1MATH-500Artificial Analysis
0.748/1AIME 2024Artificial Analysis
0.08/1LiveCodeBenchArtificial Analysis
0.378/1SciCodeArtificial Analysis
0.148/1Model IDs
OpenRouter
google/gemini-2.0-flash-lite-001Tags
visiontool-calling