GoogleGemini 2.0Feb 25, 2025

Google: Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5, all at extremely economical token prices.

Context Window
1.0M
tokens
Max Output
8K
tokens
Released
Feb 25, 2025
Arena Rank
Output Speed
285
tokens/sec
Time to First Token
320ms
TTFT

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Max Tokens
Output length limit
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouter$0.07$0.30
Martian$0.07$0.30

Benchmarks

Artificial Analysis
Intelligence IndexArtificial Analysis
53/100
Coding IndexArtificial Analysis
44/100
Math IndexArtificial Analysis
58/100
MMLU-PROArtificial Analysis
0.708/1
GPQA DiamondArtificial Analysis
0.518/1
MATH-500Artificial Analysis
0.748/1
AIME 2024Artificial Analysis
0.08/1
LiveCodeBenchArtificial Analysis
0.378/1
SciCodeArtificial Analysis
0.148/1

Model IDs

OpenRoutergoogle/gemini-2.0-flash-lite-001

Tags

visiontool-calling
Compare with another model

Related Models