MistralOct 1, 2024
Ministral 3B
A compact, efficient model for on-device tasks like smart assistants and local analytics, offering low-latency performance.
Context Window
128K
tokens
Max Output
4K
tokens
Released
Oct 1, 2024
Arena Rank
—
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| Vercel AI | $0.04 | $0.04 | — |
Benchmarks
Open LLM Leaderboard
AverageOpen LLM Leaderboard
3.52/100IFEvalOpen LLM Leaderboard
13.58/100BBHOpen LLM Leaderboard
4.68/100MATH Lvl 5Open LLM Leaderboard
0.83/100GPQAOpen LLM Leaderboard
0.22/100MUSROpen LLM Leaderboard
0.78/100MMLU-PROOpen LLM Leaderboard
1.03/100Model IDs
Tags
tool-use
Related Models
Other#66
Meituan: LongCat Flash Chat
131K ctxFree/1M in
Other#72
Xiaomi: MiMo-V2-Flash
262K ctx$0.09/1M in
Other#106
Prime Intellect: INTELLECT-3
131K ctx$0.20/1M in
OpenAI#108
OpenAI: gpt-oss-120b (free)
131K ctxFree/1M in
Other#139
AllenAI: Olmo 3.1 32B Instruct
66K ctx$0.20/1M in
OpenAI#159
OpenAI: gpt-oss-20b (free)
131K ctxFree/1M in