OtherOpen SourceJan 9, 2026
AllenAI: Molmo2 8B
Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.
Context Window
37K
tokens
Max Output
37K
tokens
Released
Jan 9, 2026
Arena Rank
—
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Frequency Penalty
Reduce repetition
Logit Bias
Adjust token weights
Max Tokens
Output length limit
Presence Penalty
Encourage new topics
Repetition Penalty
Penalize repeated tokens
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
Temperature
Controls randomness
Top K
Top-K sampling
Top P
Nucleus sampling
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| OpenRouter | $0.20 | $0.20 | — |
Model IDs
OpenRouter
allenai/molmo-2-8bHugging Faceallenai/Molmo2-8B ↗
Tags
vision
Related Models
Other#66
Meituan: LongCat Flash Chat
131K ctxFree/1M in
Other#72
Xiaomi: MiMo-V2-Flash
262K ctx$0.09/1M in
Other#106
Prime Intellect: INTELLECT-3
131K ctx$0.20/1M in
OpenAI#108
OpenAI: gpt-oss-120b (free)
131K ctxFree/1M in
Other#139
AllenAI: Olmo 3.1 32B Instruct
66K ctx$0.20/1M in
OpenAI#159
OpenAI: gpt-oss-20b (free)
131K ctxFree/1M in