OtherOpen SourceJan 9, 2026

AllenAI: Molmo2 8B

Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.

Context Window
37K
tokens
Max Output
37K
tokens
Released
Jan 9, 2026
Arena Rank

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Frequency Penalty
Reduce repetition
Logit Bias
Adjust token weights
Max Tokens
Output length limit
Presence Penalty
Encourage new topics
Repetition Penalty
Penalize repeated tokens
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
Temperature
Controls randomness
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouter$0.20$0.20

Model IDs

OpenRouterallenai/molmo-2-8b

Tags

vision
Compare with another model

Related Models