Grok 2 vision model excels in vision-based tasks, delivering state-of-the-art performance in visual math reasoning (MathVista) and document-based question answering (DocVQA). It can process a wide variety of visual information including documents, diagrams, charts, screenshots, and photographs.
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| Vercel AI | $2.00 | $10.00 | — |
Ranked by provider, pricing, capabilities, and arena performance
Same family · Similar price
Same provider · Similar price
Same provider · Similar price
Similar price · Both support tools
Similar price · Both support tools
Similar price · Both support tools