Instruction-tuned generative model for image reasoning (text and images in, text out), optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.
| Router | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Cached input (USD / 1M tokens) |
|---|---|---|---|
| Vercel AI | $0.16 | $0.16 | — |
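To make the per-million-token rates concrete, here is a minimal sketch of the cost arithmetic. It uses only the two rates from the table; the helper name `estimate_cost_usd` and the example token counts are hypothetical, and how an image is converted into input tokens is not stated here, so the token counts are assumed to be known already.

```python
# Rates copied from the pricing table, in USD per 1M tokens.
INPUT_USD_PER_M = 0.16
OUTPUT_USD_PER_M = 0.16


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper).

    Assumes billing is linear in token count and that image inputs have
    already been counted as part of input_tokens.
    """
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative token counts, not measured values.
    print(f"${estimate_cost_usd(12_000, 800):.6f}")
```

Because the input and output rates are equal in this listing, the estimate depends only on the total token count; the two rates are kept separate so the sketch still applies if they diverge.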