A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional text and vision understanding through its innovative heterogeneous MoE structure with modality-isolated routing. Built with scaling-efficient infrastructure for high-throughput training and inference, the model leverages advanced post-training techniques including SFT, DPO, and UPO for optimized performance, while supporting an impressive 131K context length and RLVR alignment for superior cross-modal reasoning and generation capabilities.
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| OpenRouter | $0.14 | $0.56 | — |
| Martian | $0.14 | $0.56 | — |
baidu/ernie-4.5-vl-28b-a3bRanked by provider, pricing, capabilities, and arena performance
Similar price · Both support reasoning & vision
Same provider · Similar price
Similar price · Both support vision & tools
Similar price · Both support reasoning & tools
Similar price · Both support reasoning & tools
Similar price · Both support vision & tools