LLM Router
HomeRoutersModelsProvidersBenchmarksPricingCompareBlogAbout
HomeModelsBenchmarksPricingCompareBlog
LLM Router

Independent comparison platform for LLM routing infrastructure.

Platform

  • Home
  • Routers
  • Models
  • Pricing
  • Blog
  • About

Routers

  • Requesty
  • OpenRouter
  • Martian
  • Unify
  • LiteLLM

© 2026 LLM Router

Data from public sources. May not reflect real-time pricing.

Providers›Baidu

Baidu

↗ Website

Baidu develops the ERNIE (Enhanced Representation through Knowledge Integration) family of AI models. As one of China's largest tech companies, Baidu's models combine large-scale language understanding with deep knowledge integration.

Pricing available from OpenRouter, Martian.

Total Models
5
Arena Ranked
0
Open Source
0
Cheapest Input
$0.07
per 1M tokens

$ Pricing Summary(per 1M tokens)

MetricInputOutput
Cheapest$0.07$0.28
Average$0.20$0.69
Most Expensive$0.42$1.25

⚙ Capabilities

👁
Vision
2
of 5 models
🧠
Reasoning
3
of 5 models
🔧
Tool Calling
2
of 5 models
⚡
Prompt Caching
0
of 5 models
🖥
Computer Use
0
of 5 models
🎨
Image Generation
0
of 5 models

🤖 All Baidu Models(5)

Baidu

Baidu: ERNIE 4.5 21B A3B Thinking

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.

Context
131K
Max Output
66K
Input/1M
$0.07
🧠 Reasoning
Pricing (per 1M tokens)
OpenRouter$0.07 / $0.28
Martian$0.07 / $0.28
2025-10-09View details →
Baidu

Baidu: ERNIE 4.5 21B A3B

A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an extensive 131K token context length, the model achieves efficient inference via multi-expert parallel collaboration and quantization, while advanced post-training techniques including SFT, DPO, and UPO ensure optimized performance across diverse applications with specialized routing and balancing losses for superior task handling.

Context
120K
Max Output
8K
Input/1M
$0.07
🔧 Tools
Pricing (per 1M tokens)
OpenRouter$0.07 / $0.28
Martian$0.07 / $0.28
2025-08-12View details →
Baidu

Baidu: ERNIE 4.5 VL 28B A3B

A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional text and vision understanding through its innovative heterogeneous MoE structure with modality-isolated routing. Built with scaling-efficient infrastructure for high-throughput training and inference, the model leverages advanced post-training techniques including SFT, DPO, and UPO for optimized performance, while supporting an impressive 131K context length and RLVR alignment for superior cross-modal reasoning and generation capabilities.

Context
30K
Max Output
8K
Input/1M
$0.14
👁 Vision🧠 Reasoning🔧 Tools
Pricing (per 1M tokens)
OpenRouter$0.14 / $0.56
Martian$0.14 / $0.56
2025-08-12View details →
Baidu

Baidu: ERNIE 4.5 VL 424B A47B

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data using a heterogeneous MoE architecture and modality-isolated routing to enable high-fidelity cross-modal reasoning, image understanding, and long-context generation (up to 131k tokens). Fine-tuned with techniques like SFT, DPO, UPO, and RLVR, this model supports both “thinking” and non-thinking inference modes. Designed for vision-language tasks in English and Chinese, it is optimized for efficient scaling and can operate under 4-bit/8-bit quantization.

Context
123K
Max Output
16K
Input/1M
$0.42
👁 Vision🧠 Reasoning
Pricing (per 1M tokens)
OpenRouter$0.42 / $1.25
Martian$0.42 / $1.25
2025-06-30View details →
Baidu

Baidu: ERNIE 4.5 300B A47B

ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (MoE) language model developed by Baidu as part of the ERNIE 4.5 series. It activates 47B parameters per token and supports text generation in both English and Chinese. Optimized for high-throughput inference and efficient scaling, it uses a heterogeneous MoE structure with advanced routing and quantization strategies, including FP8 and 2-bit formats. This version is fine-tuned for language-only tasks and supports reasoning, tool parameters, and extended context lengths up to 131k tokens. Suitable for general-purpose LLM applications with high reasoning and throughput demands.

Context
123K
Max Output
12K
Input/1M
$0.28
Pricing (per 1M tokens)
OpenRouter$0.28 / $1.10
Martian$0.28 / $1.10
2025-06-30View details →
← Back to all providers