baidu/ernie-4.5-21b-a3b is a text-based Mixture-of-Experts (MoE) model with 21B total parameters and 3B activated per token. It builds on the ERNIE 4.5 family's heterogeneous MoE architecture, which combines modality-isolated routing with specialized routing and balancing losses, and supports a 131K-token context length. Inference efficiency comes from multi-expert parallel collaboration and quantization, while post-training with SFT, DPO, and UPO tunes the model for a broad range of text understanding and generation tasks.
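For reference, the routers listed below expose OpenAI-compatible chat completions endpoints. The following is a minimal sketch assuming OpenRouter's standard base URL; the API key and prompt are placeholders, not values from this page.

```python
from openai import OpenAI

# Assumes an OpenAI-compatible router endpoint (OpenRouter shown here).
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="YOUR_OPENROUTER_API_KEY",  # placeholder
)

response = client.chat.completions.create(
    model="baidu/ernie-4.5-21b-a3b",
    messages=[
        {"role": "user", "content": "Summarize mixture-of-experts routing in two sentences."}
    ],
    max_tokens=256,
)
print(response.choices[0].message.content)
```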
| Router | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Cached Input (USD / 1M tokens) |
|---|---|---|---|
| OpenRouter | $0.07 | $0.28 | — |
| Martian | $0.07 | $0.28 | — |
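As a quick sanity check on the listed rates, a rough cost estimate (assuming prices are USD per 1M tokens and ignoring cached-input pricing, which is not listed):

```python
# Listed OpenRouter/Martian rates, USD per 1M tokens (assumed billing unit).
INPUT_PER_M = 0.07
OUTPUT_PER_M = 0.28

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD at the listed rates."""
    return (input_tokens / 1_000_000) * INPUT_PER_M + (output_tokens / 1_000_000) * OUTPUT_PER_M

# e.g. a 100K-token prompt (well within the 131K context) with a 2K-token reply
print(f"${estimate_cost(100_000, 2_000):.4f}")  # ~$0.0076
```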