LLM Router
HomeRoutersModelsProvidersBenchmarksPricingCompareBlogAbout
HomeModelsBenchmarksPricingCompareBlog
LLM Router

Independent comparison platform for LLM routing infrastructure.

Platform

  • Home
  • Routers
  • Models
  • Pricing
  • Blog
  • About

Routers

  • Requesty
  • OpenRouter
  • Martian
  • Unify
  • LiteLLM

© 2026 LLM Router

Data from public sources. May not reflect real-time pricing.

QwenQwenOpen SourceSep 19, 2024

Qwen2.5 72B Instruct

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and mathematics, thanks to our specialized expert models in these domains. - Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g, tables), and generating structured outputs especially JSON. More resilient to the diversity of system prompts, enhancing role-play implementation and condition-setting for chatbots. - Long-context Support up to 128K tokens and can generate up to 8K tokens. - Multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more. Usage of this model is subject to Tongyi Qianwen LICENSE AGREEMENT.

Context Window
33K
tokens
Max Output
16K
tokens
Released
Sep 19, 2024
Arena Rank
—
Output Speed
92
tokens/sec
Time to First Token
580ms
TTFT

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Frequency Penalty
Reduce repetition
Logit Bias
Adjust token weights
Max Tokens
Output length limit
min_p
Presence Penalty
Encourage new topics
Repetition Penalty
Penalize repeated tokens
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouter$0.12$0.39—
Martian$0.12$0.39—

Benchmarks

Open LLM Leaderboard
AverageOpen LLM Leaderboard
47.98/100
IFEvalOpen LLM Leaderboard
86.38/100
BBHOpen LLM Leaderboard
61.87/100
MATH Lvl 5Open LLM Leaderboard
59.82/100
GPQAOpen LLM Leaderboard
16.67/100
MUSROpen LLM Leaderboard
11.74/100
MMLU-PROOpen LLM Leaderboard
51.4/100
Artificial Analysis
Intelligence IndexArtificial Analysis
48/100
Coding IndexArtificial Analysis
42/100
Math IndexArtificial Analysis
52/100
MMLU-PROArtificial Analysis
0.668/1
GPQA DiamondArtificial Analysis
0.458/1
MATH-500Artificial Analysis
0.718/1
AIME 2024Artificial Analysis
0.067/1
LiveCodeBenchArtificial Analysis
0.368/1

Model IDs

OpenRouterqwen/qwen-2.5-72b-instruct
Hugging FaceQwen/Qwen2.5-72B-Instruct ↗

Tags

tool-calling
Compare with another model

Compare with…

Parasail Qwen3 235b A22b Instruct 2507Qwen/Qwen3 32BQwen: Qwen3 VL 30B A3B Instruct

Similar Models

Ranked by provider, pricing, capabilities, and arena performance

Qwen
70% match

Parasail Qwen3 235b A22b Instruct 2507

262K ctx$0.15/1M in

Same provider · Similar price

Qwen
70% match

Qwen/Qwen3 32B

40K ctx$0.10/1M in

Same provider · Similar price

Qwen
69% match

Qwen: Qwen3 VL 30B A3B Instruct

131K ctx$0.13/1M in

Same provider · Similar price

Qwen
67% match

Qwen: Qwen3 VL 32B Instruct

131K ctx$0.10/1M in

Same provider · Similar price

Qwen
67% match

Qwen: Qwen2.5-VL 7B Instruct

33K ctx$0.20/1M in

Same family · Similar price

Qwen
66% match

Qwen: Qwen3 VL 8B Thinking

131K ctx$0.12/1M in

Same provider · Similar price

← Back to all models