LLM Router

Independent comparison platform for LLM routing infrastructure.

© 2026 LLM Router

Data from public sources. May not reflect real-time pricing.

Moonshot AI

Moonshot AI (月之暗面) develops the Kimi series of AI models, known for very long context windows. The models are built to process and reason over long documents, which suits research and analysis tasks.

Pricing available from Requesty, OpenRouter, Vercel AI, Martian, DeepInfra.

Total Models: 8
Arena Ranked: 3 of 8
Open Source: 0
Cheapest Input: $0.39 per 1M tokens

Pricing Summary (per 1M tokens)

Metric          Input   Output
Cheapest        $0.39   $1.75
Average         $0.61   $2.52
Most Expensive  $1.20   $5.00

⚙ Capabilities

👁 Vision: 1 of 8 models
🧠 Reasoning: 1 of 8 models
🔧 Tool Calling: 8 of 8 models
⚡ Prompt Caching: 2 of 8 models
🖥 Computer Use: 0 of 8 models
🎨 Image Generation: 0 of 8 models

🤖 All Moonshot AI Models (8)

Moonshot AI · Kimi K2 · Arena rank #31

Kimi K2 Thinking

A thinking model with general agentic and reasoning capabilities, specializing in deep reasoning tasks.

Context: 131K
Max Output: not listed
Input/1M: $0.40
Capabilities: 🔧 Tools

Pricing (per 1M tokens, input / output)
  Requesty ★   $0.60 / $2.50
  Vercel AI    $0.47 / $2.00
  Martian      $0.40 / $1.75
  DeepInfra    $0.47 / $2.00

2025-11-06
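
Per-1M-token prices only become comparable once they are applied to a concrete workload. The sketch below turns the Kimi K2 Thinking prices listed above into a per-request cost and picks the lowest-cost router. The function names and the token counts (100,000 input, 2,000 output) are hypothetical, chosen only for illustration, and the prices are the listed values, which may not reflect real-time pricing.

```python
# Per-1M-token prices (input, output) in USD, copied from the
# Kimi K2 Thinking listing. They may be out of date.
PRICES = {
    "Requesty": (0.60, 2.50),
    "Vercel AI": (0.47, 2.00),
    "Martian": (0.40, 1.75),
    "DeepInfra": (0.47, 2.00),
}


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the USD cost of one request given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_router(prices, input_tokens, output_tokens):
    """Return (router, cost) with the lowest cost for the given token counts."""
    costs = {
        name: request_cost(input_tokens, output_tokens, p_in, p_out)
        for name, (p_in, p_out) in prices.items()
    }
    name = min(costs, key=costs.get)
    return name, costs[name]


# Hypothetical workload: a long document in, a short answer out.
router, cost = cheapest_router(PRICES, 100_000, 2_000)
print(f"{router}: ${cost:.4f}")  # Martian: $0.0435
```

Because input and output are priced separately, the ranking can change with the input-to-output ratio when one router is cheaper on input and another on output, so it is worth rerunning the comparison with token counts that match the real workload.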
Moonshot AI · Kimi K2 · Arena rank #45

Kimi K2 0905 Preview

Kimi K2 0905 is the September update of Kimi K2 0711 (moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It supports long-context inference up to 256k tokens, extended from the previous 128k. This update improves agentic coding with higher accuracy and better generalization across scaffolds, and enhances frontend coding with more aesthetic and functional outputs for web, 3D, and related tasks. Kimi K2 is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. It excels across coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) benchmarks. The model is trained with a novel stack incorporating the MuonClip optimizer for stable large-scale MoE training.

Context: 262K
Max Output: 262K
Input/1M: $0.39
Capabilities: 🔧 Tools

Pricing (per 1M tokens, input / output)
  Requesty ★   $0.60 / $2.50
  OpenRouter   $0.39 / $1.90
  Vercel AI    $0.60 / $2.50
  Martian      $0.39 / $1.90

2025-09-04
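
The description cites 256k tokens while the card lists a 262K context. The two figures agree if "256k" is read as 256 × 1024 tokens, which is an assumption here and not something the listing states. The sketch below shows that arithmetic and a simple budget check. The helper names are hypothetical, and the check assumes the prompt and the generated output share one context window.

```python
def tokens_from_kibi(k):
    """Convert a size quoted in units of 1024 tokens to a token count."""
    return k * 1024


def fits_in_context(prompt_tokens, max_new_tokens, context_window):
    """Check that prompt plus requested output fits in the context window.

    Assumes input and output draw on the same window.
    """
    return prompt_tokens + max_new_tokens <= context_window


CONTEXT_256K = tokens_from_kibi(256)  # 262144, shown as 262K
CONTEXT_128K = tokens_from_kibi(128)  # 131072, shown as 131K

print(CONTEXT_256K, CONTEXT_128K)
print(fits_in_context(250_000, 8_000, CONTEXT_256K))   # True: 258000 <= 262144
print(fits_in_context(250_000, 16_000, CONTEXT_256K))  # False: 266000 > 262144
```

The same conversion accounts for the 131K context shown on the 128K-class models in this list.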
Moonshot AI · Kimi K2 · Arena rank #47

Kimi K2 0711 Preview

A Mixture-of-Experts (MoE) foundation model with exceptional coding and agent capabilities, featuring 1 trillion total parameters and 32 billion activated parameters. In benchmark evaluations covering general knowledge reasoning, programming, mathematics, and agent-related tasks, the K2 model outperforms other leading open-source models.

Context: 131K
Max Output: not listed
Input/1M: $0.60
Capabilities: 🔧 Tools

Pricing (per 1M tokens, input / output)
  Requesty ★   $0.60 / $2.50
Moonshot AI · Kimi K2.5

Kimi K2.5

Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed visual and text tokens, it delivers strong performance in general reasoning, visual coding, and agentic tool-calling.

Context: 262K
Max Output: 262K
Input/1M: $0.45
Capabilities: 👁 Vision, 🧠 Reasoning, 🔧 Tools, ⚡ Cache

Pricing (per 1M tokens, input / output)
  Requesty ★   $0.60 / $3.00
  OpenRouter   $0.45 / $2.25
  Vercel AI    $0.50 / $2.80
  Martian      $0.45 / $2.25
  DeepInfra    $0.45 / $2.25

2026-01-27
Moonshot AI · Kimi K2

Kimi K2 Turbo Preview

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. Kimi K2 excels across a broad range of benchmarks, particularly in coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) tasks. It supports long-context inference up to 128K tokens and is designed with a novel training stack that includes the MuonClip optimizer for stable large-scale MoE training.

Context: 131K
Max Output: not listed
Input/1M: $0.50
Capabilities: 🔧 Tools

Pricing (per 1M tokens, input / output)
  Requesty ★   $1.20 / $5.00
  OpenRouter   $0.50 / $2.40
  Vercel AI    $0.50 / $2.00

2025-07-11
Moonshot AI · Kimi K2

Kimi K2 Instruct

 

Context: 131K
Max Output: 16K
Input/1M: $1.00
Capabilities: 🔧 Tools

Pricing (per 1M tokens, input / output)
  Requesty ★   $1.00 / $3.00
Moonshot AI · Kimi K2

Parasail Kimi K2 Instruct

 

Context: 131K
Max Output: 16K
Input/1M: $0.99
Capabilities: 🔧 Tools

Pricing (per 1M tokens, input / output)
  Requesty ★   $0.99 / $2.99
Moonshot AI · Kimi K2

Kimi K2 Instruct 0905

Moonshot AI’s cutting‑edge model, moonshotai/Kimi-K2-Instruct-0905, is now live on GroqCloud.

Context: 256K
Max Output: 16K
Input/1M: $1.00
Capabilities: 🔧 Tools, ⚡ Cache

Pricing (per 1M tokens, input / output)
  Requesty ★   $1.00 / $3.00