DeepSeekDeepSeek R1Open SourceJan 30, 2025

Deepseek R1 Distill Qwen 14b

Name: Deepseek R1 Distill Qwen 14b
Price: 0.15 USD
Author: DeepSeek

DeepSeek R1 Distill Qwen 14B is a distilled large language model based on Qwen 2.5 14B, using outputs from DeepSeek R1. It outperforms OpenAI's o1-mini across various benchmarks, achieving new state-of-the-art results for dense models. Other benchmark results include: AIME 2024 pass@1: 69.7 MATH-500 pass@1: 93.9 CodeForces Rating: 1481 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

Context Window

128K

tokens

Max Output

—

tokens

Released

—

Arena Rank

—

Output Speed

135

tokens/sec

Time to First Token

4.8s

TTFT

Capabilities

👁Vision

🧠Reasoning

🔧Tool Calling

⚡Prompt Caching

🖥Computer Use

🎨Image Generation

Pricing Comparison

Router	Input / 1M	Output / 1M	Cached Input / 1M
Requesty★	$0.15	$0.15	$0.15

Benchmarks

Artificial Analysis

Intelligence IndexArtificial Analysis

70/100

Coding IndexArtificial Analysis

62/100

Math IndexArtificial Analysis

94/100

MMLU-PROArtificial Analysis

0.798/1

GPQA DiamondArtificial Analysis

0.718/1

MATH-500Artificial Analysis

0.972/1

AIME 2024Artificial Analysis

0.897/1

Humanity's Last ExamArtificial Analysis

0.088/1

LiveCodeBenchArtificial Analysis

0.655/1

SciCodeArtificial Analysis

0.408/1

Model IDs

Requestynovita/deepseek/deepseek-r1-distill-qwen-14b

Related Models

DeepSeek#42

Capabilities

Pricing Comparison

Benchmarks

Model IDs

Tags

Related Models

DeepSeek: R1 0528 (free)

DeepSeek: R1

DeepSeek R1 Distill Llama 70B

DeepSeek R1

DeepSeek R1 0528

DeepSeek R1