MetaLlama 4Open SourceJan 30, 2025

Llama 4 Maverick 17b 128e Instruct Fp8

Name: Llama 4 Maverick 17b 128e Instruct Fp8
Price: 0.19999999999999998 USD
Author: Meta

A lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most.

Context Window

1.0M

tokens

Max Output

1.0M

tokens

Released

—

Arena Rank

—

Output Speed

165

tokens/sec

Time to First Token

480ms

TTFT

Capabilities

👁Vision

🧠Reasoning

🔧Tool Calling

⚡Prompt Caching

🖥Computer Use

🎨Image Generation

Pricing Comparison

Router	Input / 1M	Output / 1M	Cached Input / 1M
Requesty★	$0.20	$0.85	$0.20

Benchmarks

Artificial Analysis

Intelligence IndexArtificial Analysis

58/100

Coding IndexArtificial Analysis

52/100

Math IndexArtificial Analysis

62/100

MMLU-PROArtificial Analysis

0.748/1

GPQA DiamondArtificial Analysis

0.585/1

MATH-500Artificial Analysis

0.802/1

AIME 2024Artificial Analysis

0.18/1

LiveCodeBenchArtificial Analysis

0.478/1

Model IDs

Requestynovita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8

Compare with another model

Related Models

Meta#142

Capabilities

Pricing Comparison

Benchmarks

Model IDs

Related Models

Meta: Llama 4 Maverick

Meta: Llama 4 Scout

Meta: Llama 3.1 405B (base)

Meta: Llama 3.3 70B Instruct (free)

NVIDIA: Llama 3.1 Nemotron 70B Instruct

Meta: Llama 3.1 70B Instruct