Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool calling. Derived from Meta’s Llama-3.1-405B-Instruct, it was heavily customized via Neural Architecture Search (NAS), yielding better efficiency, lower memory usage, and reduced inference latency. The model supports a context length of up to 128K tokens and runs efficiently on a single 8x NVIDIA H100 node. Note: you must include `detailed thinking on` in the system prompt to enable reasoning. See Usage Recommendations for details.
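The reasoning toggle above can be sketched as follows, assuming an OpenAI-compatible chat-completions request body (the helper function and sampling values are illustrative, not part of the model card):

```python
def build_request(user_prompt: str, reasoning: bool = True) -> dict:
    """Build a chat-completions request body for the model.

    The system prompt toggles reasoning: `detailed thinking on`
    enables it; `detailed thinking off` is assumed here as the
    corresponding disable string.
    """
    system = "detailed thinking on" if reasoning else "detailed thinking off"
    return {
        "model": "nvidia/llama-3.1-nemotron-ultra-253b-v1",
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": user_prompt},
        ],
        # Illustrative sampling parameters; tune to your workload.
        "temperature": 0.6,
        "top_p": 0.95,
    }

body = build_request("Solve 2x + 3 = 7.")
```

The same body can then be POSTed to whichever router or endpoint hosts the model; only the system-prompt convention is specific to Nemotron.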
| Router | Input ($/1M tokens) | Output ($/1M tokens) | Cached Input ($/1M tokens) |
|---|---|---|---|
| OpenRouter | $0.60 | $1.80 | — |
| Martian | $0.60 | $1.80 | — |
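From the per-million-token rates in the table above ($0.60 input / $1.80 output on both routers listed), request cost can be estimated with a small helper (the function name is illustrative):

```python
# Rates from the pricing table, in USD per 1M tokens.
INPUT_PER_M = 0.60
OUTPUT_PER_M = 1.80

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000

# Example: a 10k-token prompt with a 2k-token completion.
cost = estimate_cost(10_000, 2_000)  # 0.006 + 0.0036 = 0.0096 USD
```

Cached-input pricing is not listed for either router, so it is omitted here.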
Model ID: `nvidia/llama-3.1-nemotron-ultra-253b-v1`