LLM Router
HomeRoutersModelsProvidersBenchmarksPricingCompareBlogAbout
HomeModelsBenchmarksPricingCompareBlog
LLM Router

Independent comparison platform for LLM routing infrastructure.

Platform

  • Home
  • Routers
  • Models
  • Pricing
  • Blog
  • About

Routers

  • Requesty
  • OpenRouter
  • Martian
  • Unify
  • LiteLLM

© 2026 LLM Router

Data from public sources. May not reflect real-time pricing.

MetaLlama 3.1Open SourceApr 8, 2025

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural Architecture Search (NAS), resulting in enhanced efficiency, reduced memory usage, and improved inference latency. The model supports a context length of up to 128K tokens and can operate efficiently on an 8x NVIDIA H100 node. Note: you must include `detailed thinking on` in the system prompt to enable reasoning. Please see Usage Recommendations for more.

Context Window
131K
tokens
Max Output
—
tokens
Released
Apr 8, 2025
Arena Rank
—

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Frequency Penalty
Reduce repetition
Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
Presence Penalty
Encourage new topics
Reasoning
Extended thinking
Repetition Penalty
Penalize repeated tokens
Response Format
JSON mode / structured output
structured_outputs
Temperature
Controls randomness
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouter$0.60$1.80—
Martian$0.60$1.80—

Model IDs

OpenRouternvidia/llama-3.1-nemotron-ultra-253b-v1
Hugging Facenvidia/Llama-3_1-Nemotron-Ultra-253B-v1 ↗

Tags

reasoning
Compare with another model

Compare with…

Meta Llama 3.1 405B InstructAionLabs: Aion-RP 1.0 (8B)Sao10K: Llama 3.3 Euryale 70B

Similar Models

Ranked by provider, pricing, capabilities, and arena performance

Meta
75% match

Meta Llama 3.1 405B Instruct

131K ctx$0.80/1M in

Same family · Similar price

Meta
75% match

AionLabs: Aion-RP 1.0 (8B)

33K ctx$0.80/1M in

Same family · Similar price

Meta
69% match

Sao10K: Llama 3.3 Euryale 70B

131K ctx$0.65/1M in

Same provider · Similar price

Meta
69% match

Meta Llama 3.1 70B Instruct Turbo

131K ctx$0.88/1M in

Same family · Similar price

Meta
66% match

Llama 3 70b Instruct

8K ctx$0.51/1M in

Same provider · Similar price

Meta
65% match

Sao10K: Llama 3.1 Euryale 70B v2.2

33K ctx$0.65/1M in

Same provider · Similar price

← Back to all models