MetaLlama 3.1Open SourceArena #178Oct 15, 2024

NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging Llama 3.1 70B architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to Meta's Acceptable Use Policy.

Context Window
131K
tokens
Max Output
16K
tokens
Released
Oct 15, 2024
Arena Rank
#178
of 305 models

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Frequency Penalty
Reduce repetition
Max Tokens
Output length limit
min_p
Presence Penalty
Encourage new topics
Repetition Penalty
Penalize repeated tokens
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouter$1.20$1.20
Martian$1.20$1.20
DeepInfra$1.20$1.20

Benchmarks

Open LLM Leaderboard
AverageOpen LLM Leaderboard
36.91/100
IFEvalOpen LLM Leaderboard
73.81/100
BBHOpen LLM Leaderboard
47.11/100
MATH Lvl 5Open LLM Leaderboard
42.67/100
GPQAOpen LLM Leaderboard
1.12/100
MUSROpen LLM Leaderboard
13.2/100
MMLU-PROOpen LLM Leaderboard
43.54/100

Model IDs

OpenRouternvidia/llama-3.1-nemotron-70b-instruct

Tags

tool-calling
Compare with another model

Related Models