NVIDIANemotronOpen SourceSep 5, 2025

NVIDIA: Nemotron Nano 9B V2 (free)

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Context Window
128K
tokens
Max Output
tokens
Released
Sep 5, 2025
Arena Rank

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
Reasoning
Extended thinking
Response Format
JSON mode / structured output
Seed
Deterministic outputs
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouterFreeFree
Vercel AI$0.04$0.16
Martian$0.04$0.16

Model IDs

OpenRouternvidia/nemotron-nano-9b-v2:free

Tags

reasoningtool-calling
Compare with another model

Related Models