NVIDIANemotronOpen SourceSep 5, 2025
NVIDIA: Nemotron Nano 9B V2 (free)
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.
Context Window
128K
tokens
Max Output
—
tokens
Released
Sep 5, 2025
Arena Rank
—
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
Reasoning
Extended thinking
Response Format
JSON mode / structured output
Seed
Deterministic outputs
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top P
Nucleus sampling
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| OpenRouter | Free | Free | — |
| Vercel AI | $0.04 | $0.16 | — |
| Martian | $0.04 | $0.16 | — |
Model IDs
OpenRouter
nvidia/nemotron-nano-9b-v2:freeHugging Facenvidia/NVIDIA-Nemotron-Nano-9B-v2 ↗
Tags
reasoningtool-calling