Author's Description
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
NVIDIA: Nemotron Nano 9B V2, created on September 5, 2025, is a 9B parameter LLM designed for both reasoning and non-reasoning tasks, capable of generating reasoning traces or direct answers based on system prompt configuration. This model consistently ranks among the fastest, achieving an Infinityth percentile in speed across 8 benchmarks. It also offers competitive pricing, placing in the 65th percentile for cost-effectiveness. Reliability is a significant strength, with a 99% success rate across 8 benchmarks, indicating minimal technical failures. In terms of performance across categories, the model demonstrates exceptional ethical reasoning with a perfect 100% accuracy, notably achieving this while being among the fastest models in this category. It also shows strong capabilities in hallucination avoidance (98.0% accuracy) and general knowledge (98.6% accuracy), though the latter comes with a very long duration. Reasoning tasks are handled effectively with 89.8% accuracy. However, the model exhibits significant weaknesses in instruction following and mathematics, both scoring 0.0% accuracy, suggesting these areas require substantial improvement. Coding performance is moderate at 83.0% accuracy, while email classification is a notable weakness at 94.1% accuracy, placing it in the 27th percentile.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.04 |
| Completion | $0.16 |
Price History
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
|
Nvidia
|
Nvidia | nvidia/nemotron-nano-9b-v2 | 128K | $0.04 / 1M tokens | $0.16 / 1M tokens |
|
Nvidia
|
Nvidia | nvidia/nemotron-nano-9b-v2 | 128K | $0.04 / 1M tokens | $0.16 / 1M tokens |
|
DeepInfra
|
DeepInfra | nvidia/nemotron-nano-9b-v2 | 131K | $0.04 / 1M tokens | $0.16 / 1M tokens |
|
Together
|
Together | nvidia/nemotron-nano-9b-v2 | 131K | $0.04 / 1M tokens | $0.16 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|
Other Models by nvidia
|
|
Released | Params | Context |
|
Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| NVIDIA: Nemotron 3 Super | Mar 11, 2026 | 120B | 262K |
Text input
Text output
|
★★★ | ★★★ | $$$$ |
| NVIDIA: Nemotron 3 Nano 30B A3B | Dec 14, 2025 | 30B | 262K |
Text input
Text output
|
★★★ | ★★★★★ | $$$ |
| NVIDIA: Nemotron Nano 12B 2 VL | Oct 28, 2025 | 12B | 131K |
Text input
Image input
Video input
Text output
|
★ | ★★ | $$$$ |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | Oct 10, 2025 | 49B | 131K |
Text input
Text output
|
★★ | ★★★★ | $$$$ |
| NVIDIA: Llama 3.3 Nemotron Super 49B v1 Unavailable | Apr 08, 2025 | 49B | 131K |
Text input
Text output
|
★★★ | ★★ | $$ |
| NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 | Apr 08, 2025 | 253B | 131K |
Text input
Text output
|
★ | ★★ | $$$$$ |
| NVIDIA: Llama 3.1 Nemotron 70B Instruct | Oct 14, 2024 | 70B | 131K |
Text input
Text output
|
★★★ | ★★ | $$ |