Author's Description
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
NVIDIA Nemotron 3 Nano 30B A3B, a small language MoE model designed for specialized agentic AI systems, demonstrates a balanced performance profile with notable strengths in reliability and cost-effectiveness. Created on December 14, 2025, with an extensive context length of 262144, this open-weight model offers developers significant customization and deployment flexibility. The model performs among the faster models, ranking in the 64th percentile for speed across benchmarks. It also offers competitive pricing, placing in the 66th percentile for cost-effectiveness. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating consistent and usable responses without technical failures. In terms of specific benchmark performance, Nemotron 3 Nano 30B A3B excels in Coding, achieving an impressive 94.0% accuracy (89th percentile), making it a strong contender for programming-related tasks. Its performance in Ethics is also very strong, with 99.0% accuracy (58th percentile) and the lowest cost per query ($0.0063). While its General Knowledge accuracy at 95.0% is respectable, it falls into the 36th percentile, suggesting it may not be its primary strength compared to other models. The model's cost efficiency is particularly evident in Ethics, and its speed is generally good, though the Coding benchmark showed a longer duration. Overall, Nemotron 3 Nano 30B A3B is a highly reliable and cost-efficient model, particularly strong in coding and ethical reasoning, making it well-suited for developers building specialized AI agents.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.05 |
| Completion | $0.2 |
Price History
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
|
DeepInfra
|
DeepInfra | nvidia/nemotron-3-nano-30b-a3b | 262K | $0.05 / 1M tokens | $0.2 / 1M tokens |
|
Chutes
|
Chutes | nvidia/nemotron-3-nano-30b-a3b | 262K | $0.05 / 1M tokens | $0.2 / 1M tokens |
|
DeepInfra
|
DeepInfra | nvidia/nemotron-3-nano-30b-a3b | 262K | $0.05 / 1M tokens | $0.2 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|
Other Models by nvidia
|
|
Released | Params | Context |
|
Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| NVIDIA: Nemotron 3 Super | Mar 11, 2026 | 120B | 262K |
Text input
Text output
|
★★★ | ★★★ | $$$$ |
| NVIDIA: Nemotron Nano 12B 2 VL | Oct 28, 2025 | 12B | 131K |
Text input
Image input
Video input
Text output
|
★ | ★★ | $$$$ |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | Oct 10, 2025 | 49B | 131K |
Text input
Text output
|
★★ | ★★★★ | $$$$ |
| NVIDIA: Nemotron Nano 9B V2 | Sep 05, 2025 | 9B | 128K |
Text input
Text output
|
★ | ★★ | $ |
| NVIDIA: Llama 3.3 Nemotron Super 49B v1 Unavailable | Apr 08, 2025 | 49B | 131K |
Text input
Text output
|
★★★ | ★★ | $$ |
| NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 | Apr 08, 2025 | 253B | 131K |
Text input
Text output
|
★ | ★★ | $$$$$ |
| NVIDIA: Llama 3.1 Nemotron 70B Instruct | Oct 14, 2024 | 70B | 131K |
Text input
Text output
|
★★★ | ★★ | $$ |