Author's Description
NVIDIA Nemotron 3 Nano 30B A3B is a small mixture-of-experts (MoE) language model built for high compute efficiency and accuracy, aimed at developers building specialized agentic AI systems. The model is fully open, with open weights, datasets, and recipes, so developers can customize, optimize, and deploy it on their own infrastructure for maximum privacy and security.
Note: For the free endpoint, all prompts and outputs are logged to improve the provider's models, products, and services. Do not upload any personal, confidential, or otherwise sensitive information. The free endpoint is for trial use only. Do not use it for production or business-critical systems.
Performance Summary
NVIDIA Nemotron 3 Nano 30B A3B is somewhat above the middle of the field on speed and price: it ranks in the 63rd percentile for speed across benchmarks and in the 64th percentile for pricing. It achieved a 100% success rate across all evaluated benchmarks.
By category, the model is strongest in Coding, with 94.0% accuracy (90th percentile), which fits its positioning for developers building specialized agentic AI systems. In Ethics it reaches 99.0% accuracy, which is high in absolute terms but only the 59th percentile relative to other models; this benchmark also had the lowest cost and the shortest duration of those the model was run on. General Knowledge is a relative weakness: 95.0% accuracy places it in the 38th percentile, below the median.
The model is open, with open weights, datasets, and recipes available for customization and deployment.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.06 |
| Completion | $0.24 |
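As a rough illustration of how the per-1M-token prices above translate into the cost of a single request, here is a minimal sketch. The helper name `estimate_cost` and the example token counts are hypothetical and not part of any provider API; the only figures taken from this page are the two prices.

```python
from decimal import Decimal

# Prices from the table above, in USD per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.06")
COMPLETION_PRICE_PER_M = Decimal("0.24")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(prompt_tokens) * PROMPT_PRICE_PER_M
        + Decimal(completion_tokens) * COMPLETION_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Example: a request with 10,000 prompt tokens and 2,000 completion tokens.
print(estimate_cost(10_000, 2_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding error when summed over many requests.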
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
| DeepInfra | nvidia/nemotron-3-nano-30b-a3b | 262K | $0.06 / 1M tokens | $0.24 / 1M tokens |
Other Models by nvidia
| Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| NVIDIA: Nemotron Nano 12B 2 VL | Oct 28, 2025 | 12B | 131K | Text, image, video input; text output | ★ | ★★ | $$$$ |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | Oct 10, 2025 | 49B | 131K | Text input; text output | ★ | ★★★★ | $$$$ |
| NVIDIA: Nemotron Nano 9B V2 | Sep 05, 2025 | 9B | 128K | Text input; text output | ★ | ★★ | $ |
| NVIDIA: Llama 3.3 Nemotron Super 49B v1 (unavailable) | Apr 08, 2025 | 49B | 131K | Text input; text output | ★★★ | ★★ | $$ |
| NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 | Apr 08, 2025 | 253B | 131K | Text input; text output | ★ | ★★ | $$$$$ |
| NVIDIA: Llama 3.1 Nemotron 70B Instruct | Oct 14, 2024 | 70B | 131K | Text input; text output | ★★★ | ★★ | $$$ |