NVIDIA: Nemotron 3 Nano 30B A3B

Text input Text output Free Option
Author's Description

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully open with open-weights, datasets and recipes so developers can easily customize, optimize, and deploy the model on their infrastructure for maximum privacy and security. Note: For the free endpoint, all prompts and output are logged to improve the provider's model and its product and services. Please do not upload any personal, confidential, or otherwise sensitive information. This is a trial use only. Do not use for production or business-critical systems.

Key Specifications
Cost
$$$
Context
262K
Parameters
30B
Released
Dec 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Reasoning Frequency Penalty Include Reasoning Top P Seed Tool Choice Min P Temperature Stop Response Format Max Tokens Tools Presence Penalty
Features

This model supports the following features:

Response Format Reasoning Tools
Performance Summary

NVIDIA Nemotron 3 Nano 30B A3B, a small language MoE model, demonstrates a balanced performance profile with notable strengths in reliability and specific domains. It performs among the faster models, ranking in the 63rd percentile for speed across benchmarks, and offers competitive pricing, placing in the 64th percentile. A standout feature is its exceptional reliability, achieving a 100% success rate across all evaluated benchmarks, indicating consistent and dependable operation. In terms of specific performance, the model excels in Coding, achieving 94.0% accuracy (90th percentile), suggesting strong capabilities for developers building specialized agentic AI systems. Its Ethics performance is also very strong at 99.0% accuracy (59th percentile), coupled with the lowest cost and fastest duration among the benchmarks. However, its General Knowledge accuracy is a relative weakness at 95.0% (38th percentile), indicating that while acceptable, it may not be its primary strength. The model's open nature, with open-weights, datasets, and recipes, further enhances its appeal for customization and deployment.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.06
Completion $0.24

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | nvidia/nemotron-3-nano-30b-a3b 262K $0.06 / 1M tokens $0.24 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by nvidia