NVIDIA: Nemotron 3 Nano 30B A3B

Text input Text output Free Option
Author's Description

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

Key Specifications
Cost
$$$
Context
262K
Parameters
30B
Released
Dec 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Min P Response Format Reasoning Temperature Presence Penalty Include Reasoning Tools Frequency Penalty Top P Stop Tool Choice Max Tokens
Features

This model supports the following features:

Response Format Tools Reasoning
Performance Summary

NVIDIA Nemotron 3 Nano 30B A3B, a small language MoE model designed for specialized agentic AI systems, demonstrates a balanced performance profile with notable strengths in reliability and cost-effectiveness. Created on December 14, 2025, with an extensive context length of 262144, this open-weight model offers developers significant customization and deployment flexibility. The model performs among the faster models, ranking in the 64th percentile for speed across benchmarks. It also offers competitive pricing, placing in the 66th percentile for cost-effectiveness. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating consistent and usable responses without technical failures. In terms of specific benchmark performance, Nemotron 3 Nano 30B A3B excels in Coding, achieving an impressive 94.0% accuracy (89th percentile), making it a strong contender for programming-related tasks. Its performance in Ethics is also very strong, with 99.0% accuracy (58th percentile) and the lowest cost per query ($0.0063). While its General Knowledge accuracy at 95.0% is respectable, it falls into the 36th percentile, suggesting it may not be its primary strength compared to other models. The model's cost efficiency is particularly evident in Ethics, and its speed is generally good, though the Coding benchmark showed a longer duration. Overall, Nemotron 3 Nano 30B A3B is a highly reliable and cost-efficient model, particularly strong in coding and ethical reasoning, making it well-suited for developers building specialized AI agents.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.05
Completion $0.2

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | nvidia/nemotron-3-nano-30b-a3b 262K $0.05 / 1M tokens $0.2 / 1M tokens
Chutes
Chutes | nvidia/nemotron-3-nano-30b-a3b 262K $0.05 / 1M tokens $0.2 / 1M tokens
DeepInfra
DeepInfra | nvidia/nemotron-3-nano-30b-a3b 262K $0.05 / 1M tokens $0.2 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by nvidia