NVIDIA: Nemotron 3 Nano 30B A3B

Name: NVIDIA: Nemotron 3 Nano 30B A3B
Brand: nvidia
Price: 5e-8 USD
Availability: InStock
Rating: 4.6 (9 reviews)

Back

Text input Text output

Author's Description

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

Key Specifications

Cost

$$$

Context

262K

Parameters

30B

Released

Dec 14, 2025

Speed

★★★

Ability

★★★★★

Reliability

★★★

Hugging Face

Supported Parameters

This model supports the following parameters:

Stop Max Tokens Seed Reasoning Top P Frequency Penalty Presence Penalty Temperature Include Reasoning Tool Choice Tools Response Format Min P

Features

This model supports the following features:

Response Format Tools Reasoning

Performance Summary

NVIDIA Nemotron 3 Nano 30B A3B, a small language MoE model designed for specialized agentic AI systems, demonstrates a balanced performance profile with notable strengths in reliability and cost-effectiveness. Created on December 14, 2025, with an extensive context length of 262144, this open-weight model offers developers significant customization and deployment flexibility. The model performs among the faster models, ranking in the 64th percentile for speed across benchmarks. It also offers competitive pricing, placing in the 66th percentile for cost-effectiveness. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating consistent and usable responses without technical failures. In terms of specific benchmark performance, Nemotron 3 Nano 30B A3B excels in Coding, achieving an impressive 94.0% accuracy (89th percentile), making it a strong contender for programming-related tasks. Its performance in Ethics is also very strong, with 99.0% accuracy (58th percentile) and the lowest cost per query ($0.0063). While its General Knowledge accuracy at 95.0% is respectable, it falls into the 36th percentile, suggesting it may not be its primary strength compared to other models. The model's cost efficiency is particularly evident in Ethics, and its speed is generally good, though the Coding benchmark showed a longer duration. Overall, Nemotron 3 Nano 30B A3B is a highly reliable and cost-efficient model, particularly strong in coding and ethical reasoning, making it well-suited for developers building specialized AI agents.

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.05
Completion	$0.2

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
DeepInfra	DeepInfra \| nvidia/nemotron-3-nano-30b-a3b	262K	$0.05 / 1M tokens	$0.2 / 1M tokens
Chutes	Chutes \| nvidia/nemotron-3-nano-30b-a3b	262K	$0.05 / 1M tokens	$0.2 / 1M tokens
DeepInfra	DeepInfra \| nvidia/nemotron-3-nano-30b-a3b	262K	$0.05 / 1M tokens	$0.2 / 1M tokens
Ambient	Ambient \| nvidia/nemotron-3-nano-30b-a3b	262K	$0.05 / 1M tokens	$0.2 / 1M tokens
Novita	Novita \| nvidia/nemotron-3-nano-30b-a3b	262K	$0.05 / 1M tokens	$0.2 / 1M tokens
Nebius	Nebius \| nvidia/nemotron-3-nano-30b-a3b	262K	$0.06 / 1M tokens	$0.24 / 1M tokens

Benchmark Results

Benchmark	Category	Reasoning	Strategy	Free	Executions	Accuracy	Cost	Duration

Other Models by nvidia

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
NVIDIA: Nemotron 3.5 Content Safety (free) Unavailable	Jun 04, 2026	~4B	N/A	Image input Text input Text output	—	—	—
NVIDIA: Nemotron 3 Ultra	Jun 03, 2026	550B	512K	Text input Text output	★	★	$$$$
NVIDIA: Nemotron 3 Nano Omni (free) Unavailable	Apr 28, 2026	30B	N/A	Image input Audio input Text input Video input Text output	—	—	—
NVIDIA: Nemotron 3 Super	Mar 11, 2026	120B	262K	Text input Text output	★★★	★★★	$$$$
NVIDIA: Nemotron Nano 12B 2 VL (free)	Oct 28, 2025	12B	131K	Image input Text input Video input Text output	★	★★	$$$$
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5	Oct 10, 2025	49B	131K	Text input Text output	★★	★★★★	$$$$
NVIDIA: Nemotron Nano 9B V2 (free)	Sep 05, 2025	9B	128K	Text input Text output	★	★★	$
NVIDIA: Llama 3.3 Nemotron Super 49B v1 Unavailable	Apr 08, 2025	49B	131K	Text input Text output	★★★★	★★	$$
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 Unavailable	Apr 08, 2025	253B	131K	Text input Text output	★	★★	$$$$
NVIDIA: Llama 3.1 Nemotron 70B Instruct Unavailable	Oct 14, 2024	70B	131K	Text input Text output	★★★	★★	$$