NVIDIA: Nemotron Nano 9B V2

Text input Text output Free Option
Author's Description

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

Key Specifications
Cost
$
Context
128K
Parameters
9B
Released
Sep 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Structured Outputs Response Format Reasoning Tool Choice Include Reasoning
Features

This model supports the following features:

Structured Outputs Response Format Tools Reasoning
Performance Summary

NVIDIA: Nemotron Nano 9B V2, created on September 5, 2025, is a 9B parameter LLM designed for both reasoning and non-reasoning tasks, capable of generating reasoning traces or direct answers based on system prompt configuration. This model consistently ranks among the fastest, achieving an Infinityth percentile in speed across 8 benchmarks. It also offers competitive pricing, placing in the 65th percentile for cost-effectiveness. Reliability is a significant strength, with a 99% success rate across 8 benchmarks, indicating minimal technical failures. In terms of performance across categories, the model demonstrates exceptional ethical reasoning with a perfect 100% accuracy, notably achieving this while being among the fastest models in this category. It also shows strong capabilities in hallucination avoidance (98.0% accuracy) and general knowledge (98.6% accuracy), though the latter comes with a very long duration. Reasoning tasks are handled effectively with 89.8% accuracy. However, the model exhibits significant weaknesses in instruction following and mathematics, both scoring 0.0% accuracy, suggesting these areas require substantial improvement. Coding performance is moderate at 83.0% accuracy, while email classification is a notable weakness at 94.1% accuracy, placing it in the 27th percentile.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.04
Completion $0.16

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Nvidia
Nvidia | nvidia/nemotron-nano-9b-v2 128K $0.04 / 1M tokens $0.16 / 1M tokens
Nvidia
Nvidia | nvidia/nemotron-nano-9b-v2 128K $0.04 / 1M tokens $0.16 / 1M tokens
DeepInfra
DeepInfra | nvidia/nemotron-nano-9b-v2 131K $0.04 / 1M tokens $0.16 / 1M tokens
Together
Together | nvidia/nemotron-nano-9b-v2 131K $0.04 / 1M tokens $0.16 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by nvidia