OpenAI: GPT-4.1 Nano

File input Text input Image input Text output
Author's Description

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.

Key Specifications
Cost
$$
Context
1M
Parameters
50B (Rumoured)
Released
Apr 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Top P Logit Bias Temperature Logprobs Presence Penalty Stop Response Format Structured Outputs Tools Max Tokens Frequency Penalty Top Logprobs Seed
Features

This model supports the following features:

Response Format Tools Structured Outputs
Performance Summary

GPT-4.1 Nano, released on April 14, 2025, is designed for low-latency tasks, offering a 1 million token context window. It consistently ranks among the fastest models, performing in the 82nd percentile across seven benchmarks, and offers competitive pricing, typically in the 78th percentile. Notably, it demonstrates exceptional reliability with a 100% success rate across all benchmarks, indicating consistent and usable responses. The model exhibits strong performance in specific areas. It achieves perfect accuracy in the Ethics benchmark, standing out as the most accurate model at its price point and among models of similar speed. It also shows high accuracy in Hallucinations Baseline (96.0%) and General Knowledge (96.5%), though its General Knowledge accuracy is mid-range. A key strength is its Coding performance, achieving 84.0% accuracy and being the most accurate among models of comparable speed. However, its Email Classification accuracy is relatively low at 92.0% (21st percentile), and its Instruction Following (55.5%) and Reasoning (56.0%) capabilities are moderate. Overall, GPT-4.1 Nano is a cost-effective and reliable choice for tasks requiring speed and a large context window, particularly excelling in ethical reasoning and coding, while offering solid general knowledge and hallucination mitigation.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.4
Input Cache Read $0.025
Web Search $10000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-4.1-nano-2025-04-14 1M $0.1 / 1M tokens $0.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai