OpenAI: GPT-4.1 Mini

Image input File input Text input Text output
Author's Description

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on MultiChallenge, and 84.1% on IFEval. Mini also shows strong coding ability (e.g., 31.6% on Aider’s polyglot diff benchmark) and vision understanding, making it suitable for interactive applications with tight performance constraints.

Key Specifications
Cost
$$$
Context
1M
Parameters
300B (Rumoured)
Released
Apr 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Structured Outputs Response Format Seed Max Tokens Tool Choice Tools
Features

This model supports the following features:

Response Format Tools Structured Outputs
Performance Summary

GPT-4.1 Mini, released on April 14, 2025, is a mid-sized model designed for high performance at reduced latency and cost. It performs among the fastest models, ranking in the 77th percentile for speed, and offers competitive pricing, placing in the 55th percentile. The model demonstrates exceptional reliability with a 100% success rate across all benchmarks, indicating minimal technical failures. In terms of performance, GPT-4.1 Mini exhibits strong capabilities across various domains. It achieves perfect accuracy in Ethics and near-perfect scores in General Knowledge (99.0%), Mathematics (95.0%), Email Classification (99.0%), and Coding (92.0%), often being the most accurate among models of comparable speed or price in these categories. Its instruction following is also robust at 76.4% accuracy. A notable weakness is its hallucination rate, with 70.0% accuracy on the Hallucinations benchmark, suggesting it may not always appropriately acknowledge uncertainty. Despite this, its 1 million token context window and strong coding and vision understanding make it highly suitable for interactive applications requiring tight performance constraints.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.4
Completion $1.6
Input Cache Read $0.1
Web Search $10000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-4.1-mini-2025-04-14 1M $0.4 / 1M tokens $1.6 / 1M tokens
Azure
Azure | openai/gpt-4.1-mini-2025-04-14 1M $0.4 / 1M tokens $1.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai