OpenAI: GPT-4o-mini (2024-07-18)

Image input File input Text input Text output
Author's Description

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable than other recent frontier models, and more than 60% cheaper than [GPT-3.5 Turbo](/models/openai/gpt-3.5-turbo). It maintains SOTA intelligence, while being significantly more cost-effective. GPT-4o mini achieves an 82% score on MMLU and presently ranks higher than GPT-4 on chat preferences [common leaderboards](https://arena.lmsys.org/). Check out the [launch announcement](https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/) to learn more. #multimodal

Key Specifications
Cost
$$
Context
128K
Parameters
8B (Rumoured)
Released
Jul 17, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Logit Bias Tool Choice Seed Top P Top Logprobs Temperature Response Format Logprobs Max Tokens Presence Penalty Structured Outputs Tools Frequency Penalty Stop
Features

This model supports the following features:

Structured Outputs Response Format Tools
Performance Summary

GPT-4o-mini (2024-07-18) demonstrates a strong overall performance profile, positioning itself as a highly competitive and cost-effective AI model. It performs among the fastest models, consistently ranking in the top tier for speed (78th percentile). The model also offers competitive pricing, typically providing cost-effective solutions (73rd percentile). Notably, GPT-4o-mini exhibits exceptional reliability, achieving a perfect 100th percentile across all benchmarks, indicating minimal technical failures and consistent evaluable responses. In terms of benchmark performance, GPT-4o-mini excels in several areas. It achieved perfect accuracy in the Ethics (Baseline) benchmark, standing out as the most accurate model at its price point and among models of similar speed. Its General Knowledge (Baseline) performance was also very strong at 99.0% accuracy. The model showed high accuracy in Coding (Baseline) at 87.0%, being the most accurate among models of comparable speed. While its Email Classification (Baseline) accuracy was solid at 98.0%, its Reasoning (Baseline) score of 56.0% indicates a relative weakness in complex multi-step problem-solving compared to its other strengths. Instruction Following (Baseline) was moderate at 61.0%. Overall, GPT-4o-mini's key strengths lie in its ethical alignment, general knowledge, coding proficiency, and remarkable reliability, all delivered with impressive speed and cost-efficiency.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.15
Completion $0.6
Input Cache Read $0.075

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-4o-mini-2024-07-18 128K $0.15 / 1M tokens $0.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by openai