OpenAI: o3 Mini

File input Text input Text output
Author's Description

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to "high", "medium", or "low" to control the thinking time of the model. The default is "medium". OpenRouter also offers the model slug `openai/o3-mini-high` to default the parameter to "high". The model features three adjustable reasoning effort levels and supports key developer capabilities including function calling, structured outputs, and streaming, though it does not include vision processing capabilities. The model demonstrates significant improvements over its predecessor, with expert testers preferring its responses 56% of the time and noting a 39% reduction in major errors on complex questions. With medium reasoning effort settings, o3-mini matches the performance of the larger o1 model on challenging reasoning evaluations like AIME and GPQA, while maintaining lower latency and cost.

Key Specifications
Cost
$$$$$
Context
200K
Parameters
200B
Released
Jan 31, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Response Format Structured Outputs Tools Max Tokens Seed
Features

This model supports the following features:

Response Format Tools Structured Outputs
Performance Summary

OpenAI o3-mini, released on January 31, 2025, is a cost-efficient language model specifically optimized for STEM reasoning tasks, including science, mathematics, and coding. The model demonstrates moderate speed performance, ranking in the 36th percentile across benchmarks, and is positioned at premium pricing levels, falling into the 18th percentile. Notably, o3-mini exhibits exceptional reliability with a 100% success rate, indicating minimal technical failures. Its key strengths lie in its reasoning capabilities, achieving 98.0% accuracy (93rd percentile) on the Reasoning benchmark, and strong performance in General Knowledge (99.5% accuracy, 78th percentile) and Coding (93.0% accuracy, 87th percentile). These results align with its stated optimization for STEM tasks. The model also performs well in Ethics and Hallucinations, showing a strong ability to acknowledge uncertainty. A notable weakness is its Instruction Following accuracy, which is 59.6% (64th percentile), suggesting room for improvement in handling highly complex, multi-layered instructions. The model supports adjustable reasoning effort levels ("high," "medium," "low"), function calling, structured outputs, and streaming, enhancing its developer utility. With medium reasoning effort, it matches the performance of the larger o1 model on challenging reasoning evaluations while offering lower latency and cost.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.1
Completion $4.4
Input Cache Read $0.55

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/o3-mini-2025-01-31 200K $1.1 / 1M tokens $4.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai