OpenAI: GPT-5.1

File input Image input Text input Text output
Author's Description

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. The model produces clearer, more grounded explanations with reduced jargon, making it easier to follow even on technical or multi-step problems. Built for broad task coverage, GPT-5.1 delivers consistent gains across math, coding, and structured analysis workloads, with more coherent long-form answers and improved tool-use reliability. It also features refined conversational alignment, enabling warmer, more intuitive responses without compromising precision. GPT-5.1 serves as the primary full-capability successor to GPT-5

Key Specifications
Cost
$$$$$
Context
400K
Released
Nov 13, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Response Format Structured Outputs Tool Choice Seed Max Tokens Reasoning Include Reasoning
Features

This model supports the following features:

Reasoning Structured Outputs Tools Response Format
Performance Summary

GPT-5.1 demonstrates competitive response times, performing among the faster models with a 60th percentile speed ranking across various benchmarks. However, it is positioned at a premium pricing level, ranking in the 17th percentile for cost-effectiveness. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all evaluated benchmarks, indicating robust and consistent operation. The model exhibits strong performance across a broad range of tasks. It achieves perfect accuracy in Hallucinations, General Knowledge, and Ethics, often being the most accurate model at its price point and speed. Instruction Following and Reasoning also show high accuracy at 83% and 96% respectively, with Reasoning being particularly fast. Coding performance is robust at 94% accuracy. While Mathematics scores 92%, it is notable for its ability to tackle complex problems. Email Classification, at 98% accuracy, is solid but not a top-tier performance compared to other categories. Key strengths include its perfect accuracy in critical areas like hallucination prevention and general knowledge, coupled with strong reasoning and coding capabilities. Its primary weakness lies in its premium pricing, which may limit accessibility for some use cases despite its high performance.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.25
Completion $10
Input Cache Read $0.125
Web Search $10000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-5.1-20251113 400K $1.25 / 1M tokens $10 / 1M tokens
Azure
Azure | openai/gpt-5.1-20251113 400K $1.25 / 1M tokens $10 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai