OpenAI: GPT-5.1-Codex-Max

Image input Text input Text output
Author's Description

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic workflows spanning software engineering, mathematics, and research. GPT-5.1-Codex-Max delivers faster performance, improved reasoning, and higher token efficiency across the development lifecycle.

Key Specifications
Cost
$$$$$
Context
400K
Released
Dec 04, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Seed Tools Reasoning Max Tokens Structured Outputs Response Format Tool Choice
Features

This model supports the following features:

Response Format Tools Structured Outputs Reasoning
Performance Summary

GPT-5.1-Codex-Max, OpenAI’s latest agentic coding model, demonstrates exceptional reliability with a 99% success rate, consistently providing usable responses. Its speed performance is moderate, ranking in the 27th percentile across benchmarks, while its pricing tends to be premium, positioned in the 6th percentile. The model excels in knowledge-based tasks, achieving perfect accuracy in General Knowledge and Ethics, often being the most accurate at its price point and speed. It also shows strong performance in Instruction Following (95th percentile accuracy) and Coding (93rd percentile accuracy), indicating robust capabilities for software development tasks. Mathematics performance is solid at 93% accuracy. A notable weakness is its performance in Hallucinations, where it achieved 88% accuracy, placing it in the 36th percentile, suggesting room for improvement in acknowledging uncertainty. Email Classification is moderate at 97% accuracy. Overall, GPT-5.1-Codex-Max is a highly reliable model with strong reasoning and coding capabilities, albeit with a premium cost and moderate speed.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.25
Completion $10
Input Cache Read $0.125
Web Search $10000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-5.1-codex-max-20251204 400K $1.25 / 1M tokens $10 / 1M tokens
Azure
Azure | openai/gpt-5.1-codex-max-20251204 400K $1.25 / 1M tokens $10 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai