OpenAI: GPT-5 Codex

Text input Image input Text output
Author's Description

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5, Codex is more steerable, adheres closely to developer instructions, and produces cleaner, higher-quality code outputs. Reasoning effort can be adjusted with the `reasoning.effort` parameter. Read the [docs here](https://openrouter.ai/docs/use-cases/reasoning-tokens#reasoning-effort-level) Codex integrates into developer environments including the CLI, IDE extensions, GitHub, and cloud tasks. It adapts reasoning effort dynamically—providing fast responses for small tasks while sustaining extended multi-hour runs for large projects. The model is trained to perform structured code reviews, catching critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs such as images or screenshots for UI development and integrates tool use for search, dependency installation, and environment setup. Codex is intended specifically for agentic coding applications.

Key Specifications
Cost
$$$$$
Context
400K
Released
Sep 23, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Tools Structured Outputs Response Format Reasoning Include Reasoning Tool Choice Max Tokens
Features

This model supports the following features:

Structured Outputs Response Format Tools Reasoning
Performance Summary

OpenAI's GPT-5 Codex, created on September 23, 2025, is a specialized AI model designed for advanced software engineering and coding workflows. It demonstrates competitive response times, ranking in the 51st percentile across seven benchmarks, and offers moderate pricing, placing it in the 21st percentile. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating minimal technical failures. In terms of performance across categories, GPT-5 Codex excels in Hallucinations and Ethics, achieving 100% accuracy in both, with the former also being the most accurate and fastest at its price point. It shows strong capabilities in Coding (94.0% accuracy, 87th percentile) and Reasoning (96.0% accuracy, 82nd percentile), highlighting its proficiency in complex problem-solving and programming tasks. General Knowledge and Mathematics also show solid performance at 99.5% and 93.0% accuracy respectively. Its Email Classification accuracy is 98.0%. The model's ability to adapt reasoning effort dynamically and integrate into various developer environments further enhances its utility for agentic coding applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.25
Completion $10
Input Cache Read $0.125

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-5-codex 400K $1.25 / 1M tokens $10 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai