OpenAI: GPT-5 Codex

Image input Text input Text output
Author's Description

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5, Codex is more steerable, adheres closely to developer instructions, and produces cleaner, higher-quality code outputs. Reasoning effort can be adjusted with the `reasoning.effort` parameter. Read the [docs here](https://openrouter.ai/docs/use-cases/reasoning-tokens#reasoning-effort-level) Codex integrates into developer environments including the CLI, IDE extensions, GitHub, and cloud tasks. It adapts reasoning effort dynamically—providing fast responses for small tasks while sustaining extended multi-hour runs for large projects. The model is trained to perform structured code reviews, catching critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs such as images or screenshots for UI development and integrates tool use for search, dependency installation, and environment setup. Codex is intended specifically for agentic coding applications.

Key Specifications
Cost
$$$$$
Context
400K
Released
Sep 23, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Reasoning Structured Outputs Max Tokens Seed Response Format Include Reasoning Tools Tool Choice
Features

This model supports the following features:

Structured Outputs Tools Reasoning Response Format
Performance Summary

OpenAI's GPT-5 Codex, released September 23, 2025, is a specialized AI model designed for comprehensive software engineering and coding workflows. It demonstrates competitive response times, ranking in the 48th percentile across seven benchmarks, and exhibits exceptional reliability with a 100% success rate. However, its pricing tends to be at premium levels, placing it in the 20th percentile for cost-effectiveness. Codex excels in specific areas, achieving perfect accuracy in Hallucinations (100%) and Ethics (100%), indicating strong adherence to factual constraints and ethical principles. It also shows robust performance in Coding (94.0% accuracy), Reasoning (96.0% accuracy), and Mathematics (93.0% accuracy), highlighting its capabilities for complex problem-solving and code generation. While its General Knowledge (99.5%) and Email Classification (98.0%) scores are high, they are not its absolute strongest categories relative to other models. A notable strength is its ability to adjust reasoning effort dynamically, providing fast responses for small tasks and sustaining multi-hour runs for large projects, making it highly adaptable for agentic coding applications. Its primary weakness lies in its premium pricing structure.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.25
Completion $10
Input Cache Read $0.125

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-5-codex 400K $1.25 / 1M tokens $10 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai