OpenAI: GPT-5.2-Codex

Image input Text input Text output
Author's Description

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5.1-Codex, 5.2-Codex is more steerable, adheres closely to developer instructions, and produces cleaner, higher-quality code outputs. Reasoning effort can be adjusted with the `reasoning.effort` parameter. Read the [docs here](https://openrouter.ai/docs/use-cases/reasoning-tokens#reasoning-effort-level) Codex integrates into developer environments including the CLI, IDE extensions, GitHub, and cloud tasks. It adapts reasoning effort dynamically—providing fast responses for small tasks while sustaining extended multi-hour runs for large projects. The model is trained to perform structured code reviews, catching critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs such as images or screenshots for UI development and integrates tool use for search, dependency installation, and environment setup. Codex is intended specifically for agentic coding applications.

Key Specifications
Cost
$$$$$
Context
400K
Released
Jan 14, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Stop Reasoning Presence Penalty Top Logprobs Frequency Penalty Logit Bias Logprobs Include Reasoning Tools Max Tokens Structured Outputs Response Format Tool Choice
Features

This model supports the following features:

Response Format Tools Structured Outputs Reasoning
Performance Summary

OpenAI's GPT-5.2-Codex, released on January 14, 2026, demonstrates strong performance tailored for software engineering workflows. It performs among the fastest models, ranking in the 61st percentile for speed across eight benchmarks, and offers moderate pricing, placing in the 22nd percentile. Notably, the model exhibits exceptional reliability with a 100% success rate, indicating minimal technical failures. Its core strengths lie in its perfect accuracy in Hallucinations, General Knowledge, and Ethics benchmarks, alongside high accuracy in Coding (94.0%), Mathematics (95.0%), and Reasoning (96.0%). This highlights its robust understanding and problem-solving capabilities, particularly in areas critical for development. While its Instruction Following (89.0%) and Email Classification (98.0%) are strong, they are not at the absolute peak of its performance. The model's ability to adapt reasoning effort, integrate into developer environments, and support multimodal inputs further enhances its utility for agentic coding applications, making it a highly capable and reliable tool for complex software development tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.75
Completion $14
Input Cache Read $0.175
Web Search $10000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-5.2-codex-20260114 400K $1.75 / 1M tokens $14 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai