OpenAI: GPT-4o

Image input File input Text input Text output
Author's Description

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called ["im-also-a-good-gpt2-chatbot"](https://twitter.com/LiamFedus/status/1790064963966370209) #multimodal

Key Specifications
Cost
$$$$$
Context
128K
Parameters
200B (Rumoured)
Released
May 12, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top Logprobs Stop Structured Outputs Logprobs Presence Penalty Frequency Penalty Top P Max Tokens Tool Choice Response Format Logit Bias Seed Temperature Tools
Features

This model supports the following features:

Response Format Tools Structured Outputs
Performance Summary

OpenAI's GPT-4o demonstrates a strong overall performance profile, particularly excelling in speed and reliability. It consistently performs among the fastest models, ranking in the top tier (75th percentile) across various benchmarks. In terms of cost, GPT-4o offers competitive pricing, generally falling within the 21st percentile, making it a cost-effective option for its capabilities. The model exhibits exceptional reliability, achieving a perfect 100% success rate across all evaluated benchmarks, indicating minimal technical failures and consistent response delivery. Across specific categories, GPT-4o shows remarkable accuracy in Hallucinations (100%), Ethics (100%), and Email Classification (99.0%), often being the most accurate model at its price point or speed. It also performs very well in Coding (93.0% accuracy, 86th percentile) and Instruction Following (69.0% accuracy, 80th percentile), showcasing strong capabilities in these areas. While its General Knowledge (99.5% accuracy, 74th percentile), Mathematics (90.0% accuracy, 64th percentile), and Reasoning (84.0% accuracy, 74th percentile) scores are solid, they are not always at the absolute top tier compared to its perfect scores in other areas. Its key strengths lie in its speed, reliability, and high accuracy in critical areas like ethics and avoiding hallucinations, coupled with its multimodal capabilities and improved non-English language processing. No significant weaknesses were identified in its core performance metrics.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $2.5
Completion $10
Input Cache Read $1.25

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
OpenAI
OpenAI | openai/gpt-4o 128K $2.5 / 1M tokens $10 / 1M tokens
Azure
Azure | openai/gpt-4o 128K $2.5 / 1M tokens $10 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai