Anthropic: Claude Opus 4

Text input Image input File input Text output
Author's Description

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in software engineering, achieving leading results on SWE-bench (72.5%) and Terminal-bench (43.2%). Opus 4 supports extended, agentic workflows, handling thousands of task steps continuously for hours without degradation. Read more at the [blog post here](https://www.anthropic.com/news/claude-4)

Key Specifications
Cost
+$$$$$
Context
200K
Released
May 22, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Reasoning Include Reasoning Stop Max Tokens Temperature
Features

This model supports the following features:

Tools Reasoning
Performance Summary

Anthropic's Claude Opus 4, released May 22, 2025, demonstrates a strong performance profile, particularly in specialized domains. While its speed performance is moderate, ranking in the 30th percentile, its pricing tends to be premium, positioned in the 3rd percentile. A standout feature is its exceptional reliability, achieving a 100% success rate across all benchmarks, indicating minimal technical failures. Opus 4 excels in coding, achieving 94.0% accuracy and being benchmarked as the world's best coding model at release, with leading results on SWE-bench (72.5%) and Terminal-bench (43.2%). It also shows perfect accuracy in Hallucinations, General Knowledge, and Ethics, often being the most accurate model at its price point and speed for these categories. Its Reasoning and Mathematics capabilities are also very strong, scoring 90.0% and 94.0% respectively. Instruction Following is solid at 71.0%. The model's ability to handle extended, agentic workflows for hours without degradation is a significant strength. The primary weakness appears to be its premium pricing across most benchmarks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $15
Completion $75
Input Cache Read $1.5
Input Cache Write $18.8
Web Search $10000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Anthropic
Anthropic | anthropic/claude-4-opus-20250522 200K $15 / 1M tokens $75 / 1M tokens
Amazon Bedrock
Amazon Bedrock | anthropic/claude-4-opus-20250522 200K $15 / 1M tokens $75 / 1M tokens
Google
Google | anthropic/claude-4-opus-20250522 200K $15 / 1M tokens $75 / 1M tokens
Google
Google | anthropic/claude-4-opus-20250522 200K $15 / 1M tokens $75 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by anthropic