Anthropic: Claude Sonnet 4

File input Text input Image input Text output
Author's Description

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%), Sonnet 4 balances capability and computational efficiency, making it suitable for a broad range of applications from routine coding tasks to complex software development projects. Key enhancements include improved autonomous codebase navigation, reduced error rates in agent-driven workflows, and increased reliability in following intricate instructions. Sonnet 4 is optimized for practical everyday use, providing advanced reasoning capabilities while maintaining efficiency and responsiveness in diverse internal and external scenarios. Read more at the [blog post here](https://www.anthropic.com/news/claude-4)

Key Specifications
Cost
$$$$$
Context
200K
Released
May 22, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Stop Tool Choice Temperature Tools Reasoning Max Tokens
Features

This model supports the following features:

Tools Reasoning
Performance Summary

Claude Sonnet 4, released by Anthropic on May 22, 2025, demonstrates a strong overall performance profile, particularly excelling in reliability. It consistently provides usable responses, achieving a perfect 100th percentile in reliability, indicating minimal technical failures. In terms of speed, Sonnet 4 exhibits competitive response times, ranking in the 41st percentile across benchmarks. However, its pricing tends to be at a premium level, positioned in the 12th percentile. The model showcases exceptional accuracy across several key benchmarks. It achieved perfect accuracy in Ethics (Baseline), making it the most accurate model at its price point and speed. Sonnet 4 also performed very strongly in Coding (93.0% accuracy), being the most accurate among models of comparable speed, and in Reasoning (96.0% accuracy) and General Knowledge (99.8% accuracy), both ranking in the top percentiles. While its Instruction Following accuracy (66.0%) is solid, it is not as dominant as its performance in other areas. Email Classification also shows high accuracy at 99.0%. Sonnet 4's key strengths lie in its advanced reasoning, coding capabilities, and ethical alignment, making it suitable for complex software development and agent-driven workflows, as highlighted by its state-of-the-art SWE-bench performance. Its primary limitation is its premium cost.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $3
Completion $15
Input Cache Read $0.3
Input Cache Write $3.75

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | anthropic/claude-4-sonnet-20250522 200K $3 / 1M tokens $15 / 1M tokens
Google
Google | anthropic/claude-4-sonnet-20250522 200K $3 / 1M tokens $15 / 1M tokens
Amazon Bedrock
Amazon Bedrock | anthropic/claude-4-sonnet-20250522 1M $3 / 1M tokens $15 / 1M tokens
Anthropic
Anthropic | anthropic/claude-4-sonnet-20250522 1M $3 / 1M tokens $15 / 1M tokens
Google
Google | anthropic/claude-4-sonnet-20250522 200K $3 / 1M tokens $15 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by anthropic