Anthropic: Claude Opus 4.1

Text input File input Image input Text output
Author's Description

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains in multi-file code refactoring, debugging precision, and detail-oriented reasoning. The model supports extended thinking up to 64K tokens and is optimized for tasks involving research, data analysis, and tool-assisted reasoning.

Key Specifications
Cost
+$$$$$
Context
200K
Released
Aug 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Temperature Include Reasoning Tool Choice Max Tokens Reasoning Stop
Features

This model supports the following features:

Reasoning Tools
Performance Summary

Anthropic's Claude Opus 4.1, released August 5, 2025, is an advanced model excelling in coding, reasoning, and agentic tasks, with a substantial 200,000 context length. It demonstrates moderate speed performance, ranking in the 34th percentile across benchmarks, and is positioned at premium pricing levels, in the 3rd percentile. A standout feature is its exceptional reliability, achieving a 100% success rate with minimal technical failures. The model exhibits perfect accuracy in Hallucinations (100%) and General Knowledge (100%), often being the most accurate at its price point and speed. It also achieves perfect accuracy in Ethics (100%). Strong performance is noted in Coding (94.0% accuracy, 92nd percentile) and Reasoning (92.0% accuracy, 82nd percentile), aligning with its description of improved capabilities in these areas. Mathematics (92.9% accuracy, 76th percentile) and Instruction Following (69.0% accuracy, 80th percentile) also show solid results. Email Classification, while accurate at 98.0%, ranks in the 54th percentile, indicating average performance relative to its peers in this specific task. Its key strengths lie in its accuracy across critical domains and its robust reliability, making it suitable for demanding applications requiring precision and consistency.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $15
Completion $75
Input Cache Read $1.5
Input Cache Write $18.8
Web Search $10000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Anthropic
Anthropic | anthropic/claude-4.1-opus-20250805 200K $15 / 1M tokens $75 / 1M tokens
Google
Google | anthropic/claude-4.1-opus-20250805 200K $15 / 1M tokens $75 / 1M tokens
Google
Google | anthropic/claude-4.1-opus-20250805 200K $15 / 1M tokens $75 / 1M tokens
Google
Google | anthropic/claude-4.1-opus-20250805 200K $15 / 1M tokens $75 / 1M tokens
Amazon Bedrock
Amazon Bedrock | anthropic/claude-4.1-opus-20250805 200K $15 / 1M tokens $75 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by anthropic