Anthropic: Claude 3.5 Sonnet

Text input Image input File input Text output
Author's Description

New Claude 3.5 Sonnet delivers better-than-Opus capabilities, faster-than-Sonnet speeds, at the same Sonnet prices. Sonnet is particularly good at: - Coding: Scores ~49% on SWE-Bench Verified, higher than the last best score, and without any fancy prompt scaffolding - Data science: Augments human data science expertise; navigates unstructured data while using multiple tools for insights - Visual processing: excelling at interpreting charts, graphs, and images, accurately transcribing text to derive insights beyond just the text alone - Agentic tasks: exceptional tool use, making it great at agentic tasks (i.e. complex, multi-step problem solving tasks that require engaging with other systems) #multimodal

Key Specifications
Cost
$$$$$
Context
200K
Parameters
400B (Rumoured)
Released
Oct 21, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Stop Top P Max Tokens Temperature
Features

This model supports the following features:

Tools
Performance Summary

Anthropic's Claude 3.5 Sonnet demonstrates competitive response times, performing among the faster models with a 48th percentile speed ranking. However, its pricing is positioned at premium levels, ranking in the 13th percentile for cost-effectiveness. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating minimal technical failures. The model excels in foundational areas, achieving perfect 100% accuracy in Hallucinations (Baseline), General Knowledge (Baseline), and Ethics (Baseline), often being the most accurate at its respective price and speed points. It also shows strong performance in Instruction Following (89th percentile accuracy) and respectable capabilities in Mathematics (89.0% accuracy) and Reasoning (76.0% accuracy). While its Coding performance (82.0% accuracy) is solid, it falls in the 49th percentile, suggesting room for improvement compared to top-tier coding models. Email Classification is competent at 98.0% accuracy. Overall, Claude 3.5 Sonnet is a highly reliable model with strong general knowledge, ethical understanding, and instruction following, making it well-suited for tasks requiring accuracy and dependability, despite its premium pricing.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $3
Completion $15
Input Cache Read $0.3
Input Cache Write $3.75

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Anthropic
Anthropic | anthropic/claude-3.5-sonnet 200K $3 / 1M tokens $15 / 1M tokens
Google
Google | anthropic/claude-3.5-sonnet 200K $3 / 1M tokens $15 / 1M tokens
Amazon Bedrock
Amazon Bedrock | anthropic/claude-3.5-sonnet 200K $3 / 1M tokens $15 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by anthropic