Anthropic: Claude 3.5 Sonnet (2024-06-20)

Text input Image input File input Text output
Author's Description

Claude 3.5 Sonnet delivers better-than-Opus capabilities, faster-than-Sonnet speeds, at the same Sonnet prices. Sonnet is particularly good at: - Coding: Autonomously writes, edits, and runs code with reasoning and troubleshooting - Data science: Augments human data science expertise; navigates unstructured data while using multiple tools for insights - Visual processing: excelling at interpreting charts, graphs, and images, accurately transcribing text to derive insights beyond just the text alone - Agentic tasks: exceptional tool use, making it great at agentic tasks (i.e. complex, multi-step problem solving tasks that require engaging with other systems) For the latest version (2024-10-23), check out [Claude 3.5 Sonnet](/anthropic/claude-3.5-sonnet). #multimodal

Key Specifications
Cost
$$$$$
Context
200K
Parameters
400B (Rumoured)
Released
Jun 19, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Stop Top P Max Tokens Temperature
Features

This model supports the following features:

Tools
Performance Summary

Anthropic's Claude 3.5 Sonnet (2024-06-20) demonstrates a strong overall performance profile, particularly excelling in reliability and specific accuracy benchmarks. The model exhibits exceptional reliability with a 98% success rate, indicating minimal technical failures and consistent response generation. In terms of speed, it offers competitive response times, ranking in the 57th percentile across benchmarks. However, its pricing tends to be at premium levels, placing it in the 13th percentile for cost-effectiveness. A key strength is its perfect accuracy in critical areas such as Hallucinations, General Knowledge, and Ethics, often achieving the most accurate results at its price point and speed. This suggests a robust understanding and adherence to factual information and ethical principles. The model also performs very well in Email Classification (99.0% accuracy) and Instruction Following (77.0% accuracy, 89th percentile), showcasing its ability to process and act upon detailed directives. While strong in Reasoning (80.0% accuracy) and Mathematics (87.0% accuracy), these areas show some room for improvement compared to its perfect scores elsewhere. Its Coding performance is solid at 80.0% accuracy, aligning with its description as capable in coding tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $3
Completion $15
Input Cache Read $0.3
Input Cache Write $3.75

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Anthropic
Anthropic | anthropic/claude-3.5-sonnet-20240620 200K $3 / 1M tokens $15 / 1M tokens
Google
Google | anthropic/claude-3.5-sonnet-20240620 200K $3 / 1M tokens $15 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by anthropic