Author's Description
Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is the online version of the [offline chat model](/models/perplexity/llama-3.1-sonar-small-128k-chat). It is focused on delivering helpful, up-to-date, and factual responses. #online
Key Specifications
Supported Parameters
This model supports the following parameters:
Performance Summary
Perplexity's Llama 3.1 Sonar 8B Online model demonstrates exceptional performance in terms of speed and cost-efficiency. It consistently ranks among the fastest models available and offers highly competitive pricing, making it an attractive option for high-volume or budget-conscious applications. However, its performance across various benchmark categories is mixed. The model shows a notable strength in Reasoning, achieving 74% accuracy, placing it in the 74th percentile. This suggests proficiency in complex problem-solving, logical inference, and abstract thinking. Conversely, the model exhibits significant weaknesses in other areas. Its accuracy in Coding (24%) and Email Classification (81%) is relatively low, placing it in the 22nd and 10th percentiles, respectively. Most concerning are its results in Ethics and General Knowledge, where it scored 4% and 0% accuracy, indicating substantial limitations in these domains. While cost-efficient and fast, the model's utility may be limited to tasks where strong reasoning is paramount and factual recall or ethical discernment are less critical.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.2 |
Completion | $0.2 |
Request | $0.005 per request |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Perplexity
|
Perplexity | perplexity/llama-3.1-sonar-small-128k-online | 127K | $0.2 / 1M tokens | $0.2 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by perplexity
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Perplexity: Sonar Reasoning Pro | Mar 06, 2025 | — | 128K |
Image input
Text input
Text output
|
★ | ★★★★★ | $$$$$ |
Perplexity: Sonar Pro | Mar 06, 2025 | — | 200K |
Image input
Text input
Text output
|
★★★ | ★★★★★ | $$$$$ |
Perplexity: Sonar Deep Research | Mar 06, 2025 | — | 128K |
Text input
Text output
|
— | — | $$$$$ |
Perplexity: R1 1776 | Feb 19, 2025 | — | 128K |
Text input
Text output
|
★ | ★★★★★ | $$$$$ |
Perplexity: Sonar Reasoning | Jan 28, 2025 | — | 127K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Perplexity: Sonar | Jan 27, 2025 | — | 127K |
Image input
Text input
Text output
|
★★★ | ★★★ | $$$$ |
Perplexity: Llama 3.1 Sonar 70B Online Unavailable | Jul 31, 2024 | 70B | 127K |
Text input
Text output
|
★★ | ★ | $$$$ |