Perplexity: Llama 3.1 Sonar 8B Online

Text input Text output Unavailable
Author's Description

Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is the online version of the [offline chat model](/models/perplexity/llama-3.1-sonar-small-128k-chat). It is focused on delivering helpful, up-to-date, and factual responses. #online

Key Specifications
Cost
$$
Context
127K
Parameters
8B
Released
Jul 31, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top P Max Tokens Presence Penalty Temperature Frequency Penalty
Performance Summary

Perplexity's Llama 3.1 Sonar 8B Online model demonstrates exceptional performance in terms of speed and cost-efficiency. It consistently ranks among the fastest models available and offers highly competitive pricing, making it an attractive option for high-volume or budget-conscious applications. However, its performance across various benchmark categories is mixed. The model shows a notable strength in Reasoning, achieving 74% accuracy, placing it in the 74th percentile. This suggests proficiency in complex problem-solving, logical inference, and abstract thinking. Conversely, the model exhibits significant weaknesses in other areas. Its accuracy in Coding (24%) and Email Classification (81%) is relatively low, placing it in the 22nd and 10th percentiles, respectively. Most concerning are its results in Ethics and General Knowledge, where it scored 4% and 0% accuracy, indicating substantial limitations in these domains. While cost-efficient and fast, the model's utility may be limited to tasks where strong reasoning is paramount and factual recall or ethical discernment are less critical.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.2
Completion $0.2
Request $0.005 per request

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Perplexity
Perplexity | perplexity/llama-3.1-sonar-small-128k-online 127K $0.2 / 1M tokens $0.2 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by perplexity