Perplexity: Llama 3.1 Sonar 70B Online

Text input Text output Unavailable
Author's Description

Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is the online version of the [offline chat model](/models/perplexity/llama-3.1-sonar-large-128k-chat). It is focused on delivering helpful, up-to-date, and factual responses. #online

Key Specifications
Cost
$$$$
Context
127K
Parameters
70B
Released
Jul 31, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Max Tokens Top P Frequency Penalty Temperature Presence Penalty
Performance Summary

Perplexity: Llama 3.1 Sonar 70B Online, released on July 31, 2024, demonstrates moderate speed performance, ranking in the 38th percentile across benchmarks. Its pricing is also moderate, placing it in the 39th percentile. This online model, focused on delivering up-to-date and factual responses, is positioned as a more cost-efficient, faster, and higher-performing alternative to earlier Sonar models. Analysis of benchmark results reveals significant variability in performance across categories. The model exhibits a notable strength in Email Classification, achieving 97.0% accuracy, placing it in the 46th percentile for this task. However, its performance in General Knowledge and Coding is considerably weaker, with accuracies of 8.5% (11th percentile) and 1.0% (7th percentile) respectively, indicating these are significant areas for improvement. In Ethics, it achieved 81.0% accuracy, ranking in the 21st percentile. While its speed and cost are moderate overall, the duration for General Knowledge and Coding benchmarks was relatively high.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1
Completion $1
Request $0.005 per request

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Perplexity
Perplexity | perplexity/llama-3.1-sonar-large-128k-online 127K $1 / 1M tokens $1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by perplexity