Meta: Llama 3 8B Instruct

Text input Text output
Author's Description

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, [click here](https://ai.meta.com/blog/meta-llama-3/). Usage of this model is subject to [Meta's Acceptable Use Policy](https://llama.meta.com/llama3/use-policy/).

Key Specifications
Cost
$
Context
8K
Parameters
8B
Released
Apr 17, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Response Format Stop Seed Min P Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Features

This model supports the following features:

Tools Response Format
Performance Summary

Meta's Llama 3 8B Instruct model, released on April 17, 2024, is an instruct-tuned version optimized for high-quality dialogue. It performs among the fastest models, typically ranking in the top tier for speed (65th percentile). The model consistently offers highly competitive pricing, placing it in the 94th percentile across benchmarks. Its reliability is strong, with an 85% success rate, indicating consistent and usable responses. In terms of performance across categories, Llama 3 8B Instruct demonstrates a notable strength in Email Classification, achieving 96.0% accuracy. It also performs reasonably well in Hallucinations (88.0% accuracy), General Knowledge (87.8% accuracy), and Ethics (87.5% accuracy), suggesting a solid foundation in these areas. However, the model exhibits significant weaknesses in more complex cognitive tasks. Its accuracy in Mathematics is low at 25.0%, and it struggles considerably with Instruction Following (33.0% accuracy) and especially Reasoning, where it scores only 12.0% accuracy. This indicates that while it excels in certain classification and knowledge recall tasks, its capabilities for intricate problem-solving and multi-step instruction execution are limited.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.03
Completion $0.06

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | meta-llama/llama-3-8b-instruct 8K $0.03 / 1M tokens $0.06 / 1M tokens
Novita
Novita | meta-llama/llama-3-8b-instruct 8K $0.04 / 1M tokens $0.04 / 1M tokens
Groq
Groq | meta-llama/llama-3-8b-instruct 8K $0.03 / 1M tokens $0.06 / 1M tokens
Mancer 2
Mancer 2 | meta-llama/llama-3-8b-instruct 16K $0.03 / 1M tokens $0.06 / 1M tokens
Together
Together | meta-llama/llama-3-8b-instruct 8K $0.1 / 1M tokens $0.1 / 1M tokens
Cloudflare
Cloudflare | meta-llama/llama-3-8b-instruct 7K $0.28 / 1M tokens $0.83 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by meta-llama