Meta: Llama 3 8B Instruct

Text input Text output
Author's Description

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...

Key Specifications
Cost
$
Context
8K
Parameters
8B
Released
Apr 17, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Tools Frequency Penalty Top P Min P Response Format Temperature Stop Presence Penalty Tool Choice Max Tokens
Features

This model supports the following features:

Response Format Tools
Performance Summary

Meta's Llama 3 8B Instruct model, launched on April 17, 2024, demonstrates a strong overall performance profile, particularly excelling in operational efficiency. It performs among the fastest models, ranking in the top tier for speed (67th percentile), and consistently offers highly competitive pricing, placing in the 93rd percentile. The model also exhibits strong reliability with an 85% success rate, indicating consistent delivery of usable responses. In terms of specific benchmarks, Llama 3 8B Instruct shows notable strengths in practical applications like Email Classification (96.0% accuracy) and effectively managing hallucinations (88.0% accuracy), appropriately acknowledging uncertainty. While its General Knowledge (87.8% accuracy) and Ethics (87.5% accuracy) scores are respectable, they fall within the lower quartiles for these categories. The model's primary weaknesses lie in more complex cognitive tasks, with significantly lower accuracy in Mathematics (25.0%), Instruction Following (33.0%), and particularly Reasoning (12.0%). This suggests it is well-suited for dialogue and classification tasks but may require further development for advanced analytical and problem-solving scenarios.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.03
Completion $0.04

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | meta-llama/llama-3-8b-instruct 8K $0.03 / 1M tokens $0.04 / 1M tokens
Novita
Novita | meta-llama/llama-3-8b-instruct 8K $0.03 / 1M tokens $0.04 / 1M tokens
Groq
Groq | meta-llama/llama-3-8b-instruct 8K $0.03 / 1M tokens $0.04 / 1M tokens
Mancer 2
Mancer 2 | meta-llama/llama-3-8b-instruct 16K $0.03 / 1M tokens $0.04 / 1M tokens
Together
Together | meta-llama/llama-3-8b-instruct 8K $0.1 / 1M tokens $0.1 / 1M tokens
Cloudflare
Cloudflare | meta-llama/llama-3-8b-instruct 7K $0.28 / 1M tokens $0.83 / 1M tokens
Novita
Novita | meta-llama/llama-3-8b-instruct 8K $0.04 / 1M tokens $0.04 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by meta-llama