Author's Description
The highly anticipated 400B class of Llama3 is here! Clocking in at 128k context with impressive eval scores, the Meta AI team continues to push the frontier of open-source LLMs. Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 405B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models including GPT-4o and Claude 3.5 Sonnet in evaluations. To read more about the model release, [click here](https://ai.meta.com/blog/meta-llama-3-1/). Usage of this model is subject to [Meta's Acceptable Use Policy](https://llama.meta.com/llama3/use-policy/).
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Meta's Llama 3.1 405B Instruct model, released on July 22, 2024, demonstrates strong capabilities as a high-quality dialogue optimized LLM. It consistently ranks among the fastest models across nine benchmarks and offers competitive pricing, placing in the 46th percentile. The model exhibits exceptional reliability with a 97% success rate, indicating minimal technical failures. In terms of performance, Llama 3.1 405B Instruct shows particular strength in ethical reasoning and classification tasks, achieving perfect 100% accuracy in both Ethics and Email Classification benchmarks. It also performs well in mitigating hallucinations, with a 96.0% accuracy in the Hallucinations (Baseline) test. While its General Knowledge and Coding scores are in the lower percentiles (31st), its Mathematics accuracy is moderate at 82.0%. A notable weakness is observed in one instance of the Instruction Following benchmark, where it scored 0.0% accuracy, though it achieved 60.0% in another, suggesting variability or a specific challenge in highly complex, multi-layered instructions. Overall, the model presents a compelling option for dialogue-centric applications, balancing speed, reliability, and strong performance in critical areas.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.8 |
Completion | $0.8 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra
|
DeepInfra | meta-llama/llama-3.1-405b-instruct | 32K | $0.8 / 1M tokens | $0.8 / 1M tokens |
Lambda
|
Lambda | meta-llama/llama-3.1-405b-instruct | 131K | $0.8 / 1M tokens | $0.8 / 1M tokens |
Nebius
|
Nebius | meta-llama/llama-3.1-405b-instruct | 131K | $1 / 1M tokens | $3 / 1M tokens |
Fireworks
|
Fireworks | meta-llama/llama-3.1-405b-instruct | 131K | $3 / 1M tokens | $3 / 1M tokens |
Together
|
Together | meta-llama/llama-3.1-405b-instruct | 130K | $3.5 / 1M tokens | $3.5 / 1M tokens |
Hyperbolic
|
Hyperbolic | meta-llama/llama-3.1-405b-instruct | 131K | $4 / 1M tokens | $4 / 1M tokens |
SambaNova
|
SambaNova | meta-llama/llama-3.1-405b-instruct | 16K | $0.8 / 1M tokens | $0.8 / 1M tokens |
Google
|
Google | meta-llama/llama-3.1-405b-instruct | 128K | $5 / 1M tokens | $16 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by meta-llama
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Meta: Llama Guard 4 12B | Apr 29, 2025 | 12B | 163K |
Text input
Image input
Text output
|
— | ★ | $$ |
Meta: Llama 4 Maverick | Apr 05, 2025 | 17B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$ |
Meta: Llama 4 Scout | Apr 05, 2025 | 17B | 327K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$ |
Llama Guard 3 8B | Feb 12, 2025 | 8B | 131K |
Text input
Text output
|
★★ | ★ | $$ |
Meta: Llama 3.3 70B Instruct | Dec 06, 2024 | 70B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $ |
Meta: Llama 3.2 1B Instruct | Sep 24, 2024 | 1B | 131K |
Text input
Text output
|
★★ | ★ | $ |
Meta: Llama 3.2 3B Instruct | Sep 24, 2024 | 3B | 131K |
Text input
Text output
|
★★★ | ★ | $ |
Meta: Llama 3.2 11B Vision Instruct | Sep 24, 2024 | 11B | 128K |
Text input
Image input
Text output
|
★★ | ★★ | $$ |
Meta: Llama 3.2 90B Vision Instruct | Sep 24, 2024 | 90B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $$$$ |
Meta: Llama 3.1 405B (base) | Aug 01, 2024 | 405B | 32K |
Text input
Text output
|
★ | ★ | $$$ |
Meta: Llama 3.1 70B Instruct | Jul 22, 2024 | 70B | 131K |
Text input
Text output
|
★★★★ | ★★ | $$ |
Meta: Llama 3.1 8B Instruct | Jul 22, 2024 | 8B | 131K |
Text input
Text output
|
★★★ | ★★ | $ |
Meta: LlamaGuard 2 8B | May 12, 2024 | 8B | 8K |
Text input
Text output
|
★★★★ | ★ | $$ |
Meta: Llama 3 8B Instruct | Apr 17, 2024 | 8B | 8K |
Text input
Text output
|
★★★ | ★★ | $ |
Meta: Llama 3 70B Instruct | Apr 17, 2024 | 70B | 8K |
Text input
Text output
|
★★★★ | ★★ | $$$ |
Meta: Llama 2 70B Chat Unavailable | Jun 19, 2023 | 70B | 4K |
Text input
Text output
|
— | — | $$$$ |