Llama Guard 3 8B

Text input Text output
Author's Description

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM – it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated. Llama Guard 3 was aligned to safeguard against the MLCommons standardized hazards taxonomy and designed to support Llama 3.1 capabilities. Specifically, it provides content moderation in 8 languages, and was optimized to support safety and security for search and code interpreter tool calls.

Key Specifications
Cost
$$
Context
131K
Parameters
8B
Released
Feb 12, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top Logprobs Logit Bias Logprobs Stop Seed Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Performance Summary

Llama Guard 3 8B, a Llama-3.1-8B model fine-tuned for content safety classification, demonstrates exceptional performance in terms of speed and cost-efficiency. It consistently ranks among the fastest models and offers among the most competitive pricing across all benchmarks. However, its performance on general benchmarks is notably low, reflecting its specialized purpose. In categories like General Knowledge, Ethics, Mathematics, Email Classification, Instruction Following, and Coding, the model achieved very low accuracy scores, often in the single digits or 0%. For instance, it scored 5.3% in General Knowledge and 0.5% in Ethics, indicating it is not designed for general-purpose reasoning or task execution. Its duration for these tasks also varied significantly, with some benchmarks like Instruction Following taking exceptionally long. This model's strength lies in its intended application: content safety classification, particularly for LLM inputs and responses, aligned with the MLCommons standardized hazards taxonomy. Its optimization for 8 languages and support for search and code interpreter tool calls further highlight its specialized utility. The benchmark results clearly indicate that Llama Guard 3 8B is not a general-purpose AI but a highly specialized tool for content moderation, where its speed and cost-effectiveness are significant advantages.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.02
Completion $0.06

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Nebius
Nebius | meta-llama/llama-guard-3-8b 131K $0.02 / 1M tokens $0.06 / 1M tokens
DeepInfra
DeepInfra | meta-llama/llama-guard-3-8b 131K $0.055 / 1M tokens $0.055 / 1M tokens
Fireworks
Fireworks | meta-llama/llama-guard-3-8b 131K $0.02 / 1M tokens $0.06 / 1M tokens
Together
Together | meta-llama/llama-guard-3-8b 8K $0.2 / 1M tokens $0.2 / 1M tokens
Groq
Groq | meta-llama/llama-guard-3-8b 8K $0.02 / 1M tokens $0.06 / 1M tokens
SambaNova
SambaNova | meta-llama/llama-guard-3-8b 16K $0.02 / 1M tokens $0.06 / 1M tokens
Cloudflare
Cloudflare | meta-llama/llama-guard-3-8b 0 $0.48 / 1M tokens $0.03 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by meta-llama