Meta: Llama 3.2 3B Instruct

Text input Text output Free Option
Author's Description

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

Key Specifications
Cost
$
Context
131K
Parameters
3B
Released
Sep 24, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Top P Min P Response Format Temperature Stop Presence Penalty Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Meta's Llama 3.2 3B Instruct model consistently ranks among the fastest models available and offers highly competitive pricing, placing it in the 96th percentile for cost-efficiency across benchmarks. This 3-billion-parameter multilingual model, designed for advanced NLP tasks, demonstrates a balanced performance profile. It shows a notable strength in hallucination mitigation, achieving 78.0% accuracy, indicating a good ability to acknowledge uncertainty. The model also performs reasonably well in General Knowledge (76.0% accuracy) and Email Classification (88.0% accuracy), where it is highlighted as the most accurate model at its price point. However, Llama 3.2 3B exhibits significant weaknesses in more complex cognitive tasks. Its performance in Ethics (44.0%), Mathematics (4.0%), Reasoning (20.4%), and Coding (11.0%) benchmarks is considerably low, suggesting limitations in these specialized domains. Instruction Following also presents mixed results, with one benchmark showing 0.0% accuracy and another at 26.3%. Overall, the model is well-suited for applications prioritizing speed, cost-effectiveness, and basic text generation with good hallucination control, particularly in multilingual classification tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.051
Completion $0.34

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | meta-llama/llama-3.2-3b-instruct 131K $0.051 / 1M tokens $0.34 / 1M tokens
Lambda
Lambda | meta-llama/llama-3.2-3b-instruct 131K $0.051 / 1M tokens $0.34 / 1M tokens
InferenceNet
InferenceNet | meta-llama/llama-3.2-3b-instruct 16K $0.051 / 1M tokens $0.34 / 1M tokens
Novita
Novita | meta-llama/llama-3.2-3b-instruct 32K $0.051 / 1M tokens $0.34 / 1M tokens
Cloudflare
Cloudflare | meta-llama/llama-3.2-3b-instruct 80K $0.051 / 1M tokens $0.34 / 1M tokens
Together
Together | meta-llama/llama-3.2-3b-instruct 131K $0.051 / 1M tokens $0.34 / 1M tokens
SambaNova
SambaNova | meta-llama/llama-3.2-3b-instruct 4K $0.051 / 1M tokens $0.34 / 1M tokens
Hyperbolic
Hyperbolic | meta-llama/llama-3.2-3b-instruct 131K $0.051 / 1M tokens $0.34 / 1M tokens
Nineteen
Nineteen | meta-llama/llama-3.2-3b-instruct 20K $0.051 / 1M tokens $0.34 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by meta-llama