Author's Description
AI21 Jamba Large 1.6 is a high-performance hybrid foundation model combining State Space Models (Mamba) with Transformer attention mechanisms. Developed by AI21, it excels in extremely long-context handling (256K tokens), demonstrates superior inference efficiency (up to 2.5x faster than comparable models), and supports structured JSON output and tool-use capabilities. It has 94 billion active parameters (398 billion total), optimized quantization support (ExpertsInt8), and multilingual proficiency in languages such as English, Spanish, French, Portuguese, Italian, Dutch, German, Arabic, and Hebrew. Usage of this model is subject to the [Jamba Open Model License](https://www.ai21.com/licenses/jamba-open-model-license).
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
AI21 Jamba 1.6 Large demonstrates exceptional performance in terms of speed and cost-efficiency, consistently ranking among the fastest models and offering highly competitive pricing across various benchmarks. This hybrid foundation model, combining State Space Models and Transformer attention, is particularly notable for its extremely long context handling of 256K tokens and superior inference efficiency. In terms of specific benchmark results, Jamba 1.6 Large excels significantly in Email Classification, achieving 99.0% accuracy, placing it in the 78th percentile. This indicates a strong capability in understanding context and purpose for categorization tasks. However, the model shows considerable weaknesses in General Knowledge and Ethics, both scoring 0.0% accuracy, and in Coding, with only 1.0% accuracy. Its Instruction Following capability is moderate at 34.0% accuracy. While its cost and duration metrics for these benchmarks are generally competitive, the accuracy scores suggest that its current strengths lie more in specific classification tasks rather than broad knowledge recall, ethical reasoning, or complex coding challenges. The model's multilingual proficiency and support for structured JSON output and tool-use are valuable features for practical applications.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $2 |
| Completion | $8 |
Price History
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
|
AI21
|
AI21 | ai21/jamba-1.6-large | 256K | $2 / 1M tokens | $8 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|
Other Models by ai21
|
|
Released | Params | Context |
|
Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| AI21: Jamba Mini 1.7 Unavailable | Aug 08, 2025 | — | 256K |
Text input
Text output
|
★★★★★ | ★ | $$ |
| AI21: Jamba Large 1.7 | Aug 08, 2025 | — | 256K |
Text input
Text output
|
★★★★ | ★★ | $$$$$ |
| AI21: Jamba Mini 1.6 Unavailable | Mar 13, 2025 | ~52B | 256K |
Text input
Text output
|
★★★★★ | ★ | $$ |