Author's Description
OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K, MATH, IFEval, and general NLP evaluation. Developed by AI2, OLMo-2 32B is part of an open, research-oriented initiative, trained primarily on English-language datasets to advance the understanding and development of open-source language models.
Key Specifications
Supported Parameters
This model supports the following parameters:
Performance Summary
AllenAI's OLMo 2 32B Instruct model demonstrates strong overall performance, particularly excelling in reliability with a 96% success rate, indicating consistent and usable responses. The model performs among the fastest models, ranking in the 72nd percentile for speed, and offers competitive pricing, placing in the 38th percentile. In terms of specific benchmarks, OLMo 2 32B Instruct shows notable strengths in handling fictional concepts, achieving 98.0% accuracy in Hallucinations (Baseline), suggesting a good ability to acknowledge uncertainty. It also performs well in General Knowledge (97.5% accuracy) and Email Classification (97.0% accuracy). However, the model exhibits weaknesses in more complex cognitive tasks. Its performance in Reasoning (32.0% accuracy), Mathematics (58.0% accuracy), and Coding (53.0% accuracy) is below average, indicating areas for potential improvement in advanced problem-solving and domain-specific expertise. Instruction Following also presents a challenge, with 45.0% accuracy.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.2 |
| Completion | $0.35 |
Price History
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
|
Parasail
|
Parasail | allenai/olmo-2-0325-32b-instruct | 4K | $0.2 / 1M tokens | $0.35 / 1M tokens |
|
Cirrascale
|
Cirrascale | allenai/olmo-2-0325-32b-instruct | 128K | $0.05 / 1M tokens | $0.2 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|
Other Models by allenai
|
|
Released | Params | Context |
|
Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| AllenAI: Molmo 7B D Unavailable | Mar 26, 2025 | 7B | 4K |
Image input
Text input
Text output
|
— | — | $$ |
| AllenAI: Molmo 7B D Unavailable | Mar 26, 2025 | 7B | 4K |
Image input
Text input
Text output
|
— | — | $$ |