AllenAI: Olmo 2 32B Instruct

Text input Text output
Author's Description

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K, MATH, IFEval, and general NLP evaluation. Developed by AI2, OLMo-2 32B is part of an open, research-oriented initiative, trained primarily on English-language datasets to advance the understanding and development of open-source language models.

Key Specifications
Cost
$$$$
Context
4K
Parameters
32B
Released
Mar 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Min P Stop Presence Penalty Logit Bias Seed Frequency Penalty Temperature Top P Max Tokens
Performance Summary

AllenAI's OLMo 2 32B Instruct model demonstrates strong overall performance, particularly excelling in reliability with a 96% success rate, indicating consistent and usable responses. The model performs among the fastest models, ranking in the 72nd percentile for speed, and offers competitive pricing, placing in the 38th percentile. In terms of specific benchmarks, OLMo 2 32B Instruct shows notable strengths in handling fictional concepts, achieving 98.0% accuracy in Hallucinations (Baseline), suggesting a good ability to acknowledge uncertainty. It also performs well in General Knowledge (97.5% accuracy) and Email Classification (97.0% accuracy). However, the model exhibits weaknesses in more complex cognitive tasks. Its performance in Reasoning (32.0% accuracy), Mathematics (58.0% accuracy), and Coding (53.0% accuracy) is below average, indicating areas for potential improvement in advanced problem-solving and domain-specific expertise. Instruction Following also presents a challenge, with 45.0% accuracy.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.2
Completion $0.35

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | allenai/olmo-2-0325-32b-instruct 4K $0.2 / 1M tokens $0.35 / 1M tokens
Cirrascale
Cirrascale | allenai/olmo-2-0325-32b-instruct 128K $0.05 / 1M tokens $0.2 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by allenai