AllenAI: Olmo 2 32B Instruct

Text input Text output
Author's Description

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...

Key Specifications
Cost
$$$$
Context
4K
Parameters
32B
Released
Mar 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Top P Min P Temperature Stop Presence Penalty Max Tokens Logit Bias
Performance Summary

AllenAI's OLMo-2 32B Instruct model demonstrates strong overall performance, particularly excelling in reliability and speed. It performs among the fastest models, ranking in the 73rd percentile for speed across benchmarks, and offers competitive pricing, placing in the 43rd percentile. The model exhibits exceptional reliability with a 96% success rate, indicating consistent and stable operation. In terms of specific benchmarks, OLMo-2 32B Instruct shows a notable strength in mitigating hallucinations, achieving 98.0% accuracy, and performs well in General Knowledge (97.5%) and Ethics (98.0%). It also demonstrates solid performance in Email Classification (97.0%). However, the model shows weaknesses in more complex cognitive tasks such as Mathematics (58.0%), Instruction Following (45.0%), Reasoning (32.0%), and Coding (53.0%), where its accuracy falls into lower percentiles. These areas suggest opportunities for further development to enhance its capabilities in advanced problem-solving and multi-step instruction execution.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.05
Completion $0.2

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | allenai/olmo-2-0325-32b-instruct 4K $0.05 / 1M tokens $0.2 / 1M tokens
Cirrascale
Cirrascale | allenai/olmo-2-0325-32b-instruct 128K $0.05 / 1M tokens $0.2 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by allenai