AllenAI: Olmo 2 32B Instruct

Text input Text output
Author's Description

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K, MATH, IFEval, and general NLP evaluation. Developed by AI2, OLMo-2 32B is part of an open, research-oriented initiative, trained primarily on English-language datasets to advance the understanding and development of open-source language models.

Key Specifications
Cost
$$$$
Context
4K
Parameters
32B
Released
Mar 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Temperature Frequency Penalty Stop Seed Logit Bias Presence Penalty Max Tokens Top P Min P
Performance Summary

AllenAI's OLMo-2 32B Instruct, created in March 2025, demonstrates a strong overall performance profile, particularly excelling in speed and reliability. It performs among the fastest models, ranking in the top tier for speed (76th percentile), and exhibits exceptional reliability with a 95% success rate, indicating minimal technical failures. Pricing is moderate, positioned at the 38th percentile. In terms of specific benchmarks, the model shows solid performance in General Knowledge (97.5% accuracy) and Ethics (98.0% accuracy), suggesting a robust understanding of factual information and ethical principles. Its Email Classification accuracy of 97.0% highlights its capability in structured classification tasks. However, the model exhibits notable weaknesses in more complex cognitive areas. Instruction Following accuracy is moderate at 45.0%, and Reasoning and Coding benchmarks show lower performance at 36.0% and 53.0% accuracy respectively, placing it in the lower percentiles for these categories. This suggests that while it handles general knowledge and classification well, there is room for improvement in advanced problem-solving, multi-step instruction execution, and complex logical deduction.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1
Completion $1.5

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | allenai/olmo-2-0325-32b-instruct 4K $1 / 1M tokens $1.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by allenai