AllenAI: Olmo 3.1 32B Instruct

Text input Text output
Author's Description

Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this variant emphasizes responsiveness to complex user directions and robust chat interactions while retaining strong capabilities on reasoning and coding benchmarks. Developed by Ai2 under the Apache 2.0 license, Olmo 3.1 32B Instruct reflects the Olmo initiative’s commitment to openness and transparency.

Key Specifications
Cost
$$
Context
65K
Parameters
32B
Released
Jan 06, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top P Seed Tool Choice Min P Temperature Stop Structured Outputs Response Format Max Tokens Tools Presence Penalty Frequency Penalty
Features

This model supports the following features:

Structured Outputs Response Format Tools
Performance Summary

AllenAI: Olmo 3.1 32B Instruct, a 32-billion-parameter instruction-tuned model, demonstrates exceptional reliability with a 100% success rate across all benchmarks, indicating consistent and usable responses. While its speed performance tends to be slower, ranking in the 16th percentile, it offers cost-effective solutions, placing in the 75th percentile for pricing. The model exhibits strong performance in acknowledging uncertainty, achieving 95.7% accuracy on Hallucinations, and solid results in General Knowledge (94.9%) and Email Classification (95.0%). Its ethical reasoning is also commendable at 97.9%. However, its performance in more complex cognitive tasks like Reasoning (50.0%) and Instruction Following (55.6%) is a notable weakness, suggesting areas for improvement in handling intricate multi-step directives. Coding (79.4%) and Mathematics (82.4%) show moderate accuracy. Overall, Olmo 3.1 32B Instruct is a highly reliable and cost-efficient model, particularly strong in knowledge recall and ethical considerations, but with room for growth in advanced reasoning and instruction adherence.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.2
Completion $0.6

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | allenai/olmo-3.1-32b-instruct-20251215 65K $0.2 / 1M tokens $0.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by allenai