AllenAI: Olmo 3.1 32B Instruct

Text input Text output
Author's Description

Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this...

Key Specifications
Cost
$$
Context
65K
Parameters
32B
Released
Jan 06, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Response Format Tool Choice Structured Outputs Top P Seed Temperature Tools Stop Max Tokens Min P Frequency Penalty Presence Penalty
Features

This model supports the following features:

Structured Outputs Tools Response Format
Performance Summary

AllenAI's Olmo 3.1 32B Instruct demonstrates a strong commitment to reliability, achieving a perfect 100% success rate across all benchmarks, indicating exceptional stability and consistent response delivery. While it tends to have longer response times, ranking in the 18th percentile for speed, it offers cost-effective solutions, placing in the 77th percentile for price. The model exhibits notable strengths in acknowledging uncertainty, achieving 95.7% accuracy on hallucination tests, and performs well in ethics (97.9%) and general knowledge (94.9%). However, its performance in these knowledge-based categories is not top-tier, ranking in the 53rd, 33rd, and 36th percentiles respectively. Instruction following (55.6% accuracy) and reasoning (50.0% accuracy) represent key areas for improvement, with the model ranking in the lower percentiles for these critical capabilities. Coding (79.4%) and mathematics (82.4%) show moderate performance. Overall, Olmo 3.1 32B Instruct is a highly reliable and cost-efficient model, particularly adept at avoiding hallucinations, but its speed and advanced reasoning capabilities could be enhanced.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.2
Completion $0.6

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | allenai/olmo-3.1-32b-instruct-20251215 65K $0.2 / 1M tokens $0.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by allenai