AllenAI: Olmo 3 32B Think

Text input Text output
Author's Description

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and highly nuanced conversational reasoning. Developed by Ai2 under the Apache 2.0 license, Olmo 3 32B Think embodies the Olmo initiative’s commitment to openness, offering full transparency across weights, code and training methodology.

Key Specifications
Cost
$$$$
Context
65K
Parameters
32B
Released
Nov 21, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Reasoning Structured Outputs Frequency Penalty Include Reasoning Top P Seed Min P Temperature Stop Response Format Max Tokens Presence Penalty Logit Bias
Features

This model supports the following features:

Structured Outputs Response Format Reasoning
Performance Summary

AllenAI's Olmo 3 32B Think, a 32-billion-parameter model designed for deep reasoning and complex instruction following, demonstrates a strong overall performance profile. While its speed ranking indicates it tends to have longer response times, placing it in the 5th percentile, its pricing is moderate, falling within the 33rd percentile. A standout feature is its exceptional reliability, boasting a 97% success rate across benchmarks, signifying minimal technical failures and consistent evaluable responses. The model exhibits particular strength in coding and reasoning tasks, achieving 92.0% and 88.0% accuracy respectively, placing it in the 78th and 76th percentiles. It also performs well in General Knowledge with 99.0% accuracy (66th percentile). However, a notable weakness is its performance on the Hallucinations benchmark, where it achieved only 78.0% accuracy (23rd percentile), suggesting room for improvement in acknowledging uncertainty. Email Classification and Ethics benchmarks show solid, though not top-tier, performance at 97.0% and 98.0% accuracy respectively. Despite its slower response times, Olmo 3 32B Think's robust accuracy in complex domains and high reliability make it a compelling option for demanding AI applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.15
Completion $0.5

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | allenai/olmo-3-32b-think-20251121 65K $0.15 / 1M tokens $0.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by allenai