AllenAI: Olmo 3 32B Think

Text input Text output
Author's Description

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and...

Key Specifications
Cost
$$$$
Context
65K
Parameters
32B
Released
Nov 21, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Structured Outputs Top P Response Format Reasoning Temperature Stop Presence Penalty Include Reasoning Max Tokens Logit Bias
Features

This model supports the following features:

Structured Outputs Response Format Reasoning
Performance Summary

AllenAI: Olmo 3 32B Think, a 32-billion-parameter model designed for deep reasoning, demonstrates strong overall performance. It consistently ranks among the fastest models and offers highly competitive pricing. The model exhibits exceptional reliability with a 97% success rate across benchmarks, indicating minimal technical failures. In terms of specific capabilities, Olmo 3 32B Think excels in Coding (92% accuracy, 75th percentile) and General Knowledge (99% accuracy, 62nd percentile), showcasing its broad understanding and programming proficiency. Its Reasoning capabilities are also robust at 88% accuracy (70th percentile). While its Mathematics performance is solid at 89% accuracy, it incurs a higher cost and duration. A notable weakness is its Instruction Following, where it scored 0% accuracy, suggesting a significant area for improvement. Additionally, its performance on the Hallucinations benchmark, at 78% accuracy, indicates room for improvement in acknowledging uncertainty.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.15
Completion $0.5

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | allenai/olmo-3-32b-think-20251121 65K $0.15 / 1M tokens $0.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by allenai