Arcee AI: Maestro Reasoning

Text input Text output
Author's Description

Maestro Reasoning is Arcee's flagship analysis model: a 32 B‑parameter derivative of Qwen 2.5‑32 B tuned with DPO and chain‑of‑thought RL for step‑by‑step logic. Compared to the earlier 7 B preview, the production 32 B release widens the context window to 128 k tokens and doubles pass‑rate on MATH and GSM‑8K, while also lifting code completion accuracy. Its instruction style encourages structured "thought → answer" traces that can be parsed or hidden according to user preference. That transparency pairs well with audit‑focused industries like finance or healthcare where seeing the reasoning path matters. In Arcee Conductor, Maestro is automatically selected for complex, multi‑constraint queries that smaller SLMs bounce.

Key Specifications
Cost
$$$$$
Context
131K
Parameters
32B (Rumoured)
Released
May 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Logit Bias Top P Temperature Min P Frequency Penalty Max Tokens
Performance Summary

Arcee AI's Maestro Reasoning model, a 32B-parameter derivative of Qwen 2.5-32B, demonstrates exceptional performance across several key metrics. It consistently ranks among the fastest models and offers highly competitive pricing, positioning it as a cost-effective and efficient solution. Furthermore, its reliability is outstanding, achieving the 100th percentile across six benchmarks, indicating minimal technical failures and consistent response delivery. In terms of benchmark performance, Maestro Reasoning excels in critical areas. It achieved perfect 100% accuracy in Email Classification, Reasoning, and Ethics, often being the most accurate model at its price point and speed. Its General Knowledge is also very strong at 99.3% accuracy. While its Coding (Baseline) performance is respectable at 92.0% accuracy, its Instruction Following (Baseline) benchmark shows a significant weakness with 0.0% accuracy, suggesting a potential area for improvement in handling complex, multi-step instructions. The model's design, emphasizing structured "thought → answer" traces, makes it particularly well-suited for audit-focused industries where transparency of reasoning is paramount.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.9
Completion $3.3

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Together
Together | arcee-ai/maestro-reasoning 131K $0.9 / 1M tokens $3.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by arcee-ai