Author's Description
Phi-4-reasoning-plus is an enhanced 14B parameter model from Microsoft, fine-tuned from Phi-4 with additional reinforcement learning to boost accuracy on math, science, and code reasoning tasks. It uses the same dense decoder-only transformer architecture as Phi-4, but generates longer, more comprehensive outputs structured into a step-by-step reasoning trace and final answer. While it offers improved benchmark scores over Phi-4-reasoning across tasks like AIME, OmniMath, and HumanEvalPlus, its responses are typically ~50% longer, resulting in higher latency. Designed for English-only applications, it is well-suited for structured reasoning workflows where output quality takes priority over response speed.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Microsoft's Phi 4 Reasoning Plus, an enhanced 14B parameter model, demonstrates moderate speed performance, ranking in the 27th percentile across benchmarks. It offers competitive pricing, placing in the 47th percentile. Notably, the model exhibits exceptional reliability with a 99% success rate, indicating minimal technical failures. In terms of benchmark performance, Phi 4 Reasoning Plus shows a mixed profile. It excels in Ethics, achieving perfect 100% accuracy, making it the most accurate model at its price point and among models of similar speed. Its Coding performance is strong at 89.0% accuracy, placing it in the 69th percentile. General Knowledge is solid at 95.5% accuracy. However, the model struggles significantly with Instruction Following, achieving only 12.1% accuracy, and shows lower accuracy in Email Classification at 88.0%. Its key strength lies in its robust reasoning capabilities, particularly in ethical scenarios and coding, while its primary weakness is its limited instruction following precision. Designed for English-only structured reasoning workflows, its longer, more comprehensive outputs contribute to higher latency but prioritize output quality.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.07 |
Completion | $0.35 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra
|
DeepInfra | microsoft/phi-4-reasoning-plus-04-30 | 32K | $0.07 / 1M tokens | $0.35 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by microsoft
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Microsoft: MAI DS R1 | Apr 20, 2025 | — | 163K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$ |
Microsoft: Phi 4 Multimodal Instruct | Mar 07, 2025 | ~5.6B | 131K |
Text input
Image input
Text output
|
★★ | ★★ | $$ |
Microsoft: Phi 4 | Jan 09, 2025 | ~14B | 16K |
Text input
Text output
|
★★★★ | ★★★★ | $$ |
Microsoft: Phi-3.5 Mini 128K Instruct | Aug 20, 2024 | ~3.8B | 128K |
Text input
Text output
|
★ | ★★ | $$ |
Microsoft: Phi-3 Mini 128K Instruct | May 25, 2024 | ~3.8B | 128K |
Text input
Text output
|
★★★ | ★★ | $$ |
Microsoft: Phi-3 Medium 128K Instruct | May 23, 2024 | ~14B | 128K |
Text input
Text output
|
★★ | ★ | $$$$ |
WizardLM-2 8x22B | Apr 15, 2024 | 22B | 65K |
Text input
Text output
|
★★★ | ★★ | $$$ |