Microsoft: Phi 4

Text input Text output
Author's Description

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion parameters, it was trained on a mix of high-quality synthetic datasets, data from curated websites, and academic materials. It has undergone careful improvement to follow instructions accurately and maintain strong safety standards. It works best with English language inputs. For more information, please see [Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905)

Key Specifications
Cost
$$
Context
16K
Parameters
14B (Rumoured)
Released
Jan 09, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Top P Temperature Seed Min P Response Format Frequency Penalty Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Microsoft Phi-4, a 14-billion parameter model created on January 9, 2025, demonstrates strong overall performance, particularly in efficiency and reliability. It consistently performs among the fastest models, ranking in the 68th percentile for speed across benchmarks. Furthermore, Phi-4 offers highly competitive pricing, placing in the 85th percentile. Its reliability is exceptional, achieving a perfect 100th percentile, indicating minimal technical failures and consistent response delivery. In specific benchmarks, Phi-4 exhibits varied strengths. It achieved perfect accuracy in the Ethics (Baseline) benchmark, showcasing its robust ethical reasoning capabilities and standing out as the most accurate model at its price point and speed. Its performance in General Knowledge was strong at 96.8% accuracy, while its Reasoning (Baseline) accuracy was moderate at 68.0%. The Email Classification (Baseline) task was a notable weakness, with 94.0% accuracy placing it in the lower 32nd percentile for that specific task. Overall, Phi-4 excels in complex reasoning and ethical applications, offering a highly reliable and cost-effective solution, though its email classification accuracy could be improved.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.07
Completion $0.14

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | microsoft/phi-4 16K $0.07 / 1M tokens $0.14 / 1M tokens
Nebius
Nebius | microsoft/phi-4 16K $0.1 / 1M tokens $0.3 / 1M tokens
NextBit
NextBit | microsoft/phi-4 16K $0.06 / 1M tokens $0.14 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by microsoft