StepFun: Step3

Image input Text input Text output
Author's Description

Step3 is a cutting-edge multimodal reasoning model—built on a Mixture-of-Experts architecture with 321B total parameters and 38B active. It is designed end-to-end to minimize decoding costs while delivering top-tier performance in vision–language reasoning. Through the co-design of Multi-Matrix Factorization Attention (MFA) and Attention-FFN Disaggregation (AFD), Step3 maintains exceptional efficiency across both flagship and low-end accelerators.

Key Specifications
Cost
$$$$$
Context
65K
Parameters
321B (Rumoured)
Released
Aug 28, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Reasoning Structured Outputs Response Format Frequency Penalty Temperature Top P Tool Choice Tools Include Reasoning
Features

This model supports the following features:

Response Format Tools Reasoning Structured Outputs
Performance Summary

Step3, a multimodal reasoning model from stepfun-ai, demonstrates exceptional efficiency and performance, particularly in its speed and cost-effectiveness. Leveraging a Mixture-of-Experts architecture with 321B total parameters and 38B active, and innovations like Multi-Matrix Factorization Attention (MFA) and Attention-FFN Disaggregation (AFD), Step3 consistently ranks among the fastest models and offers highly competitive pricing across all benchmarks. Its reliability is also outstanding, boasting a 99% success rate. In terms of specific benchmark performance, Step3 achieves perfect accuracy in General Knowledge, making it the most accurate model at its price point and among models of comparable speed. It also shows strong capabilities in Coding (94.9% accuracy, 93rd percentile) and Reasoning (93.5% accuracy, 83rd percentile). While its Hallucinations score of 86.0% is respectable, it indicates some room for improvement in acknowledging uncertainty. A notable weakness is its 0.0% accuracy in Instruction Following, suggesting this area requires significant development. Email Classification and Ethics benchmarks show solid, though not top-tier, performance at 96.0% and 99.0% accuracy respectively. Overall, Step3 excels in knowledge-intensive and complex reasoning tasks, offering a compelling balance of speed, cost, and accuracy, despite a clear deficiency in instruction following.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.57
Completion $1.42

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
SiliconFlow
SiliconFlow | stepfun-ai/step3 65K $0.57 / 1M tokens $1.42 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration