StepFun: Step 3.5 Flash

Text input Text output Free Option
Author's Description

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token. It is a reasoning model that is incredibly speed efficient even at long contexts.

Key Specifications
Cost
$$
Context
256K
Parameters
196B (Rumoured)
Released
Jan 29, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Temperature Max Tokens Top P Frequency Penalty Reasoning Include Reasoning Stop
Features

This model supports the following features:

Tools Reasoning
Performance Summary

Step 3.5 Flash, StepFun's open-source foundation model, demonstrates exceptional performance across multiple key metrics. Leveraging a sparse Mixture of Experts (MoE) architecture, it efficiently activates 11B of its 196B parameters per token, contributing to its impressive speed. The model consistently performs among the fastest models, ranking in the 69th percentile for speed across nine benchmarks. Furthermore, it offers highly competitive pricing, placing in the 86th percentile across eight benchmarks. Reliability is a significant strength, with a near-perfect 99% success rate, indicating minimal technical failures. In terms of benchmark results, Step 3.5 Flash achieved perfect 100% accuracy across eight out of nine evaluated categories, including Hallucinations, Instruction Following, Coding (first instance), General Knowledge, Email Classification, Reasoning, Ethics, and Mathematics. This consistent accuracy, often accompanied by top-tier speed and cost efficiency, highlights its robust capabilities in diverse tasks. Notably, it was the most accurate model at its price point and speed for several benchmarks. The second Coding benchmark, however, showed a slightly lower accuracy of 91.9% and a significantly longer duration, suggesting a potential area for optimization or a specific challenge within that particular test instance. Overall, Step 3.5 Flash stands out for its high accuracy, speed efficiency, and cost-effectiveness, making it a highly capable and reliable reasoning model.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.3
Input Cache Read $0.02

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
StepFun
StepFun | stepfun/step-3.5-flash 256K $0.1 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration