StepFun: Step 3.5 Flash

Text input Text output
Author's Description

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token....

Key Specifications
Cost
$$$
Context
256K
Parameters
196B (Rumoured)
Released
Jan 29, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top P Reasoning Include Reasoning Temperature Tools Max Tokens Frequency Penalty Stop
Features

This model supports the following features:

Tools Reasoning
Performance Summary

Step 3.5 Flash, StepFun's open-source foundation model, demonstrates a strong overall performance profile, particularly excelling in cost-efficiency and reliability. It consistently offers among the most competitive pricing, ranking in the 81st percentile across benchmarks. The model also exhibits exceptional reliability with a 99% success rate, indicating minimal technical failures and consistent response delivery. In terms of speed, Step 3.5 Flash generally performs well, placing in the 62nd percentile. A key strength of Step 3.5 Flash is its perfect accuracy across a wide range of critical benchmarks, including Hallucinations, Instruction Following, General Knowledge, Email Classification, Reasoning, Ethics, and Mathematics. This indicates a highly capable and precise model for tasks requiring factual recall, logical deduction, and adherence to complex instructions. Notably, it achieves perfect accuracy in several categories while also being the most accurate model at its price point and among models of similar speed. While its Coding performance is strong (94.9% and 91.9% accuracy), one Coding benchmark showed a significantly longer duration, suggesting potential variability in processing complex coding tasks. The model's sparse Mixture of Experts architecture appears to contribute to its speed efficiency, even with long contexts.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.3
Input Cache Read $0.02

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
StepFun
StepFun | stepfun/step-3.5-flash 256K $0.1 / 1M tokens $0.3 / 1M tokens
SiliconFlow
SiliconFlow | stepfun/step-3.5-flash 262K $0.1 / 1M tokens $0.3 / 1M tokens
DeepInfra
DeepInfra | stepfun/step-3.5-flash 262K $0.1 / 1M tokens $0.3 / 1M tokens
Parasail
Parasail | stepfun/step-3.5-flash 262K $0.1 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration