StepFun: Step 3.5 Flash

Name: StepFun: Step 3.5 Flash
Brand: stepfun
Price: 9e-8 USD
Availability: InStock
Rating: 4.9 (11 reviews)

Back

Text input Text output

Author's Description

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token....

Key Specifications

Cost

$$$

Context

256K

Parameters

196B (Rumoured)

Released

Jan 29, 2026

Speed

★★★

Ability

★★★★★

Reliability

★★★

Hugging Face

Supported Parameters

This model supports the following parameters:

Frequency Penalty Include Reasoning Reasoning Temperature Max Tokens Tools Stop Top P

Features

This model supports the following features:

Tools Reasoning

Performance Summary

Step 3.5 Flash, StepFun's open-source foundation model, demonstrates a strong overall performance profile, particularly excelling in cost-efficiency and reliability. It consistently offers among the most competitive pricing, ranking in the 81st percentile across benchmarks. The model also exhibits exceptional reliability with a 99% success rate, indicating minimal technical failures and consistent response delivery. In terms of speed, Step 3.5 Flash generally performs well, placing in the 62nd percentile. A key strength of Step 3.5 Flash is its perfect accuracy across a wide range of critical benchmarks, including Hallucinations, Instruction Following, General Knowledge, Email Classification, Reasoning, Ethics, and Mathematics. This indicates a highly capable and precise model for tasks requiring factual recall, logical deduction, and adherence to complex instructions. Notably, it achieves perfect accuracy in several categories while also being the most accurate model at its price point and among models of similar speed. While its Coding performance is strong (94.9% and 91.9% accuracy), one Coding benchmark showed a significantly longer duration, suggesting potential variability in processing complex coding tasks. The model's sparse Mixture of Experts architecture appears to contribute to its speed efficiency, even with long contexts.

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.09
Completion	$0.3
Input Cache Read	$0.02

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
StepFun	StepFun \| stepfun/step-3.5-flash	256K	$0.09 / 1M tokens	$0.3 / 1M tokens
SiliconFlow	SiliconFlow \| stepfun/step-3.5-flash	262K	$0.1 / 1M tokens	$0.3 / 1M tokens
DeepInfra	DeepInfra \| stepfun/step-3.5-flash	262K	$0.09 / 1M tokens	$0.3 / 1M tokens
Parasail	Parasail \| stepfun/step-3.5-flash	262K	$0.09 / 1M tokens	$0.3 / 1M tokens
Ambient	Ambient \| stepfun/step-3.5-flash	262K	$0.09 / 1M tokens	$0.3 / 1M tokens

Benchmark Results

Benchmark	Category	Reasoning	Strategy	Free	Executions	Accuracy	Cost	Duration

Other Models by stepfun

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
StepFun: Step 3.7 Flash Unavailable	May 28, 2026	~196B	256K	Image input Video input Text input Text output	★★★★★	★★	$$
StepFun: Step 3.7 Flash	May 28, 2026	~196B	256K	Image input Video input Text input Text output	★★★★	★★★★★	$$$