加载中...

StepFun Flash 3.5

Name: StepFun Flash 3.5
Availability: InStock
Author: StepFunAI

Release date: 2026-02-02知识截止: 2025-1219

Live demoGitHub Hugging Face

Parameters

1960.0亿

Context length

256K

Chinese support

Supported

Reasoning ability

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Model basics

Reasoning traces

Supported

Context length

256K tokens

Max output length

16384 tokens

Model type

聊天大模型

Release date

2026-02-02

Model file size

38GB

MoE architecture

Yes

Total params / Active params

1960.0B / 110B

Knowledge cutoff

2025-12

Inference modes

常规模式（Non-Thinking Mode）思考模式（Thinking Mode）

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0- 免费商用授权

GitHub repo

https://github.com/stepfun-ai/Step-3.5-Flash

Hugging Face

https://huggingface.co/stepfun-ai/Step-3.5-Flash

Live demo

No live demo

Official resources

Paper

Step 3.5 Flash: The Open Source 'Light Cavalry' for Agents

DataLearnerAI blog

No blog post yet

API details

API speed

5/5

💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.

Standard pricingStandard

Modality	Input	Output
Text	0.0	0.0

Cached pricingCache

Modality	Input cache	Output cache
Text	0.0	--

Extended context pricingExtended

Modality	Input	Output
Text	0.0	--

Benchmark Scores

综合评估

2 evaluations

Benchmark / mode

Score

Rank/total

ARC-AGIthinking + 使用工具

56.50

15 / 38

ARC-AGIthinking

53.50

17 / 38

编程与软件工程

2 evaluations

Benchmark / mode

Score

Rank/total

LiveCodeBenchthinking

86.40

4 / 102

SWE-bench Verifiedthinking

74.40

16 / 78

数学推理

4 evaluations

Benchmark / mode

Score

Rank/total

AIME2025thinking + 使用工具

99.80

6 / 104

AIME2025thinking

97.30

17 / 104

IMO-AnswerBenchthinking + 使用工具

86.70

1 / 4

IMO-AnswerBenchthinking

85.40

2 / 4

Agent能力评测

1 evaluations

Benchmark / mode

Score

Rank/total

τ²-Benchthinking + 使用工具

88.20

2 / 33

AI Agent - 信息收集

1 evaluations

Benchmark / mode

Score

Rank/total

BrowseCompthinking + 使用工具

5 / 19

AI Agent - 工具使用

1 evaluations

Benchmark / mode

Score

Rank/total

Terminal Bench 2.0thinking + 使用工具

3 / 11

Publisher

StepFunAI

View publisher details

Model Overview

StepFun Flash 3.5 是阶跃星辰（StepFun）于 2026 年 2 月最新发布的开源“轻量级”旗舰模型，专为智能体（Agent）场景与高并发应用设计。该模型在保持极低推理成本和极高响应速度的同时，具备比肩闭源大模型的逻辑推理与代码生成能力。

1. 模型简介与核心定位

发布机构：阶跃星辰 (StepFunAI)
发布时间：2026年2月2日
核心定位：该模型被称为 Agent 时代的“轻骑兵”，旨在解决传统大模型在复杂智能体交互中“速度慢”与“成本高”的痛点。它特别优化了指令遵循与工具调用能力，适合作为各类端侧应用与云端 Agent 的核心大脑。

2. 架构与技术规格

稀疏 MoE 架构：采用先进的混合专家（Mixture-of-Experts）设计，模型总参数量为 1960亿 (196B)，但单次推理的激活参数量仅为 110亿 (11B)。这种设计在保证知识储备的同时，极大降低了计算负载。
MTP-3 加速技术：引入了“多 Token 预测”（Multi-Token Prediction）技术，一次前向传播可预测 3 个 Token，使得推理速度最高可达 350 TPS，特别适合实时交互场景。
上下文窗口：原生支持 256K 超长上下文，结合滑动窗口与全局注意力机制，能够精准处理长文档分析与复杂的多轮对话历史。

3. 核心能力与支持模态

模态支持：支持文本输入/输出及代码生成（Text/Code）。虽为轻量版，但继承了 Step 系列的多模态理解基因，能高效处理结构化数据。
智能编程与可视化：在代码生成方面表现卓越，官方演示显示其仅凭自然语言描述即可编写并输出基于 WebGL 2.0 的高性能可视化程序。
复杂逻辑推理：在不依赖外部工具的情况下，能够快速完成等差数列求和、阶乘累加等高难度数学运算，逻辑性能显著优于同尺寸竞品。

4. 性能与基准评测

Agent 能力：在复杂的端云协同任务中，表现出极强的任务拆解能力。例如将模糊的用户需求（如“对比全网价格”）自动拆解为具体的搜索、抓取与汇总子任务。
基准对比：在数学逻辑与代码编写的基准测试中，其得分足以媲美当前主流的闭源大模型，同时保持了开源模型的高可控性。

5. 应用场景与限制

推荐用例：实时对话机器人、代码辅助助手、复杂 Agent 工作流编排、端侧设备（如高配个人工作站）本地部署。
部署优势：官方已针对 NVIDIA DGX 及 Apple M4 Max 等硬件进行了专门优化，降低了本地部署门槛。

6. 访问方式与许可

开源许可：模型权重已在 Hugging Face 与 GitHub 上开源（Apache 2.0 协议）。
API 访问：已上线 OpenRouter 及阶跃星辰开放平台，发布初期提供限免或低成本调用服务。

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送

加载中...

StepFun Flash 3.5

Release date: 2026-02-02知识截止: 2025-1219

Live demoGitHub Hugging Face

Parameters

1960.0亿

Context length

256K

Chinese support

Supported

Reasoning ability

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Model basics

Reasoning traces

Supported

Context length

256K tokens

Max output length

16384 tokens

Model type

聊天大模型

Release date

2026-02-02

Model file size

38GB

MoE architecture

Yes

Total params / Active params

1960.0B / 110B

Knowledge cutoff

2025-12

Inference modes

常规模式（Non-Thinking Mode）思考模式（Thinking Mode）

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0- 免费商用授权

GitHub repo

https://github.com/stepfun-ai/Step-3.5-Flash

Hugging Face

https://huggingface.co/stepfun-ai/Step-3.5-Flash

Live demo

No live demo

Official resources

Paper

Step 3.5 Flash: The Open Source 'Light Cavalry' for Agents

DataLearnerAI blog

No blog post yet

API details

API speed

5/5

💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.

Standard pricingStandard

Modality	Input	Output
Text	0.0	0.0

Cached pricingCache

Modality	Input cache	Output cache
Text	0.0	--

Extended context pricingExtended

Modality	Input	Output
Text	0.0	--

Benchmark Scores

综合评估

2 evaluations

Benchmark / mode

Score

Rank/total

ARC-AGIthinking + 使用工具

56.50

15 / 38

ARC-AGIthinking

53.50

17 / 38

编程与软件工程

2 evaluations

Benchmark / mode

Score

Rank/total

LiveCodeBenchthinking

86.40

4 / 102

SWE-bench Verifiedthinking

74.40

16 / 78

数学推理

4 evaluations

Benchmark / mode

Score

Rank/total

AIME2025thinking + 使用工具

99.80

6 / 104

AIME2025thinking

97.30

17 / 104

IMO-AnswerBenchthinking + 使用工具

86.70

1 / 4

IMO-AnswerBenchthinking

85.40

2 / 4

Agent能力评测

1 evaluations

Benchmark / mode

Score

Rank/total

τ²-Benchthinking + 使用工具

88.20

2 / 33

AI Agent - 信息收集

1 evaluations

Benchmark / mode

Score

Rank/total

BrowseCompthinking + 使用工具

5 / 19

AI Agent - 工具使用

1 evaluations

Benchmark / mode

Score

Rank/total

Terminal Bench 2.0thinking + 使用工具

3 / 11

Publisher

StepFunAI

View publisher details

Model Overview

1. 模型简介与核心定位

发布机构：阶跃星辰 (StepFunAI)
发布时间：2026年2月2日
核心定位：该模型被称为 Agent 时代的“轻骑兵”，旨在解决传统大模型在复杂智能体交互中“速度慢”与“成本高”的痛点。它特别优化了指令遵循与工具调用能力，适合作为各类端侧应用与云端 Agent 的核心大脑。

2. 架构与技术规格

稀疏 MoE 架构：采用先进的混合专家（Mixture-of-Experts）设计，模型总参数量为 1960亿 (196B)，但单次推理的激活参数量仅为 110亿 (11B)。这种设计在保证知识储备的同时，极大降低了计算负载。
MTP-3 加速技术：引入了“多 Token 预测”（Multi-Token Prediction）技术，一次前向传播可预测 3 个 Token，使得推理速度最高可达 350 TPS，特别适合实时交互场景。
上下文窗口：原生支持 256K 超长上下文，结合滑动窗口与全局注意力机制，能够精准处理长文档分析与复杂的多轮对话历史。

3. 核心能力与支持模态

模态支持：支持文本输入/输出及代码生成（Text/Code）。虽为轻量版，但继承了 Step 系列的多模态理解基因，能高效处理结构化数据。
智能编程与可视化：在代码生成方面表现卓越，官方演示显示其仅凭自然语言描述即可编写并输出基于 WebGL 2.0 的高性能可视化程序。
复杂逻辑推理：在不依赖外部工具的情况下，能够快速完成等差数列求和、阶乘累加等高难度数学运算，逻辑性能显著优于同尺寸竞品。

4. 性能与基准评测

Agent 能力：在复杂的端云协同任务中，表现出极强的任务拆解能力。例如将模糊的用户需求（如“对比全网价格”）自动拆解为具体的搜索、抓取与汇总子任务。
基准对比：在数学逻辑与代码编写的基准测试中，其得分足以媲美当前主流的闭源大模型，同时保持了开源模型的高可控性。

5. 应用场景与限制

推荐用例：实时对话机器人、代码辅助助手、复杂 Agent 工作流编排、端侧设备（如高配个人工作站）本地部署。
部署优势：官方已针对 NVIDIA DGX 及 Apple M4 Max 等硬件进行了专门优化，降低了本地部署门槛。

6. 访问方式与许可

开源许可：模型权重已在 Hugging Face 与 GitHub 上开源（Apache 2.0 协议）。
API 访问：已上线 OpenRouter 及阶跃星辰开放平台，发布初期提供限免或低成本调用服务。

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送