Claude Fable 5 Benchmark Details

Claude Fable 5 currently shows benchmark results led by Simple Bench (1 / 63, score 81.90), SWE-bench Verified (2 / 112, score 95), SWE-Bench Pro - Public (1 / 54, score 80.30). This page also compares it with 3 competitor models, including performance and pricing views when available.

Benchmark Results

Claude Fable 5

Benchmark Results

General Knowledge

3 evaluations

Benchmark / mode

Score

Rank/total

LiveBench

Thinking Level · High

75.47

12 / 115

LiveBench

Deep Thinking Mode

78.31

5 / 115

HLE

Deep Thinking Mode

4 / 172

Coding and Software Engineer

4 evaluations

Benchmark / mode

Score

Rank/total

SWE-bench Verified

Thinking Level · HighTools

2 / 112

SWE-bench Verified

Deep Thinking ModeTools

2 / 112

SWE-Bench Pro - Public

Deep Thinking ModeTools

80.30

1 / 54

DeepSWE

Deep Thinking ModeTools

2 / 19

Common Sense Reasoning

1 evaluations

Benchmark / mode

Score

Rank/total

Simple Bench

Standard Mode

81.90

1 / 63

AI Agent - Tool Usage

4 evaluations

Benchmark / mode

Score

Rank/total

TerminalBench 2.1

Thinking Level · HighTools

3 / 27

TerminalBench 2.1

Deep Thinking ModeTools

3 / 27

OSWorld-Verified

Thinking Level · HighTools

1 / 24

MCP-Atlas

Standard ModeTools

83.30

4 / 27

Compare with other models

Competitor Comparison

Benchmark scores for Claude Fable 5 compared against top models in its class

Claude Fable 5GPT-5.5 Gemini 3.1 Pro Preview DeepSeek-V4-Pro

Benchmark categories:

The chart shows each model’s highest score per benchmark within the current filter. Out-of-100 benchmarks use raw heights; out-of-range benchmarks are scaled within that benchmark while labels keep the original scores.

9 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

Benchmark	Claude Fable 5Current	GPT-5.5	Gemini 3.1 Pro Preview	DeepSeek-V4-Pro
HLE 综合评估	59.00Deep Thinking Mode	52.20Thinking Level · High ｜ Tools	51.40Thinking Level · High ｜ Tools	48.20Thinking Level · Extra High ｜ Tools
LiveBench 综合评估	78.31Deep Thinking Mode	80.71Deep Thinking Mode	79.93Thinking Level · High	73.58Standard Mode
DeepSWE 编程与软件工程	70.00Deep Thinking Mode ｜ Tools	67.00Thinking Level · Extra High ｜ Tools	12.00Thinking Level · High ｜ Tools	--
SWE-Bench Pro - Public 编程与软件工程	80.30Deep Thinking Mode ｜ Tools	58.60Thinking Level · High ｜ Tools	54.20Thinking Level · High ｜ Tools	55.40Thinking Level · Extra High ｜ Tools
SWE-bench Verified 编程与软件工程	95.00Thinking Level · High ｜ Tools	--	80.60Thinking Level · High ｜ Tools	80.60Thinking Level · Extra High ｜ Tools
Simple Bench 常识推理	81.90Standard Mode	69.00Standard Mode	79.60Standard Mode	50.90Standard Mode
MCP-Atlas AI Agent - 工具使用	83.30Standard Mode ｜ Tools	75.30Thinking Level · Extra High ｜ Tools	78.20Thinking Level · High ｜ Tools	--
OSWorld-Verified AI Agent - 工具使用	85.00Thinking Level · High ｜ Tools	78.70Thinking Level · High ｜ Tools	76.20Thinking Enabled ｜ Tools	--
TerminalBench 2.1 AI Agent - 工具使用	88.00Thinking Level · High ｜ Tools	83.40Thinking Level · High ｜ Tools	73.80Thinking Level · High ｜ Tools	--

Standard API Pricing: Claude Fable 5 vs. Peer Models

Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.

Source: DataLearnerAI. Standard text prices shown here use the default supplier. · USD / 1M tokens

When a context threshold exists, the charted base price only applies within these limits:

Gemini 3.1 Pro Preview: Base price applies to <= 200K

Model	Supplier	Standard input	Standard output	Base price applies to
Claude Fable 5	Anthropic	$10 / 1M tokens	$50 / 1M tokens	—
GPT-5.5	OpenAI	$5 / 1M tokens	$30 / 1M tokens	—
Gemini 3.1 Pro Preview	Google Deep Mind	$2 / 1M tokens	$12 / 1M tokens	<= 200K
DeepSeek-V4-Pro	DeepSeek-AI	$0.435 / 1M tokens	$0.87 / 1M tokens	—