Benchmark Results
Benchmark Results
Coding and Software Engineer
3 evaluationsAI Agent - Tool Usage
4 evaluationsCompetitor Comparison
Benchmark scores for Claude Fable 5 compared against top models in its class
4 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.
| Benchmark | Claude Fable 5Current | GPT-5.5 | Gemini 3.1 Pro Preview | DeepSeek-V4-Pro |
|---|---|---|---|---|
HLE 综合评估 | 59.00Deep Thinking Mode | 52.20Thinking Level · High | Tools | 51.40Thinking Level · High | Tools | 48.20Thinking Level · Extra High | Tools |
SWE-Bench Pro - Public 编程与软件工程 | 80.30Deep Thinking Mode | Tools | 58.60Thinking Level · High | Tools | 54.20Thinking Level · High | Tools | 55.40Thinking Level · Extra High | Tools |
SWE-bench Verified 编程与软件工程 | 95.00Thinking Level · High | Tools | -- | 80.60Thinking Level · High | Tools | 80.60Thinking Level · Extra High | Tools |
OSWorld-Verified AI Agent - 工具使用 | 85.00Thinking Level · High | Tools | 78.70Thinking Level · High | Tools | -- | -- |
Standard API Pricing: Claude Fable 5 vs. Peer Models
Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.
Source: DataLearnerAI. Standard text prices shown here use the default supplier. · USD / 1M tokens
When a context threshold exists, the charted base price only applies within these limits:
| Model | Supplier | Standard input | Standard output | Base price applies to |
|---|---|---|---|---|
Claude Fable 5 | Anthropic | $10 / 1M tokens | $50 / 1M tokens | — |
GPT-5.5 | OpenAI | $5 / 1M tokens | $30 / 1M tokens | — |
Gemini 3.1 Pro Preview | Google Deep Mind | $2 / 1M tokens | $12 / 1M tokens | <= 200K |
DeepSeek-V4-Pro | DeepSeek-AI | $0.435 / 1M tokens | $0.87 / 1M tokens | — |