Composer 1.5 Benchmark Details

Composer 1.5 currently shows benchmark results led by Terminal Bench 2.0 (35 / 46, score 47.90), SWE-bench Multilingual (19 / 20, score 65.90). This page also compares it with 1 competitor models and 1 predecessor or same-series models, including performance and pricing views when available. 1 source link is attached for reference.

Benchmark Results

Composer 1.5

Benchmark Results

Thinking

AI Agent - Tool Usage

1 evaluations
Benchmark / mode
Score
Rank/total
Terminal Bench 2.0
Thinking Mode
47.90
35 / 46

Coding and Software Engineer

1 evaluations
Benchmark / mode
Score
Rank/total
65.90
19 / 20

Competitor Comparison

Benchmark scores for Composer 1.5 compared against top models in its class

Composer 1.5Claude Sonnet 4.5
Benchmark categories:
The chart shows each model’s highest score per benchmark within the current filter. Out-of-100 benchmarks use raw heights; out-of-range benchmarks are scaled within that benchmark while labels keep the original scores.

1 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

BenchmarkComposer 1.5CurrentClaude Sonnet 4.5
Terminal Bench 2.0
AI Agent - 工具使用
47.90Thinking Enabled
42.80Thinking Enabled | Tools

Standard API Pricing: Composer 1.5 vs. Peer Models

Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.

Source: DataLearnerAI. Standard text prices shown here use the default supplier. · USD / 1M tokens

ModelSupplierStandard inputStandard outputBase price applies to
Composer 1.5
Cursor$3.5 / 1M tokens$17.5 / 1M tokens

Version History

How each version of the Composer 1.5 series stacks up on benchmark tests

Composer 1.5Composer 1
Benchmark categories:
The chart shows each model’s highest score per benchmark within the current filter. Out-of-100 benchmarks use raw heights; out-of-range benchmarks are scaled within that benchmark while labels keep the original scores.

2 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.· Click a row to view its trend chart.

BenchmarkComposer 1.5CurrentComposer 1
Terminal Bench 2.0
AI Agent - 工具使用
47.90Thinking Enabled
40.00Thinking Enabled
SWE-bench Multilingual
编程与软件工程
65.90Thinking Enabled
56.90Thinking Enabled

Single-Benchmark Version Trend

Viewing: Terminal Bench 2.0 · AI Agent - 工具使用

Benchmark
NormalNormal + ToolsThinkingThinking + ToolsDeepDeep + Tools

X-axis shows model and release date, Y-axis shows score; solid lines connect the same mode across versions, while dotted guides align modes within the same generation.

Standard API Pricing Across the Composer 1.5 Series

Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.

Source: DataLearnerAI. Standard text prices shown here use the default supplier. · USD / 1M tokens

ModelSupplierStandard inputStandard outputBase price applies to
Composer 1.5
Cursor$3.5 / 1M tokens$17.5 / 1M tokens
Composer 1
Cursor$1.25 / 1M tokens$10 / 1M tokens

Sources