C4AI Command A (202503) Benchmark Details

C4AI Command A (202503) currently shows benchmark results led by Aider-Polyglot (55 / 59, score 12).

Benchmark Results

C4AI Command A (202503)

Benchmark Results

Thinking

Agent Level Benchmark

1 evaluations
Benchmark / mode
Score
Rank/total
Aider-Polyglot
Standard Mode
12
55 / 59