Grok-3 - Reasoning Beta Benchmark Details
Grok-3 - Reasoning Beta currently shows benchmark results led by AIME 2024 (6 / 62, score 93.30), LiveCodeBench (33 / 120, score 79.40), GPQA Diamond (52 / 179, score 84.60).
Benchmark Results
Grok-3 - Reasoning Beta