加载中...
Grok-1.5 currently shows benchmark results led by HumanEval (25 / 38, score 74.10), MMLU (46 / 63, score 81.30), MATH (32 / 41, score 50.60).