Phi-4-mini-instruct (3.8B) currently shows benchmark results led by GSM8K (14 / 26, score 88.60), HumanEval (24 / 39, score 74.40), and MATH (27 / 42, score 64).