Llama3.1-8B-Instruct Benchmark Details
Llama3.1-8B-Instruct's highest listed benchmark scores are GSM8K (score 82.40, rank 16 of 26), MBPP (score 69.40, rank 18 of 28), and HumanEval (score 66.50, rank 28 of 39).
Benchmark Results
Comprehensive Evaluation
3 evaluations
Benchmark / mode
Score
Rank/total