DeepSeek-R1-Distill-Llama-70B Benchmark Details

DeepSeek-R1-Distill-Llama-70B currently shows benchmark results led by MATH-500 (score 94.50, rank 27 of 44) and GPQA Diamond (score 65.20, rank 130 of 179).

Benchmark Results

Thinking

General Knowledge

1 evaluation

Benchmark / mode    Score    Rank / total
GPQA Diamond        65.20    130 / 179

Math and Reasoning

1 evaluation

Benchmark / mode    Score    Rank / total
MATH-500            94.50    27 / 44