加载中...
加载中...
Quickly view LLM performance across benchmarks like MMLU Pro, HLE, SWE-Bench, and more. Compare models across general knowledge, coding, and reasoning capabilities. Customize your comparison by selecting specific models and benchmarks.
Detailed benchmark descriptions available at:LLM Benchmark List & Guide
Benchmark switcher
Pick the leaderboard to sync both chart and table
Data source: DataLearnerAI
| 0.00 |
| 0.00 |
| 0.00 |
| 3 | Qwen2.5-14B | 63.69 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 4 | Gemma 3 - 12B (IT) | 60.60 | 40.90 | 0.00 | 0.00 | 0.00 | 24.60 |
| 5 | Moonlight-16B-A3B-Instruct | 42.40 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |