Open LLM Leaderboard (China Mirror)

Name: Open LLM Leaderboard (China Mirror)
Creator: DataLearner
License: https://creativecommons.org/licenses/by/4.0/

The Open LLM Leaderboard tracks model performance on six benchmarks: ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, and GSM8K.

Top Model: test_mistral2
Top Score:
Model Count: 100

Data version

Data source: HuggingFace

Leaderboard snapshot month:

Ranking Table

Model	Type	Parameters (B)	Average	ARC	HellaSwag	MMLU	TruthfulQA	Winogrande	GSM8K	Architecture
test_mistral2	Fine Tuned Models	71.1	29.27	27.90	25.32	24.74	49.10	48.54	0.00
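
The Average column appears to be the unweighted mean of the six benchmark scores, and the row above is consistent with that reading. The following is a minimal sketch, not the leaderboard's own code, that parses the tab-separated table and recomputes the average. The names `parse_rows` and `mean_score` are hypothetical helpers introduced for illustration, and the rounding to two decimals is an assumption based on how the scores are displayed.

```python
import csv
import io

# Header and row copied from the ranking table (tab-separated).
TABLE = (
    "Model\tType\tParameters (B)\tAverage\tARC\tHellaSwag\tMMLU\t"
    "TruthfulQA\tWinogrande\tGSM8K\tArchitecture\n"
    "test_mistral2\tFine Tuned Models\t71.1\t29.27\t27.90\t25.32\t24.74\t"
    "49.10\t48.54\t0.00\n"
)

BENCHMARKS = ["ARC", "HellaSwag", "MMLU", "TruthfulQA", "Winogrande", "GSM8K"]


def parse_rows(text):
    """Parse tab-separated leaderboard text into a list of dicts."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    # DictReader fills missing trailing fields (here: Architecture) with None.
    return [dict(row) for row in reader]


def mean_score(row):
    """Unweighted mean of the six benchmark scores, rounded to 2 decimals (assumed)."""
    scores = [float(row[name]) for name in BENCHMARKS]
    return round(sum(scores) / len(scores), 2)


rows = parse_rows(TABLE)
first = rows[0]
computed = mean_score(first)
print(first["Model"], computed, first["Average"])
```

Note that the row carries no Architecture value, so the parser leaves that field as `None` rather than guessing one.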

Data is for reference only. Official sources are authoritative.