Open LLM Leaderboard (China Mirror)

Name: Open LLM Leaderboard (China Mirror)
Creator: DataLearner
License: https://creativecommons.org/licenses/by/4.0/

The Open LLM Leaderboard tracks model performance on six benchmarks: ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, and GSM8K.

Top Model

GOAT-70B-Storytelling

Top Score

67.38 (Average)

Model Count

100

Data version

Data source: HuggingFace

Leaderboard snapshot month:

Ranking Table

Model	Type	Parameters (B)	Average	ARC	HellaSwag	MMLU	TruthfulQA	Winogrande	GSM8K	Architecture
GOAT-70B-Storytelling	Fine Tuned Models	70	67.38	68.77	87.74	69.92	53.53	83.50	40.79
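
The Average column appears to be the unweighted mean of the six benchmark scores, rounded to two decimals. The page does not state this rule, so treat it as an assumption; the sketch below only checks it against the GOAT-70B-Storytelling row. The names `SCORES` and `leaderboard_average` are hypothetical and used for illustration only.

```python
from decimal import Decimal, ROUND_HALF_UP

# Benchmark scores copied from the GOAT-70B-Storytelling row of the table.
# Decimal avoids binary floating-point surprises at the rounding boundary.
SCORES = {
    "ARC": Decimal("68.77"),
    "HellaSwag": Decimal("87.74"),
    "MMLU": Decimal("69.92"),
    "TruthfulQA": Decimal("53.53"),
    "Winogrande": Decimal("83.50"),
    "GSM8K": Decimal("40.79"),
}


def leaderboard_average(benchmark_scores):
    """Unweighted mean of the scores, rounded to two decimals.

    Assumption: this mirrors how the Average column is derived;
    the page itself does not document the rule.
    """
    total = sum(benchmark_scores.values(), Decimal("0"))
    mean = total / len(benchmark_scores)
    return mean.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    print(leaderboard_average(SCORES))
```

For this row the six scores sum to 404.25, so the mean is 67.375, which rounds to the 67.38 listed under Average.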

Data is for reference only; the official sources are authoritative.