Open LLM Leaderboard (China Mirror)

Name: Open LLM Leaderboard (China Mirror)
Creator: DataLearner
License: https://creativecommons.org/licenses/by/4.0/

Open LLM Leaderboard tracks model performance on ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, and GSM8K benchmarks.

Top Model

Velara-11B-V2

Top Score

Model Count

100

Data version

Data source: HuggingFace

Leaderboard snapshot month:

Ranking Table

Model	Type	Parameters (B)	Average	ARC	HellaSwag	MMLU	TruthfulQA	Winogrande	GSM8K	Architecture
Velara-11B-V2	Fine Tuned Models	113.9	65.55	63.82	85.85	63.62	58.83	77.82	43.37

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.