DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeOverall LeaderboardOpen LLM Leaderboard (China Mirror)

Open LLM Leaderboard (China Mirror)

Open LLM Leaderboard tracks model performance on ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, and GSM8K benchmarks.

Top Model

test_mistral2

Top Score

-

Model Count

100

Data version

-

Data source: HuggingFace

Model type:全部模型Pretrained ModelsFine Tuned ModelsChat ModelsMerged or MoE Models
Origin:AllChina
Leaderboard snapshot month:

Ranking Table

ModelTypeParameters (B)AverageARCHellaSwagMMLUTruthfulQAWinograndeGSM8KArchitecture
TEtest_mistral2Fine Tuned Models71.129.2727.9025.3224.7449.1048.540.00

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

MistralModel
GPgpt2-dollyChat Models1.229.2122.7030.1525.8144.9751.460.15GPT2LMHeadModel
PYPythia-70M-ChatSaladFine Tuned Models129.2020.9927.2824.7849.7452.410.00GPTNeoXForCausalLM
SMsmol_llama-220M-open_instructChat Models2.229.1925.0029.7126.1144.0650.280.00LlamaForCausalLM
DIDialoGPT-smallFine Tuned Models1.829.1925.7725.7925.8147.4950.280.00GPT2LMHeadModel
MImistral-environment-allFine Tuned Models72.429.1829.4425.8923.1247.9248.700.00MistralForCausalLM
TEtestfinetunedmodelFine Tuned Models1.229.1825.8531.4026.0740.7550.990.00GPT2LMHeadModel
TITinyMistral-v2.5-MiniPile-Guidelines-E1Pretrained Models029.1626.5425.6523.4449.9049.410.00MistralForCausalLM
TITinyMistral-v2.5-MiniPile-Guidelines-E1Pretrained Models029.1526.4525.6823.5349.8549.410.00MistralForCausalLM
PYpythia-31m-KI_v1-2048-scratchPretrained Models0.329.1523.1225.2323.1251.6751.780.00GPTNeoXForCausalLM
OPopt-125mPretrained Models1.229.1522.8731.4726.0242.8751.620.08OPTForCausalLM
GPgpt-neo-125m-neurallinguisticpioneersFine Tuned Models1.229.1522.4430.3625.1445.6451.220.08GPTNeoForCausalLM
CECerebras-GPT-590MUnkown Model Types5.929.1423.7232.4025.9744.1548.150.45?
LLLlama-2-7b-Chat-AWQFine Tuned Models11.329.1427.2225.4824.6749.9547.510.00Unknown
TITinyYi-7b-TestFine Tuned Models60.629.1126.8826.1424.4146.3550.910.00Unknown
GPgpt3-finnish-largePretrained Models029.1121.7632.8824.1144.3551.540.00BloomModel
GPgpt-neox-122m-minipile-digitsFine Tuned Models1.729.1020.7327.0325.3149.1952.330.00GPTNeoXForCausalLM
16160M-TinyLLama-Mini-CinderFine Tuned Models1.429.0924.6628.1625.0944.0852.570.00LlamaForCausalLM
MPmpt-1b-redpajama-200bFine Tuned Models1029.0525.7726.0824.5047.5750.360.00MosaicGPT
PYpythia-160mPretrained Models2.129.0222.7830.3424.9544.2651.540.23GPTNeoXForCausalLM
GPgpt2-conversational-or-qaFine Tuned Models1.429.0121.4227.6126.5147.3151.140.08GPT2LMHeadModel
HEhepu-o4zf-ravz-7-0Fine Tuned Models72.429.0124.4925.3623.2751.6749.250.00MistralForCausalLM
SMSmolLlamix-8x101MPretrained Models428.9822.7028.5024.6946.0951.300.61MixtralForCausalLM
SMsmol_llama-101M-GQAPretrained Models128.9723.5528.7724.2445.7650.670.83LlamaForCausalLM
SMsmol_llama-101M-GQAFine Tuned Models128.9623.4628.7324.3545.8050.670.76LlamaForCausalLM
OPOPT-19M-ChatSaladFine Tuned Models0.228.9624.4025.1523.1251.3649.720.00OPTForCausalLM
PYpythia-70mPretrained Models128.9321.5927.2925.9047.0651.460.30Unknown
OPopt-125m-gqa-ub-6-best-for-KV-cachePretrained Models1.228.9324.2325.0023.1249.5351.700.00OPTForCausalLM
MIMixsmol-4x400M-v0.1-epoch2Pretrained Models17.728.9223.5532.6025.2639.2452.640.23MixtralForCausalLM
59590mUnkown Model Types6.728.8824.1531.9126.6142.1948.380.08GPT2LMHeadModel
OPopen-calm-largePretrained Models028.8820.7329.5625.2346.5251.140.08GPTNeoXForCausalLM
GPgpt2_137m_DolphinCoderFine Tuned Models1.428.8721.8431.3525.4041.5852.011.06Unknown
GPgpt2_137m_DolphinCoderFine Tuned Models1.428.8721.8431.3525.4041.5852.011.06Unknown
DIDialoGPT-mediumFine Tuned Models028.8624.4926.2125.8447.0649.570.00GPT2LMHeadModel
EAeasyTermsSummerizerFine Tuned Models4.128.8625.7725.8123.1247.6950.750.00Unknown
FIFinOPT-WashingtonFine Tuned Models1.228.8525.1726.2524.8345.8051.070.00OPTForCausalLM
PYpythia-31m-goodwiki-deduped-2048-scratchPretrained Models0.328.8523.1225.6623.1151.3249.880.00GPTNeoXForCausalLM
DIdistilgpt2-emailgenFine Tuned Models0.928.8421.7627.5225.9746.1751.620.00GPT2LMHeadModel
FAfacebook-opt-6.7b-gqa-ub-16-best-for-KV-cachePretrained Models6728.8423.0425.9423.1248.9951.930.00OPTForCausalLM
PYpythia-31mPretrained Models0.328.8121.8427.0024.9749.1049.720.23GPTNeoXForCausalLM
YIYi-8B-LlamaUnkown Model Types87.328.7825.6826.7924.1447.7948.300.00Unknown
PYpythia-owt2-70m-100kFine Tuned Models0.728.7820.9028.3425.0245.1253.280.00Unknown
TITinyMistral-248M-v2Pretrained Models2.528.7821.2526.5623.3949.6051.850.00MistralForCausalLM
25256_5epochFine Tuned Models3.228.7622.2728.9926.6241.7152.720.23GPT2LMHeadModel
SMSmol-Llama-101M-Chat-v1Fine Tuned Models128.7322.8728.6924.9345.7650.040.08LlamaForCausalLM
PYpythia-owt2-70m-50kFine Tuned Models0.728.7121.5028.1525.7044.5052.410.00Unknown
PYpythia-70m-deduped-cleansharegpt-enFine Tuned Models0.728.7121.1627.1625.2448.5750.120.00GPTNeoXForCausalLM
VEverysmol_llama-v11-KIx2Pretrained Models0.628.7022.7027.6025.2844.7551.540.30LlamaForCausalLM
FAfacebook-opt-125m-qcqa-ub-6-best-for-KV-cachePretrained Models1.228.6624.2325.0023.1248.4151.220.00OPTForCausalLM
NAnano-phi-115M-v0.1Pretrained Models1.228.6621.9327.8625.3446.0050.830.00PhiForCausalLM
DIdistilgpt2-emailgen-V2Fine Tuned Models0.928.6420.9926.7825.5346.5152.010.00GPT2LMHeadModel
PYpythia-31m-simplewiki-scratch-bf16Pretrained Models0.328.6122.7825.6123.1249.6550.510.00GPTNeoXForCausalLM
PYpythia-31m-simplepile-lite-2048-scratch-2ePretrained Models0.328.6021.5925.7924.9950.6248.620.00GPTNeoXForCausalLM
FAfacebook-opt-6.7b-qcqa-ub-16-best-for-KV-cachePretrained Models6728.5823.8127.0523.1246.6950.830.00OPTForCausalLM
GPgpt2_open-platypusChat Models1.228.5822.1831.2926.1940.3551.300.15GPT2LMHeadModel
KOKoAlpaca-KoRWKV-6BChat Models65.328.5723.4631.6524.8939.8351.620.00RwkvForCausalLM
RWRWKV-4-PilePlus-169M-20230520-done-ctx4096Fine Tuned Models1.328.5723.9832.2523.3742.2949.170.38Unknown
CHchat_gpt2_dpoFine Tuned Models1.228.5623.9831.2224.9541.2649.960.00GPT2LMHeadModel
FAfalcon-1b-cot-t2Fine Tuned Models13.128.5624.7424.7523.1248.3850.360.00FalconForCausalLM
MYMy_GPT2Fine Tuned Models1.428.5521.9331.5925.8440.7350.510.68GPT2LMHeadModel
GPgpt2Pretrained Models1.428.5322.0131.5325.8340.6950.430.68GPT2LMHeadModel
QUQuokka_590mFine Tuned Models6.728.5324.4031.6125.3639.5950.200.00GPT2LMHeadModel
GPgpt2_guanaco-dolly-platypusChat Models1.228.5223.5531.0326.4040.0250.120.00GPT2LMHeadModel
GPgpt2_platypus-dolly-guanacoChat Models1.228.5123.2131.0426.1640.3150.360.00GPT2LMHeadModel
MAmath_gpt2Fine Tuned Models028.5024.2330.8825.3839.2351.070.23GPT2LMHeadModel
DIdistillgpt2CinderFine Tuned Models0.828.5024.4927.2424.9743.9650.120.23GPT2LMHeadModel
GPgpt_bigcode-santacoderPretrained Models11.228.4921.1630.8424.9745.6447.830.53GPTBigCodeForCausalLM
LAlamini-cerebras-256mFine Tuned Models2.628.4921.7628.7026.6641.8152.010.00Unknown
COcode_gpt2_mini_modelFine Tuned Models1.228.4923.7231.2524.9639.8651.140.00GPT2LMHeadModel
GPgpt-sw3-126mPretrained Models1.928.4922.1829.5424.4344.0350.670.08GPT2LMHeadModel
TITinyStories-AlpacaFine Tuned Models0.728.4623.9824.9223.3546.6851.850.00GPTNeoForCausalLM
PHphi-2-upscaled-4B-instruct-v0.1Fine Tuned Models40.428.4522.9528.6826.8040.9250.590.76PhiForCausalLM
MIMixsmol-4x400M-v0.1-epoch1Chat Models17.728.4522.8730.5725.2839.0352.800.15MixtralForCausalLM
MIMixtral-GQA-400m-v2Pretrained Models20.128.4520.2227.7826.1046.5549.960.08MixtralForCausalLM
GPgpt-sw3-126mPretrained Models1.928.4522.0129.5624.5344.0750.430.08GPT2LMHeadModel
LLLlama-Flan-XL2baseUnkown Model Types2028.4420.6525.3323.1950.5850.910.00LlamaForCausalLM
PYpythia-70m-dedupedPretrained Models128.4421.0827.1725.2647.5149.640.00GPTNeoXForCausalLM
BOboomer-1bPretrained Models1028.4422.7831.5825.6639.1750.510.91LlamaForCausalLM
TITinyMistral-v2-Test1Pretrained Models028.4221.5026.7923.3650.3048.540.00MistralForCausalLM
GPgpt2_camel_physics-platypusChat Models1.228.4123.0431.3226.9139.5649.640.00GPT2LMHeadModel
GPgpt2_platypus-camel_physicsChat Models1.228.4123.0431.3226.9139.5649.640.00Unknown
GPgpt2_testPretrained Models1.428.4021.8431.6025.8640.6750.120.30GPT2LMHeadModel
FIfinetuned-gpt2-tinyFine Tuned Models028.4021.8431.6025.8640.6750.120.30GPT2LMHeadModel
GPgpt2_platypus-camel_physicsChat Models1.228.4022.7831.2425.8738.9551.540.00Unknown
LAlamini-cerebras-590mUnkown Model Types5.928.3824.3231.5825.5740.7247.910.15Unknown
FAfacebook-opt-125m-qcqa-ub-6-best-for-q-lossPretrained Models1.228.3723.2925.5723.1549.0349.170.00OPTForCausalLM
GPgpt2-alpaca-gpt4Fine Tuned Models1.428.3422.6131.1725.7638.0452.170.30GPT2LMHeadModel
QUQuokka_256mFine Tuned Models3.228.3222.8728.8426.4839.4752.250.00GPT2LMHeadModel
COconvo_bot_gpt_v1Fine Tuned Models028.3022.3531.0726.1238.7151.540.00GPT2LMHeadModel
GPGPT-2-SlimOrcaDeduped-airoboros-3.1-MetaMathQA-SFT-124MChat Models1.228.3024.5729.4325.8238.8449.012.12Unknown
PYpythia-31mPretrained Models0.328.3019.9726.3424.2750.1249.090.00GPTNeoXForCausalLM
DLdlite-v2-124mFine Tuned Models1.228.3023.9831.1025.2938.9850.430.00GPT2LMHeadModel
KOko-wand-136MPretrained Models1.428.2921.3325.0023.5850.6849.170.00MistralForCausalLM
LAlamini-cerebras-111mFine Tuned Models1.128.2922.1027.1225.5143.7951.220.00Unknown
PYpythia-31m-simplewiki-2048Pretrained Models0.328.2722.1825.5523.1249.3749.410.00GPTNeoXForCausalLM
FAfacebook-opt-6.7b-qcqa-ub-16-best-for-q-lossPretrained Models6728.2521.6726.6523.1546.8151.220.00OPTForCausalLM
OPopen-calm-7bFine Tuned Models7028.2120.4830.6525.2244.1548.540.23GPTNeoXForCausalLM
GPgpt2023Fine Tuned Models1.428.2021.9331.1125.0540.7150.120.30GPT2LMHeadModel
GPgpt-sw3-126m-instructChat Models1.928.2023.3829.8823.7842.6548.540.99GPT2LMHeadModel
TITinyMistral-248M-SFT-v4Chat Models2.528.2024.9128.1526.0439.5650.510.00MistralForCausalLM
Previous24 / 25Next