DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeEvaluation OverviewOpen LLM Leaderboard 中国站

Open LLM Leaderboard 中国站

Open LLM Leaderboard是追踪大模型评测结果的排行榜,通过追踪大语言模型和ChatBot在不同评测任务上的表现来对模型进行排名和评估。

Top Model

test_mistral2

Top Score

-

Model Count

100

Data version

-

Data source: HuggingFace

Filters

Model type:全部模型Pretrained ModelsFine Tuned ModelsChat ModelsMerged or MoE Models

Ranking Table

ModelTypeParameters (B)AverageARCHellaSwagMMLUTruthfulQAWinograndeGSM8KArchitecture
test_mistral2Fine Tuned Models71.129.2727.925.3224.7449.148.540.0MistralModel
gpt2-dolly

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

Chat Models
1.2
29.21
22.7
30.15
25.81
44.97
51.46
0.15
GPT2LMHeadModel
Pythia-70M-ChatSaladFine Tuned Models129.220.9927.2824.7849.7452.410.0GPTNeoXForCausalLM
smol_llama-220M-open_instructChat Models2.229.1925.029.7126.1144.0650.280.0LlamaForCausalLM
DialoGPT-smallFine Tuned Models1.829.1925.7725.7925.8147.4950.280.0GPT2LMHeadModel
mistral-environment-allFine Tuned Models72.429.1829.4425.8923.1247.9248.70.0MistralForCausalLM
testfinetunedmodelFine Tuned Models1.229.1825.8531.426.0740.7550.990.0GPT2LMHeadModel
TinyMistral-v2.5-MiniPile-Guidelines-E1Pretrained Models029.1626.5425.6523.4449.949.410.0MistralForCausalLM
TinyMistral-v2.5-MiniPile-Guidelines-E1Pretrained Models029.1526.4525.6823.5349.8549.410.0MistralForCausalLM
pythia-31m-KI_v1-2048-scratchPretrained Models0.329.1523.1225.2323.1251.6751.780.0GPTNeoXForCausalLM
opt-125mPretrained Models1.229.1522.8731.4726.0242.8751.620.08OPTForCausalLM
gpt-neo-125m-neurallinguisticpioneersFine Tuned Models1.229.1522.4430.3625.1445.6451.220.08GPTNeoForCausalLM
Cerebras-GPT-590MUnkown Model Types5.929.1423.7232.425.9744.1548.150.45?
Llama-2-7b-Chat-AWQFine Tuned Models11.329.1427.2225.4824.6749.9547.510.0Unknown
TinyYi-7b-TestFine Tuned Models60.629.1126.8826.1424.4146.3550.910.0Unknown
gpt3-finnish-largePretrained Models029.1121.7632.8824.1144.3551.540.0BloomModel
gpt-neox-122m-minipile-digitsFine Tuned Models1.729.120.7327.0325.3149.1952.330.0GPTNeoXForCausalLM
160M-TinyLLama-Mini-CinderFine Tuned Models1.429.0924.6628.1625.0944.0852.570.0LlamaForCausalLM
mpt-1b-redpajama-200bFine Tuned Models1029.0525.7726.0824.547.5750.360.0MosaicGPT
pythia-160mPretrained Models2.129.0222.7830.3424.9544.2651.540.23GPTNeoXForCausalLM
gpt2-conversational-or-qaFine Tuned Models1.429.0121.4227.6126.5147.3151.140.08GPT2LMHeadModel
hepu-o4zf-ravz-7-0Fine Tuned Models72.429.0124.4925.3623.2751.6749.250.0MistralForCausalLM
SmolLlamix-8x101MPretrained Models428.9822.728.524.6946.0951.30.61MixtralForCausalLM
smol_llama-101M-GQAPretrained Models128.9723.5528.7724.2445.7650.670.83LlamaForCausalLM
smol_llama-101M-GQAFine Tuned Models128.9623.4628.7324.3545.850.670.76LlamaForCausalLM
OPT-19M-ChatSaladFine Tuned Models0.228.9624.425.1523.1251.3649.720.0OPTForCausalLM
pythia-70mPretrained Models128.9321.5927.2925.947.0651.460.3Unknown
opt-125m-gqa-ub-6-best-for-KV-cachePretrained Models1.228.9324.2325.023.1249.5351.70.0OPTForCausalLM
Mixsmol-4x400M-v0.1-epoch2Pretrained Models17.728.9223.5532.625.2639.2452.640.23MixtralForCausalLM
590mUnkown Model Types6.728.8824.1531.9126.6142.1948.380.08GPT2LMHeadModel
open-calm-largePretrained Models028.8820.7329.5625.2346.5251.140.08GPTNeoXForCausalLM
gpt2_137m_DolphinCoderFine Tuned Models1.428.8721.8431.3525.441.5852.011.06Unknown
gpt2_137m_DolphinCoderFine Tuned Models1.428.8721.8431.3525.441.5852.011.06Unknown
DialoGPT-mediumFine Tuned Models028.8624.4926.2125.8447.0649.570.0GPT2LMHeadModel
easyTermsSummerizerFine Tuned Models4.128.8625.7725.8123.1247.6950.750.0Unknown
FinOPT-WashingtonFine Tuned Models1.228.8525.1726.2524.8345.851.070.0OPTForCausalLM
pythia-31m-goodwiki-deduped-2048-scratchPretrained Models0.328.8523.1225.6623.1151.3249.880.0GPTNeoXForCausalLM
distilgpt2-emailgenFine Tuned Models0.928.8421.7627.5225.9746.1751.620.0GPT2LMHeadModel
facebook-opt-6.7b-gqa-ub-16-best-for-KV-cachePretrained Models6728.8423.0425.9423.1248.9951.930.0OPTForCausalLM
pythia-31mPretrained Models0.328.8121.8427.024.9749.149.720.23GPTNeoXForCausalLM
Yi-8B-LlamaUnkown Model Types87.328.7825.6826.7924.1447.7948.30.0Unknown
pythia-owt2-70m-100kFine Tuned Models0.728.7820.928.3425.0245.1253.280.0Unknown
TinyMistral-248M-v2Pretrained Models2.528.7821.2526.5623.3949.651.850.0MistralForCausalLM
256_5epochFine Tuned Models3.228.7622.2728.9926.6241.7152.720.23GPT2LMHeadModel
Smol-Llama-101M-Chat-v1Fine Tuned Models128.7322.8728.6924.9345.7650.040.08LlamaForCausalLM
pythia-owt2-70m-50kFine Tuned Models0.728.7121.528.1525.744.552.410.0Unknown
pythia-70m-deduped-cleansharegpt-enFine Tuned Models0.728.7121.1627.1625.2448.5750.120.0GPTNeoXForCausalLM
verysmol_llama-v11-KIx2Pretrained Models0.628.722.727.625.2844.7551.540.3LlamaForCausalLM
facebook-opt-125m-qcqa-ub-6-best-for-KV-cachePretrained Models1.228.6624.2325.023.1248.4151.220.0OPTForCausalLM
nano-phi-115M-v0.1Pretrained Models1.228.6621.9327.8625.3446.050.830.0PhiForCausalLM
distilgpt2-emailgen-V2Fine Tuned Models0.928.6420.9926.7825.5346.5152.010.0GPT2LMHeadModel
pythia-31m-simplewiki-scratch-bf16Pretrained Models0.328.6122.7825.6123.1249.6550.510.0GPTNeoXForCausalLM
pythia-31m-simplepile-lite-2048-scratch-2ePretrained Models0.328.621.5925.7924.9950.6248.620.0GPTNeoXForCausalLM
facebook-opt-6.7b-qcqa-ub-16-best-for-KV-cachePretrained Models6728.5823.8127.0523.1246.6950.830.0OPTForCausalLM
gpt2_open-platypusChat Models1.228.5822.1831.2926.1940.3551.30.15GPT2LMHeadModel
KoAlpaca-KoRWKV-6BChat Models65.328.5723.4631.6524.8939.8351.620.0RwkvForCausalLM
RWKV-4-PilePlus-169M-20230520-done-ctx4096Fine Tuned Models1.328.5723.9832.2523.3742.2949.170.38Unknown
chat_gpt2_dpoFine Tuned Models1.228.5623.9831.2224.9541.2649.960.0GPT2LMHeadModel
falcon-1b-cot-t2Fine Tuned Models13.128.5624.7424.7523.1248.3850.360.0FalconForCausalLM
My_GPT2Fine Tuned Models1.428.5521.9331.5925.8440.7350.510.68GPT2LMHeadModel
gpt2Pretrained Models1.428.5322.0131.5325.8340.6950.430.68GPT2LMHeadModel
Quokka_590mFine Tuned Models6.728.5324.431.6125.3639.5950.20.0GPT2LMHeadModel
gpt2_guanaco-dolly-platypusChat Models1.228.5223.5531.0326.440.0250.120.0GPT2LMHeadModel
gpt2_platypus-dolly-guanacoChat Models1.228.5123.2131.0426.1640.3150.360.0GPT2LMHeadModel
math_gpt2Fine Tuned Models028.524.2330.8825.3839.2351.070.23GPT2LMHeadModel
distillgpt2CinderFine Tuned Models0.828.524.4927.2424.9743.9650.120.23GPT2LMHeadModel
gpt_bigcode-santacoderPretrained Models11.228.4921.1630.8424.9745.6447.830.53GPTBigCodeForCausalLM
lamini-cerebras-256mFine Tuned Models2.628.4921.7628.726.6641.8152.010.0Unknown
code_gpt2_mini_modelFine Tuned Models1.228.4923.7231.2524.9639.8651.140.0GPT2LMHeadModel
gpt-sw3-126mPretrained Models1.928.4922.1829.5424.4344.0350.670.08GPT2LMHeadModel
TinyStories-AlpacaFine Tuned Models0.728.4623.9824.9223.3546.6851.850.0GPTNeoForCausalLM
phi-2-upscaled-4B-instruct-v0.1Fine Tuned Models40.428.4522.9528.6826.840.9250.590.76PhiForCausalLM
Mixsmol-4x400M-v0.1-epoch1Chat Models17.728.4522.8730.5725.2839.0352.80.15MixtralForCausalLM
Mixtral-GQA-400m-v2Pretrained Models20.128.4520.2227.7826.146.5549.960.08MixtralForCausalLM
gpt-sw3-126mPretrained Models1.928.4522.0129.5624.5344.0750.430.08GPT2LMHeadModel
Llama-Flan-XL2baseUnkown Model Types2028.4420.6525.3323.1950.5850.910.0LlamaForCausalLM
pythia-70m-dedupedPretrained Models128.4421.0827.1725.2647.5149.640.0GPTNeoXForCausalLM
boomer-1bPretrained Models1028.4422.7831.5825.6639.1750.510.91LlamaForCausalLM
TinyMistral-v2-Test1Pretrained Models028.4221.526.7923.3650.348.540.0MistralForCausalLM
gpt2_camel_physics-platypusChat Models1.228.4123.0431.3226.9139.5649.640.0GPT2LMHeadModel
gpt2_platypus-camel_physicsChat Models1.228.4123.0431.3226.9139.5649.640.0Unknown
gpt2_testPretrained Models1.428.421.8431.625.8640.6750.120.3GPT2LMHeadModel
finetuned-gpt2-tinyFine Tuned Models028.421.8431.625.8640.6750.120.3GPT2LMHeadModel
gpt2_platypus-camel_physicsChat Models1.228.422.7831.2425.8738.9551.540.0Unknown
lamini-cerebras-590mUnkown Model Types5.928.3824.3231.5825.5740.7247.910.15Unknown
facebook-opt-125m-qcqa-ub-6-best-for-q-lossPretrained Models1.228.3723.2925.5723.1549.0349.170.0OPTForCausalLM
gpt2-alpaca-gpt4Fine Tuned Models1.428.3422.6131.1725.7638.0452.170.3GPT2LMHeadModel
Quokka_256mFine Tuned Models3.228.3222.8728.8426.4839.4752.250.0GPT2LMHeadModel
convo_bot_gpt_v1Fine Tuned Models028.322.3531.0726.1238.7151.540.0GPT2LMHeadModel
GPT-2-SlimOrcaDeduped-airoboros-3.1-MetaMathQA-SFT-124MChat Models1.228.324.5729.4325.8238.8449.012.12Unknown
pythia-31mPretrained Models0.328.319.9726.3424.2750.1249.090.0GPTNeoXForCausalLM
dlite-v2-124mFine Tuned Models1.228.323.9831.125.2938.9850.430.0GPT2LMHeadModel
ko-wand-136MPretrained Models1.428.2921.3325.023.5850.6849.170.0MistralForCausalLM
lamini-cerebras-111mFine Tuned Models1.128.2922.127.1225.5143.7951.220.0Unknown
pythia-31m-simplewiki-2048Pretrained Models0.328.2722.1825.5523.1249.3749.410.0GPTNeoXForCausalLM
facebook-opt-6.7b-qcqa-ub-16-best-for-q-lossPretrained Models6728.2521.6726.6523.1546.8151.220.0OPTForCausalLM
open-calm-7bFine Tuned Models7028.2120.4830.6525.2244.1548.540.23GPTNeoXForCausalLM
gpt2023Fine Tuned Models1.428.221.9331.1125.0540.7150.120.3GPT2LMHeadModel
gpt-sw3-126m-instructChat Models1.928.223.3829.8823.7842.6548.540.99GPT2LMHeadModel
TinyMistral-248M-SFT-v4Chat Models2.528.224.9128.1526.0439.5650.510.0MistralForCausalLM
Previous24 / 25Next