MPT-30B

Name: MosaicML Pretrained Transformer - 30B
Author: MosaicML

基础大模型

MosaicML Pretrained Transformer - 30B

Release date: 2023-06-22更新于: 2023-06-23 20:41:29.747265

Live demo

Parameters

300.0亿

Context length

Chinese support

Not supported

Reasoning ability

MosaicML Pretrained Transformer - 30B is an AI model published by MosaicML, released on 2023-06-22, for 基础大模型, with 300.0B parameters, and 2K tokens context length, requiring about 60GB storage, under the Apache 2.0 license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

MPT-30B

Model basics

Reasoning traces

Not supported

Thinking modes

Thinking modes not supported

Context length

2K tokens

Max output length

No data

Model type

基础大模型

Release date

2023-06-22

Model file size

60GB

MoE architecture

Total params / Active params

300.0B / N/A

Knowledge cutoff

No data

MPT-30B

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0- 免费商用授权

GitHub repo

https://github.com/mosaicml/llm-foundry

Hugging Face

https://huggingface.co/mosaicml/mpt-30b

Live demo

No live demo

MPT-30B

Official resources

Paper

MPT-30B: Raising the bar for open-source foundation models

DataLearnerAI blog

No blog post yet

MPT-30B

API details

API speed

No data

No public API pricing yet.

MPT-30B

Benchmark Results

No benchmark data to show.

MPT-30B

Publisher

MosaicML

View publisher details

MosaicML Pretrained Transformer - 30B

Model Overview

MPT-30B是MosaicML开源的一个300亿参数规模的基础大语言模型。这是距离MPT-7B系列模型发布仅仅一个多月时间又一次更新。

相比较此前的MPT-7B系列模型，MPT-30B修改了transformer架构，使其训练和推理更加高效。MPT-30B是一个基础大语言模型，训练数据依然来自MosaicML团队收集的1万亿文本和代码数据集。

MPT-30B具有区别于其他LLM的特殊能力，包括支持8k的上下文窗口（可以通过微调进一步扩展，类似于MPT-7B-StoryWriter），通过ALiBi支持上下文长度的外推，以及通过FlashAttention进行高效推理+训练。由于其预训练组合，它还具有强大的编码能力。

至于300亿参数规模，官方也宣称是一种精心选择的结果，MPT-30B的规模可以在单个GPU上部署：其16位精度的模型可以部署在单个xA100-80GB显卡上，而8位精度的模型则可以部署在一个A100-40GB显卡上。

MPT-30B依然是代码和预训练结果均开源可商用的方式授权，以Apache 2.0协议开源。

Foundation model

MPT

View details

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送