Stable Video Diffusion - XT
Stable Video Diffusion - XT is an AI model published by Stability AI, released on 2023-11-21, for 视觉大模型, with 10.0B parameters, and 2K tokens context length, requiring about 9.56GB storage, under the 开源不可商用 license.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
SVD全称Stable Video Diffusion,是StabilityAI最新的开源文本生成视频大模型。这个模型是基于Stable Diffusion 2.1进行初始化,然后通过在图像模型中插入时空卷积和注意力层来构建这个视频生成模型的架构,最终在1.52以视频数据集上训练得到。
SVD-XT可以生成20帧的576x1024分辨率的视频,而SVD只能生成14帧。
SVD模型的详细信息: https://www.datalearner.com/ai-models/pretrained-models/SVD
下图是样例结果:

欢迎关注 DataLearner 官方微信,获得最新 AI 技术推送
