Qwen3-VL-8B-Instruct

Name: Qwen3-VL-8B-Instruct
Author: 阿里巴巴

Multimodal modelQwen3

Release date: 2025-10-15Updated: 2025-10-15 08:23:371,429

Live demoGitHub Hugging Face Compare

Parameters

8.8B

Context length

256K

Chinese support

Not supported

Reasoning ability

Qwen3-VL-8B-Instruct is an AI model published by 阿里巴巴, released on 2025-10-15, for Multimodal model, with 8.8B parameters, and 256K context length, under the Apache 2.0 license, with a 96.10 score on DocVQA.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Qwen3-VL-8B-Instruct

Model basics

Reasoning traces

Not supported

Thinking modes

Thinking modes not supported

Context length

256K tokens

Max output length

No data

Model type

Multimodal model

Modality (in / out)

Text, Image, Video → Text

Release date

2025-10-15

Model file size

No data

MoE architecture

Total params / Active params

8.8B / N/A

Knowledge cutoff

No data

Qwen3-VL-8B-Instruct

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0- 免费商用授权

GitHub repo

https://github.com/QwenLM/Qwen3-VL

Hugging Face

https://huggingface.co/Qwen/Qwen3-VL-8B-Instruct

Live demo

No live demo

Qwen3-VL-8B-Instruct

Official resources

Paper

Qwen3 Technical Report

DataLearnerAI blog

No blog post yet

Qwen3-VL-8B-Instruct

API details

API speed

3/5

No public API pricing yet.

Qwen3-VL-8B-Instruct

Benchmark Results

Qwen3-VL-8B-Instruct currently shows benchmark results led by DocVQA (2 / 5, score 96.10), MMMU (23 / 28, score 69.60). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.