DocVQA

Updated Oct 19, 2025·1,786 views

Problem Count: 50000
Institution: Independent
Category: Multimodal Understanding
Metrics: Accuracy
Language: English
Difficulty: Medium

Overview

A visual question-answering benchmark built around document images and document-understanding tasks.

Related resources

Latest DocVQA model rankings and full benchmark leaderboard

Browse the latest scores, model modes, release dates, and parameter sizes for DocVQA.

Source: DataLearnerAI

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Model Mode Legend

License:

Origin:

Model release cutoff:

Rank	Model				License
	Qwen2.5-VL-72B-Instruct Standard Mode	96.40	2025-01-28	72B	Free Commercial
	Qwen3-VL-8B-Instruct Standard Mode	96.10	2025-10-15	8.8B	Free Commercial
	Qwen3-VL-4B-Instruct Standard Mode	95.30	2025-10-15	4B	Free Commercial
4	Gemini 2.5 Flash-Lite-Preview-09-2025 Standard Mode	92.00	2025-09-25	Unknown	Closed
5	GPT-5-Nano Standard Mode	78.30	2025-08-07	Unknown	Closed

Latest DocVQA model rankings and full benchmark leaderboard

DocVQA Rank