Qwen3-4B-Thinking-2507 Benchmark Details
Qwen3-4B-Thinking-2507 currently shows benchmark results led by AIME2025 (55 / 106, score 81.30), LiveCodeBench (73 / 109, score 55.20), GPQA Diamond (116 / 166, score 65.80).
Benchmark Results
Qwen3-4B-Thinking-2507