DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeAI ModelsGemini 3.5 Flash vs Claude Sonnet 4.6

Gemini 3.5 FlashvsClaude Sonnet 4.6

Across 3 shared benchmarks, Gemini 3.5 Flash leads overall: Gemini 3.5 Flash wins 2, Claude Sonnet 4.6 wins 1, with 0 ties and an average score difference of +3.63.

Google Deep Mind
Gemini 3.5 Flash

Google Deep Mind · 2026-06-20 · Multimodal model

Anthropic
Claude Sonnet 4.6

Anthropic · 2026-02-17 · AI model

Gemini 3.5 Flash2 wins(67%)(33%)1 winClaude Sonnet 4.6

Benchmark scores

Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.

General Knowledge

Even 2/2
BenchmarkGemini 3.5 FlashClaude Sonnet 4.6Diff
ARC-AGI-272.1011 / 59Thinking High (With Tools)58.3018 / 59thinking+13.80
HLE40.2045 / 150Thinking High (With Tools)4920 / 150thinking + 使用工具-8.80

AI Agent - Tool Usage

Gemini 3.5 Flash 1/1
BenchmarkGemini 3.5 FlashClaude Sonnet 4.6Diff
OSWorld-Verified78.403 / 15Thinking High (With Tools)72.508 / 15thinking + 使用工具+5.90

Specs

FieldGemini 3.5 FlashClaude Sonnet 4.6
PublisherGoogle Deep MindAnthropic
Release date2026-06-202026-02-17
Model typeMultimodal modelAI model
ArchitectureDenseDense
Parameters0.00.0
Context length1M1M
Max output655368192

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

ItemGemini 3.5 FlashClaude Sonnet 4.6
Text input$1.5 / 1M tokens$3 / 1M tokens
Text output$9 / 1M tokens$15 / 1M tokens
Cache readNot public$0.3 / 1M tokens
Cache writeNot public$3.75 / 1M tokens

Summary

  • Gemini 3.5 Flashleads in:AI Agent - Tool Usage (1/1)
  • Tied in:General Knowledge

On average across the 3 shared benchmarks, Gemini 3.5 Flash scores 3.63 higher.

Largest single-benchmark gap: ARC-AGI-2 — Gemini 3.5 Flash 72.10 vs Claude Sonnet 4.6 58.30 (+13.80).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.

Gemini 3.5 Flash detailsClaude Sonnet 4.6 details·Customize in compare tool