Models with context windows of 128K tokens or more.
Long-context models can process an entire book, a large codebase, or a stack of contracts in a single call, which makes them a key alternative to RAG for long-document workflows. Model families such as Gemini, Claude, GPT, and Qwen-Long offer context windows of roughly 200K to 2M tokens, depending on the specific model and version. The list below covers actual context sizes, long-document benchmark scores, and pricing for leading long-context models.
Filter by type, size, license, or publisher to narrow down the list.
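To get a feel for whether a document would fit in a given context window, a rough token estimate can help. The sketch below is illustrative only: it assumes about 4 characters per token (a common rule of thumb for English text, not any model's real tokenizer), and the names `estimate_tokens`, `fits_in_context`, and `reserved_for_output` are hypothetical. Actual counts vary by model and language, so use the provider's own tokenizer for anything that matters.

```python
import math

# Assumption: ~4 characters per token. This is a heuristic for English text,
# not the behavior of any particular model's tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Return a rough token estimate based on character count."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_in_context(text: str, context_window: int,
                    reserved_for_output: int = 4096) -> bool:
    """Check whether the estimated prompt size, plus room reserved
    for the model's reply, stays within the context window."""
    return estimate_tokens(text) + reserved_for_output <= context_window


if __name__ == "__main__":
    # Stand-in for a long document: 750,000 characters.
    document = "word " * 150_000
    for window in (128_000, 200_000, 2_000_000):
        print(f"{window:>9} tokens: fits = {fits_in_context(document, window)}")
```

Reserving part of the window for output is a design choice in this sketch: the context window generally has to hold both the prompt and the generated reply, so a document that barely fits on its own may still be too large in practice.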