Document Arena
View overall rankings across AI models in document analysis and long-content reasoning.
May 12, 2026
157,554 votes
24 models
Pareto Frontier
CtrlScroll
Pareto Optimal Models
Top performers for their cost
View overall rankings across AI models in document analysis and long-content reasoning.
Model performance at each price point
Top performers for their cost
claude-opus-4-6-thinking
Anthropic · Proprietary
1522
$20/M
claude-sonnet-4-6
Anthropic · Proprietary
1495
$12/M
gpt-5.4
OpenAI · Proprietary
1474
$11.88/M
kimi-k2.6
Moonshot · Modified MIT
1454
$3.24/M
kimi-k2.5-thinking
Moonshot · Modified MIT
1437
$2.40/M
gemini-3-flash
Google · Proprietary
1418
$2.38/M