View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.
Model performance at each price point
Top performers for their cost
claude-fable-5
Anthropic · Proprietary
1508
$40/M
claude-opus-4-6-thinking
1503
$20/M
gemini-3.1-pro-preview
Google · Proprietary
1486
$9.50/M
gemini-3.5-flash
1476
$7.13/M
grok-4.20-beta-0309-reasoning
xAI · Proprietary
$5/M
qwen3.7-max-preview
Alibaba · Proprietary
1475
$3.13/M
gemini-3-flash
1473
$2.38/M
mimo-v2.5-pro
Xiaomi · MIT
1466
$0.76/M
gemma-4-31b
Google · Apache 2.0
1451
$0.34/M
deepseek-v4-flash
DeepSeek · MIT
1435
$0.16/M
qwen3-235b-a22b-thinking-2507
Alibaba · Apache 2.0
1399
$0.10/M
granite-4.1-8b
IBM · Apache 2.0
1307
$0.09/M
gemma-2-9b-it-simpo
Princeton · MIT
1280
$0.08/M
mistral-small-24b-instruct-2501
Mistral · Apache 2.0
1274
$0.07/M
llama-3.1-8b-instruct
Meta · Llama 3.1 Community
1211
$0.03/M