Text Arena💻Coding

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Mar 31, 2026
1,014,709 votes
329 models

Pareto Frontier

CtrlScroll

Pareto Frontier Models

Top performers for their cost

Anthropic

claude-opus-4-6-thinking

Anthropic · Proprietary

1556

$20/M

gemini-3.1-pro-preview

Google · Proprietary

1534

$9.50/M

grok-4.20-beta-0309-reasoning

xAI · Proprietary

1525

$5/M

kimi-k2.5-thinking

Moonshot · Modified MIT

1511

$2.40/M

gemini-3-flash

Google · Proprietary

1509

$2.38/M

kimi-k2.5-instant

Moonshot · Modified MIT

1506

$1.53/M

glm-4.7

Z.ai · MIT

1487

$1.41/M

longcat-flash-chat

Meituan · MIT

1476

$0.65/M

deepseek-v3.2-exp-thinking

DeepSeek · MIT

1475

$0.38/M

deepseek-v3.2-thinking

DeepSeek · MIT

1474

$0.35/M

Stepfun

step-3.5-flash

StepFun · Apache 2.0

1449

$0.25/M

mimo-v2-flash (non-thinking)

Xiaomi · MIT

1447

$0.24/M

qwen3-32b

Alibaba · Apache 2.0

1408

$0.20/M

gpt-oss-120b

OpenAI · Apache 2.0

1391

$0.15/M

gpt-oss-20b

OpenAI · Apache 2.0

1369

$0.09/M

mistral-small-24b-instruct-2501

Mistral · Apache 2.0

1312

$0.07/M

gemma-3n-e4b-it

Google · Gemma

1309

$0.03/M