• New Chat
  • Leaderboard
  • Search
Terms of UsePrivacy Policy
Start Voting
Overview
Agent
Start Voting
Agent

Min
Max

Min
Max

Min
Max

Min
Max

Text Arena🧮Math

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Jul 1, 2026
629,157 votes
359 models
Rank by
Rank Spread
1
17
Anthropic
claude-opus-4-6-thinking
Anthropic · Proprietary
1517±12
2,913$5 / $251M
2
130
Anthropic
claude-fable-5
Anthropic · Proprietary
1514±38
243$10 / $501M
3
126
gemini-3.5-flash
Google · Proprietary
1508±21
815$1.50 / $91M
4
111
Anthropic
claude-opus-4-6
Anthropic · Proprietary
1508±11
3,295$5 / $251M
5
127
Anthropic
claude-opus-4-7-thinking
Anthropic · Proprietary
1497±13
2,186$5 / $251M
6
150
qwen3.7-max-preview
Alibaba · Proprietary
1495±40
219$1.25 / $3.751M
7
228
gpt-5.4-high
OpenAI · Proprietary
1492±12
2,693$2.50 / $151.1M
8
131
Anthropic
claude-opus-4-8-thinking
Anthropic · Proprietary
1491±18
1,105$5 / $251M
9
230
Anthropic
claude-opus-4-7
Anthropic · Proprietary
1489±13
2,241$5 / $251M
10
230
gemini-3.1-pro-preview
Google · Proprietary
1487±10
4,038$2 / $121M
11
337
mimo-v2.5-pro
Xiaomi · MIT
1481±15
1,744$0.43 / $0.871M
12
337
ernie-5.1
Baidu · Proprietary
1481±15
1,685N/AN/A
13
337
kimi-k2.6
Moonshot · Modified MIT
1479±14
1,718$0.95 / $4262.1K
14
338
gpt-5.5
OpenAI · Proprietary
1478±14
2,026$5 / $301.1M
15
337
gemini-3-pro
Google · Proprietary
1476±11
2,653$2 / $121M
16
355
qwen3.7-plus
Alibaba · Proprietary
1474±22
710$0.32 / $1.281M
17
342
gemini-3-flash
Google · Proprietary
1474±13
2,001$0.50 / $31M
18
350
qwen3.5-max-preview
Alibaba · Proprietary
1473±16
1,355N/AN/A
19
350
glm-5.1
Z.ai · MIT
1473±17
1,238$1.40 / $4.40202.8K
20
350
Anthropic
claude-opus-4-8
Anthropic · Proprietary
1473±18
1,078$5 / $251M
21
445
gpt-5.5-high
OpenAI · Proprietary
1472±14
1,995$5 / $301.1M
22
543
kimi-k2.5-thinking
Moonshot · Modified MIT
1470±11
3,162$0.60 / $3N/A
23
382
glm-5.2 (max)
Z.ai · MIT
1467±26
474$1.40 / $4.401M
24
386
gemma-4-26b-a4b
Google · Apache 2.0
1466±28
373N/AN/A
25
392
qwen3.6-max-preview
Alibaba · Proprietary
1465±30
358$1.04 / $6.24262.1K
26
386
gemma-4-31b
Google · Apache 2.0
1465±28
402$0.14 / $0.40262.1K
27
659
deepseek-v4-pro-thinking
DeepSeek · MIT
1464±15
1,765$0.43 / $0.871M
28
958
grok-4.20-beta-0309-reasoning
xAI · Proprietary
1462±12
2,814$2 / $62M
29
393
nvidia-nemotron-3-ultra-550b-a55b-nvfp4
Nvidia · OpenMDW-1.1
1462±27
452N/AN/A
30
1061
Anthropic
claude-sonnet-4-6
Anthropic · Proprietary
1461±12
2,728$3 / $151M
31
1057
Anthropic
claude-opus-4-5-20251101
Anthropic · Proprietary
1460±9
4,344$5 / $25200K
32
1068
Anthropic
claude-opus-4-5-20251101-thinking-32k
Anthropic · Proprietary
1459±12
2,268$5 / $25200K
33
1072
gpt-5.4
OpenAI · Proprietary
1457±12
2,889$2.50 / $151.1M
34
2123
Anthropic
claude-sonnet-5-thinking
Anthropic · Proprietary
1455±42
206$2 / $101M
35
1095
Meta
muse-spark
Meta · Proprietary
1452±20
863N/AN/A
36
1876
gemini-2.5-pro
Google · Proprietary
1451±7
7,646$1.25 / $101M
37
1587
qwen3.6-plus
Alibaba · Proprietary
1450±13
2,101$0.33 / $1.951M
38
1492
qwen3-max-preview
Alibaba · Proprietary
1450±15
1,524$0.78 / $3.90262.1K
39
1884
gemini-3-flash (thinking-minimal)
Google · Proprietary
1449±9
4,226$0.50 / $31M
40
1786
qwen3.5-397b-a17b
Alibaba · Apache 2.0
1448±11
2,991$0.39 / $2.45256K
41
1884
Anthropic
claude-sonnet-4-5-20250929-thinking-32k
Anthropic · Proprietary
1448±9
4,911$3 / $15200K
42
1595
mimo-v2-pro
Xiaomi · Proprietary
1447±15
1,630$1 / $31M
43
2296
gpt-5.1-high
OpenAI · Proprietary
1442±12
2,501$1.25 / $10400K
44
10118
kimi-k2.5-instant
Moonshot · Modified MIT
1442±25
513$0.38 / $2.02262.1K
45
18107
qwen3-next-80b-a3b-instruct
Alibaba · Apache 2.0
1442±17
1,213$0.09 / $1.10262.1K
46
16110
minimax-m3
MiniMax · MiniMax Community License
1441±20
952$0.60 / $2.40N/A
47
23100
gpt-5.2-high
OpenAI · Proprietary
1441±11
2,987$1.75 / $14400K
48
22106
deepseek-v4-pro
DeepSeek · MIT
1440±13
2,054$0.43 / $0.871M
49
6136
amazon-nova-experimental-chat-26-02-10
Amazon · Proprietary
1440±39
210N/AN/A
50
15114
longcat-flash-chat
Meituan · MIT
1440±22
687$0.20 / $0.80131.1K
51
22106
mimo-v2.5
Xiaomi · MIT
1440±14
1,886$0.10 / $0.281M
52
15118
qwen3-max-2025-09-23
Alibaba · Proprietary
1439±24
585$0.78 / $3.90262.1K
53
26104
Bytedance
dola-seed-2.0-pro
Bytedance · Proprietary
1438±11
3,360N/AN/A
54
22107
glm-5
Z.ai · MIT
1438±15
1,609$1 / $3.20202.8K
55
25107
ernie-5.0-0110
Baidu · Proprietary
1437±13
2,154N/AN/A
56
27106
deepseek-v3.2
DeepSeek · MIT
1437±11
3,006$0.23 / $0.34131.1K
57
27107
grok-4.20-multi-agent-beta-0309
xAI · Proprietary
1436±12
2,788$2 / $62M
58
24113
grok-4.20-beta1
xAI · Proprietary
1435±15
1,610N/AN/A
59
27113
gpt-5.2-chat-latest-20260210
OpenAI · Proprietary
1434±13
2,088$1.75 / $14128K
60
17129
mistral-medium-3.5
Mistral · Modified MIT
1434±25
521$1.50 / $7.50262.1K
61
22121
mimo-v2-omni
Xiaomi · Proprietary
1434±20
922$0.40 / $2262.1K
62
27113
qwen3.5-27b
Alibaba · Apache 2.0
1433±15
1,655$0.20 / $1.56262.1K
63
28113
glm-4.6
Z.ai · MIT
1433±13
2,106$0.43 / $1.74202.8K
64
27117
amazon-nova-experimental-chat-11-10
Amazon · Proprietary
1432±14
1,586N/AN/A
65
30112
gemini-3.1-flash-lite-preview
Google · Proprietary
1432±11
3,271$0.25 / $1.501M
66
33107
qwen3-235b-a22b-instruct-2507
Alibaba · Apache 2.0
1432±8
5,926$0.26 / $1.06N/A
67
29118
longcat-flash-chat-2602-exp
Meituan · Proprietary
1431±14
1,754N/AN/A
68
30113
Anthropic
claude-opus-4-1-20250805-thinking-16k
Anthropic · Proprietary
1431±11
3,028$15 / $75200K
69
31113
kimi-k2-thinking-turbo
Moonshot · Modified MIT
1430±10
3,786$1.15 / $8262.1K
70
30118
qwen3.5-122b-a10b
Alibaba · Apache 2.0
1429±14
1,779$0.26 / $2.08262.1K
71
30122
deepseek-v4-flash
DeepSeek · MIT
1427±14
1,919$0.09 / $0.181M
72
30125
glm-4.5
Z.ai · MIT
1427±16
1,423$0.60 / $2.20131.1K
73
28131
amazon-nova-experimental-chat-10-20
Amazon · Proprietary
1427±20
806N/AN/A
74
27133
qwen3-vl-235b-a22b-instruct
Alibaba · Apache 2.0
1426±23
702$0.20 / $0.88262.1K
75
23140
deepseek-v3.2-exp-thinking
DeepSeek · MIT
1426±27
480$0.27 / $0.41163.8K
76
39119
o3-2025-04-16
OpenAI · Proprietary
1425±10
3,727$2 / $8200K
77
37124
grok-4-0709
xAI · Proprietary
1424±12
2,263$3 / $15256K
78
29133
deepseek-v3.2-exp
DeepSeek · MIT
1424±20
775$0.27 / $0.41163.8K
79
29134
glm-4.7
Z.ai · MIT
1423±21
709$0.40 / $1.75202.8K
80
33130
gpt-5.5-instant
OpenAI · Proprietary
1423±16
1,468$5 / $301.1M
81
40123
grok-4.1-thinking
xAI · Proprietary
1423±10
3,833N/AN/A
82
26145
Tencent
hunyuan-hy3-preview
Tencent · tencent-hunyuan-community
1422±27
406$0.29 / $1.17262.1K
83
37129
deepseek-v4-flash-thinking
DeepSeek · MIT
1422±14
1,892$0.25 / $1.75200K
84
43122
Anthropic
claude-sonnet-4-5-20250929
Anthropic · Proprietary
1422±9
4,918$3 / $15200K
85
43123
Anthropic
claude-opus-4-1-20250805
Anthropic · Proprietary
1422±9
4,725$15 / $75200K
86
31133
deepseek-v3.1
DeepSeek · MIT
1422±18
990$1.23 / $4.94N/A
87
18155
amazon-nova-experimental-chat-12-10
Amazon · Proprietary
1420±37
234N/AN/A
88
44127
gpt-5.2
OpenAI · Proprietary
1420±10
3,889$1.75 / $14400K
89
43131
deepseek-v3.2-thinking
DeepSeek · MIT
1418±12
2,504$0.23 / $0.34131.1K
90
45129
grok-4.1
xAI · Proprietary
1418±9
4,236N/AN/A
91
42133
minimax-m2.7
MiniMax · Modified MIT
1418±13
2,300$0.18 / $0.72204.8K
92
44133
gpt-5.4-mini-high
OpenAI · Proprietary
1418±12
2,676$0.75 / $4.50400K
93
28148
grok-4-fast-chat
xAI · Proprietary
1418±29
399$3 / $15256K
94
43133
gemini-2.5-flash-preview-09-2025
Google · Proprietary
1417±13
1,944$0.30 / $2.501M
95
30149
qwen3-vl-235b-a22b-thinking
Alibaba · Apache 2.0
1415±28
428$0.26 / $2.60131.1K
96
48133
mistral-large-3
Mistral · Apache 2.0
1415±11
2,812$0.50 / $1.50N/A
97
37146
deepseek-v3.1-thinking
DeepSeek · MIT
1414±22
664$1.23 / $4.94N/A
98
37148
qwen3-235b-a22b-thinking-2507
Alibaba · Apache 2.0
1412±24
489$0.15 / $1.50262.1K
99
45144
gpt-4.5-preview-2025-02-27
OpenAI · Proprietary
1412±15
1,393$75 / $150128K
100
29165
ernie-5.0-preview-1022
Baidu · Proprietary
1410±34
268N/AN/A
101
62134
mistral-medium-2508
Mistral · Proprietary
1410±8
5,821$0.40 / $2131.1K
102
63133
gemini-2.5-flash
Google · Proprietary
1410±7
7,874$0.30 / $2.501M
103
55140
gpt-5.1
OpenAI · Proprietary
1410±11
2,866$1.25 / $10400K
104
53141
qwen3.5-flash
Alibaba · Proprietary
1410±12
2,772N/AN/A
105
27169
Tencent
hunyuan-t1-20250711
Tencent · Proprietary
1410±38
236N/AN/A
106
53145
gpt-5-chat
OpenAI · Proprietary
1409±14
1,783$1.25 / $10128K
107
54144
gpt-5.4-nano-high
OpenAI · Proprietary
1409±12
2,559$0.20 / $1.25400K
108
61144
Stepfun
step-3.5-flash
StepFun · Apache 2.0
1408±11
3,006$0.10 / $0.30262.1K
109
27172
deepseek-v3.1-terminus-thinking
DeepSeek · MIT
1407±41
199$0.27 / $0.95163.8K
110
44155
ernie-5.0-preview-1203
Baidu · Proprietary
1406±23
619N/AN/A
111
69140
chatgpt-4o-latest-20250326
OpenAI · Proprietary
1406±8
5,723$5 / $15128K
112
68145
grok-4-1-fast-reasoning
xAI · Proprietary
1404±10
3,506$0.20 / $0.502M
113
62148
qwen3.5-35b-a3b
Alibaba · Apache 2.0
1404±14
1,762$0.14 / $1262.1K
114
54149
grok-4-fast-reasoning
xAI · Proprietary
1403±18
1,083$0.20 / $0.502M
115
53155
deepseek-r1-0528
DeepSeek · MIT
1403±20
869$0.50 / $2.15163.8K
116
37170
amazon-nova-experimental-chat-26-01-10
Amazon · Proprietary
1403±33
263N/AN/A
117
36174
deepseek-v3.1-terminus
DeepSeek · MIT
1399±38
219$0.27 / $0.95163.8K
118
76149
qwen3-235b-a22b-no-thinking
Alibaba · Apache 2.0
1399±12
2,390$0.46 / $1.82131.1K
119
44171
qwen3-32b
Alibaba · Apache 2.0
1398±30
316$0.08 / $0.28131.1K
120
75152
gpt-5-high
OpenAI · Proprietary
1398±14
1,888$1.25 / $10400K
121
74155
glm-4.5-air
Z.ai · MIT
1398±15
1,539$0.13 / $0.85131.1K
122
62165
kimi-k2-0905-preview
Moonshot · Modified MIT
1397±21
759$0.60 / $2.50262.1K
123
81150
mimo-v2-flash (non-thinking)
Xiaomi · MIT
1396±11
2,843$0.10 / $0.30262.1K
124
77156
o3-mini-high
OpenAI · Proprietary
1396±13
1,909$1.10 / $4.40200K
125
77158
qwen3-235b-a22b
Alibaba · Apache 2.0
1395±14
1,604$0.46 / $1.82131.1K
126
67165
qwen3-next-80b-a3b-thinking
Alibaba · Apache 2.0
1395±20
828$0.10 / $0.78262.1K
127
71165
minimax-m2.1-preview
MiniMax · MIT
1395±18
1,008$0.30 / $1.20204.8K
128
76161
qwen3-30b-a3b-instruct-2507
Alibaba · Apache 2.0
1395±15
1,427$0.05 / $0.19131.1K
129
40175
nvidia-llama-3.3-nemotron-super-49b-v1.5
Nvidia · Nvidia Open
1394±39
194$0.10 / $0.40131.1K
130
94150
Anthropic
claude-haiku-4-5-20251001
Anthropic · Proprietary
1393±8
5,840$1 / $5200K
131
83164
deepseek-r1
DeepSeek · MIT
1392±14
1,606$0.70 / $2.50163.8K
132
83165
grok-4.3
xAI · Proprietary
1392±14
1,904$1.25 / $2.501M
133
91164
Anthropic
claude-opus-4-20250514-thinking-16k
Anthropic · Proprietary
1390±12
2,238$15 / $75200K
134
94162
grok-3-preview-02-24
xAI · Proprietary
1390±11
2,676$3 / $15131.1K
135
94164
o1-2024-12-17
OpenAI · Proprietary
1388±11
2,986$15 / $60200K
136
93168
gpt-oss-120b
OpenAI · Apache 2.0
1388±14
1,793$0.03 / $0.15131.1K
137
98165
o4-mini-2025-04-16
OpenAI · Proprietary
1387±11
2,937$1.10 / $4.40200K
138
98169
gpt-5.3-chat-latest
OpenAI · Proprietary
1385±13
2,046$1.75 / $14128K
139
93172
grok-3-mini-high
xAI · Proprietary
1384±18
976$0.25 / $1.27N/A
140
68176
intellect-3
Prime Intellect · MIT
1384±31
333$0.20 / $1.10131.1K
141
80175
nvidia-nemotron-3-super-120b-a12b
Nvidia · NVIDIA Open Model
1383±25
519N/AN/A
142
104170
minimax-m2.5
MiniMax · Modified MIT
1381±12
2,437$0.12 / $0.48204.8K
143
94175
mimo-v2-flash (thinking)
Xiaomi · MIT
1378±22
632$0.10 / $0.30262.1K
144
105173
gpt-5-mini-high
OpenAI · Proprietary
1376±15
1,458$0.25 / $2400K
145
108173
Anthropic
claude-sonnet-4-20250514-thinking-32k
Anthropic · Proprietary
1374±13
2,023$3 / $151M
146
114173
deepseek-v3-0324
DeepSeek · MIT
1374±10
3,189$3 / $4.5032.8K
147
105175
nvidia-nemotron-3-nano-30b-a3b-bf16
Nvidia · NVIDIA Open Model
1374±18
987$0.06 / $0.24262.1K
148
113173
gemini-2.5-flash-lite-preview-09-2025-no-thinking
Google · Proprietary
1374±11
2,878$0.10 / $0.401M
149
119172
o3-mini
OpenAI · Proprietary
1373±8
4,719$1.10 / $4.40200K
150
114173
Anthropic
claude-opus-4-20250514
Anthropic · Proprietary
1373±11
2,767$15 / $75200K
151
118173
o1-preview
OpenAI · Proprietary
1373±10
4,569$15 / $60N/A
152
98181
ling-flash-2.0
Ant Group · MIT
1371±26
460N/AN/A
153
114175
grok-3-mini-beta
xAI · Proprietary
1370±14
1,528$0.30 / $0.50131.1K
154
122174
qwen2.5-max
Alibaba · Proprietary
1369±10
3,305N/AN/A
155
120176
kimi-k2-0711-preview
Moonshot · Modified MIT
1367±14
1,694$0.60 / $2.50131.1K
156
125175
gpt-4.1-2025-04-14
OpenAI · Proprietary
1367±10
3,224$2 / $81M
157
97184
Stepfun
step-3
StepFun · Apache 2.0
1367±31
352$0.57 / $1.4265.5K
158
119177
trinity-large-thinking
Arcee AI · Apache 2.0
1366±15
1,622$0.25 / $0.80262.1K
159
120177
qwen3-coder-480b-a35b-instruct
Alibaba · Apache 2.0
1366±15
1,626$0.40 / $1.60262.1K
160
131177
gemini-2.5-flash-lite-preview-06-17-thinking
Google · Proprietary
1363±12
2,094$0.10 / $0.401M
161
131178
minimax-m1
MiniMax · Apache 2.0
1362±13
1,795$0.40 / $2.201M
162
120183
nova-2-lite
Amazon · Proprietary
1360±20
826$0.30 / $2.501M
163
101190
llama-3.1-nemotron-ultra-253b-v1
Nvidia · Nvidia Open Model
1359±37
209$0.60 / $1.80131.1K
164
122183
Tencent
hunyuan-turbos-20250416
Tencent · Proprietary
1359±19
846N/AN/A
165
132180
qwq-32b
Alibaba · Apache 2.0
1359±14
1,719$0.50 / $116.4K
166
121184
glm-4.7-flash
Z.ai · MIT
1358±21
717$0.06 / $0.40202.8K
167
137177
o1-mini
OpenAI · Proprietary
1358±8
7,499$1.10 / $4.40N/A
168
134180
Anthropic
claude-sonnet-4-20250514
Anthropic · Proprietary
1358±12
2,470$3 / $151M
169
136183
qwen3-30b-a3b
Alibaba · Apache 2.0
1355±14
1,707$0.12 / $0.50131.1K
170
140183
mistral-medium-2505
Mistral · Proprietary
1352±12
2,228$0.40 / $2131.1K
171
111194
minimax-m2
MiniMax · Apache 2.0
1352±33
319$0.26 / $1.02204.8K
172
146181
gemini-2.0-flash-001
Google · Proprietary
1352±9
4,065$0.10 / $0.401M
173
112194
glm-4.5v
Z.ai · MIT
1351±34
276$0.60 / $1.8065.5K
174
131192
ring-flash-2.0
Ant Group · MIT
1348±27
454N/AN/A
175
154186
gpt-4.1-mini-2025-04-14
OpenAI · Proprietary
1343±11
2,693$0.40 / $1.601M
176
148190
mistral-small-2506
Mistral · Apache 2.0
1341±18
1,040$0.10 / $0.3032K
177
161187
Anthropic
claude-3-7-sonnet-20250219-thinking-32k
Anthropic · Proprietary
1337±11
2,792$3 / $15200K
178
160192
trinity-large-preview
Arcee AI · Apache 2.0
1335±14
1,890$0.15 / $0.45131K
179
163201
qwen-plus-0125
Alibaba · Proprietary
1326±19
732$0.40 / $1.20131.1K
180
172200
Anthropic
claude-3-7-sonnet-20250219
Anthropic · Proprietary
1318±10
3,358$3 / $15200K
181
165209
Stepfun
step-1o-turbo-202506
StepFun · Proprietary
1318±24
564N/AN/A
182
169209
gpt-oss-20b
OpenAI · Apache 2.0
1317±22
678$0.03 / $0.14131.1K
183
161218
olmo-3-32b-think
Ai2 · Apache 2.0
1316±32
315$0.15 / $0.5065.5K
184
165213
gpt-5-nano-high
OpenAI · Proprietary
1316±27
493$0.05 / $0.40400K
185
175201
gemini-1.5-pro-002
Google · Proprietary
1315±7
7,610$3.50 / $10.502.1M
186
156229
granite-4.1-8b
IBM · Apache 2.0
1313±39
236$0.05 / $0.10131.1K
187
177205
gemma-3-27b-it
Google · Gemma
1311±9
3,580$0.08 / $0.16131.1K
188
171214
olmo-3.1-32b-instruct
Ai2 · Apache 2.0
1311±23
697$0.20 / $0.6065.5K
189
175205
deepseek-v3
DeepSeek · DeepSeek
1311±11
2,721$1.14 / $4.56N/A
190
177206
gemini-2.0-flash-lite-preview-02-05
Google · Proprietary
1309±10
2,814$0.07 / $0.301M
191
171221
gemma-3-12b-it
Google · Gemma
1307±27
389$0.05 / $0.15131.1K
192
179205
Anthropic
claude-3-5-sonnet-20241022
Anthropic · Proprietary
1307±7
10,016$3 / $15200K
193
173218
Stepfun
step-2-16k-exp-202412
StepFun · Proprietary
1304±20
642N/AN/A
194
179209
Anthropic
claude-3-5-sonnet-20240620
Anthropic · Proprietary
1303±7
11,359$3 / $15200K
195
179211
athene-v2-chat
NexusFlow · NexusFlow
1300±9
3,412N/AN/A
196
179213
Meta
llama-4-maverick-17b-128e-instruct
Meta · Llama 4
1300±11
2,839$0.63 / $1.80131.1K
197
179213
01.AI
yi-lightning
01 AI · Proprietary
1300±10
3,921N/AN/A
198
180213
Cohere
command-a-03-2025
Cohere · CC-BY-NC-4.0
1299±9
3,990$2.50 / $10256K
199
173232
olmo-3.1-32b-think
Ai2 · Apache 2.0
1298±26
473$0.15 / $0.5065.5K
200
179218
qwen2.5-plus-1127
Alibaba · Proprietary
1297±14
1,404N/AN/A
201
173239
Tencent
hunyuan-turbos-20250226
Tencent · Proprietary
1294±31
238N/AN/A
202
182232
deepseek-v2.5-1210
DeepSeek · DeepSeek
1288±17
1,031N/AN/A
203
182235
glm-4-plus-0111
Z.ai · Proprietary
1287±19
721N/AN/A
204
185231
Meta
llama-4-scout-17b-16e-instruct
Meta · Llama
1286±13
1,943$0.40 / $0.708.2K
205
189224
gpt-4o-2024-08-06
OpenAI · Proprietary
1285±8
6,826$2.50 / $10128K
206
189224
gpt-4o-2024-05-13
OpenAI · Proprietary
1284±7
15,103$5 / $15128K
207
190226
grok-2-2024-08-13
xAI · Proprietary
1283±7
8,950$2 / $10131.1K
208
190229
qwen2.5-72b-instruct
Alibaba · Qwen
1283±8
5,415$1.20 / $1.20N/A
209
194231
Meta
llama-3.1-405b-instruct-fp8
Meta · Llama 3.1 Community
1281±7
8,482$4 / $432.8K
210
182241
Tencent
hunyuan-large-2025-02-10
Tencent · Proprietary
1281±24
497N/AN/A
211
195233
Meta
llama-3.1-405b-instruct-bf16
Meta · Llama 3.1 Community
1278±8
5,215$4 / $432.8K
212
195238
qwen-max-0919
Alibaba · Qwen
1275±12
2,249$1.60 / $6.4032.8K
213
195238
glm-4-plus
Z.ai · Proprietary
1275±10
3,599$0.44 / $1.76204.8K
214
186246
Tencent
hunyuan-standard-2025-02-10
Tencent · Proprietary
1274±24
499N/AN/A
215
186246
gpt-4.1-nano-2025-04-14
OpenAI · Proprietary
1274±23
582$0.10 / $0.401M
216
182247
Tencent
hunyuan-turbo-0110
Tencent · Proprietary
1273±31
243N/AN/A
217
199237
Anthropic
claude-3-opus-20240229
Anthropic · Proprietary
1273±6
25,769$15 / $75200K
218
198239
gemini-advanced-0514
Google · Proprietary
1272±9
6,395N/AN/A
219
199238
gpt-4-turbo-2024-04-09
OpenAI · Proprietary
1272±8
13,217$10 / $30128K
220
195243
llama-3.1-nemotron-70b-instruct
Nvidia · Llama 3.1
1271±17
1,041$1.20 / $1.20131.1K
221
198240
deepseek-v2.5
DeepSeek · DeepSeek
1271±10
3,649N/AN/A
222
201239
gemini-1.5-pro-001
Google · Proprietary
1269±8
10,492$3.50 / $10.502.1M
223
201239
gpt-4-1106-preview
OpenAI · Proprietary
1269±8
13,306$10 / $30128K
224
199240
gemini-1.5-flash-002
Google · Proprietary
1269±9
4,789$0.07 / $0.301M
225
186250
Tencent
hunyuan-large-vision
Tencent · Proprietary
1268±30
351N/AN/A
226
202240
gpt-4-0125-preview
OpenAI · Proprietary
1268±8
12,374$10 / $30128K
227
204240
gpt-4o-mini-2024-07-18
OpenAI · Proprietary
1267±7
9,322$0.15 / $0.60128K
228
202240
Meta
llama-3.3-70b-instruct
Meta · Llama-3.3
1267±8
5,777$0.10 / $0.32131.1K
229
206241
grok-2-mini-2024-08-13
xAI · Proprietary
1265±8
7,261$2 / $10131.1K
230
209243
mistral-large-2407
Mistral · Mistral Research
1261±8
6,664$2 / $6131.1K
231
203246
mistral-small-3.1-24b-instruct-2503
Mistral · Apache 2.0
1261±13
2,129$0.10 / $0.3032K
232
208246
mistral-large-2411
Mistral · MRL
1261±9
3,574$2 / $6128K
233
223246
Meta
llama-3.1-70b-instruct
Meta · Llama 3.1 Community
1252±8
7,677$0.40 / $0.40131.1K
234
218248
amazon-nova-pro-v1.0
Amazon · Proprietary
1252±10
2,978$0.80 / $3.20300K
235
211254
gemma-3n-e4b-it
Google · Gemma
1251±15
1,572$0.06 / $0.1232.8K
236
209256
qwen2.5-coder-32b-instruct
Alibaba · Apache 2.0
1251±19
725$0.87 / $0.8732K
237
202259
magistral-medium-2506
Mistral · Proprietary
1249±26
553$2 / $540K
238
198262
ibm-granite-h-small
IBM · Apache 2.0
1249±32
356N/AN/A
239
225255
phi-4
Microsoft · MIT
1246±10
2,764$0.07 / $0.1416.4K
240
227251
Anthropic
claude-3-5-haiku-20241022
Anthropic · Proprietary
1244±7
6,364$1 / $5200K
241
210262
llama-3.1-tulu-3-70b
Ai2 · Llama 3.1
1242±24
397N/AN/A
242
225257
deepseek-coder-v2
DeepSeek · DeepSeek License
1241±14
1,858$0.14 / $0.28128K
243
227257
mistral-small-24b-instruct-2501
Mistral · Apache 2.0
1240±13
1,683$0.05 / $0.0832.8K
244
210264
gemma-3-4b-it
Google · Gemma
1239±28
423$0.05 / $0.10131.1K
245
232257
qwen2-72b-instruct
Alibaba · Qianwen LICENSE
1235±9
4,835$0.90 / $0.9032.8K
246
214268
Tencent
hunyuan-standard-256k
Tencent · Proprietary
1235±28
361N/AN/A
247
233260
athene-70b-0725
NexusFlow · CC-BY-NC-4.0
1231±10
2,921N/AN/A
248
234260
gpt-4-0314
OpenAI · Proprietary
1230±10
7,052$30 / $608.2K
249
227268
llama-3.1-nemotron-51b-instruct
Nvidia · Llama 3.1
1230±22
507N/AN/A
250
236260
gemini-1.5-flash-001
Google · Proprietary
1229±8
8,392$0.07 / $0.301M
251
235263
amazon-nova-lite-v1.0
Amazon · Proprietary
1227±11
2,511$0.06 / $0.24300K
252
236268
reka-core-20240904
Reka AI · Proprietary
1222±14
1,207N/AN/A
253
236268
jamba-1.5-large
AI21 Labs · Jamba Open
1221±15
1,147$2 / $8256K
254
238268
glm-4-0520
Z.ai · Proprietary
1218±15
1,191N/AN/A
255
242264
Meta
llama-3-70b-instruct
Meta · Llama 3 Community
1218±7
20,941$0.51 / $0.748.2K
256
242267
gpt-4-0613
OpenAI · Proprietary
1217±8
11,181$30 / $608.2K
257
239268
nemotron-4-340b-instruct
Nvidia · NVIDIA Open Model
1216±12
2,352N/AN/A
258
234276
qwq-32b-preview
Alibaba · Apache 2.0
1213±24
480$0.50 / $116.4K
259
243268
Anthropic
claude-3-sonnet-20240229
Anthropic · Proprietary
1213±8
13,766$3 / $15200K
260
246268
gemma-2-27b-it
Google · Gemma license
1212±7
10,170$0.65 / $0.658.2K
261
237283
olmo-2-0325-32b-instruct
Ai2 · Apache-2.0
1207±28
375$0.05 / $0.20128K
262
249269
gemini-1.5-flash-8b-001
Google · Proprietary
1207±8
5,036$0.07 / $0.301M
263
248271
amazon-nova-micro-v1.0
Amazon · Proprietary
1206±11
2,455$0.04 / $0.14128K
264
251274
mistral-large-2402
Mistral · Proprietary
1200±9
7,987$4 / $1232K
265
251276
Cohere
c4ai-aya-expanse-32b
Cohere · CC-BY-NC-4.0
1200±10
3,854N/AN/A
266
251282
reka-flash-20240904
Reka AI · Proprietary
1195±14
1,284N/AN/A
267
246287
llama-3.1-tulu-3-8b
Ai2 · Llama 3.1
1195±25
363N/AN/A
268
252287
ministral-8b-2410
Mistral · MRL
1188±20
683$0.10 / $0.10131.1K
269
261282
Anthropic
claude-3-haiku-20240307
Anthropic · Proprietary
1188±7
14,983$0.25 / $1.25200K
270
260284
Cohere
command-r-plus-08-2024
Cohere · CC-BY-NC-4.0
1188±14
1,467$2.50 / $10128K
271
261284
qwen1.5-110b-chat
Alibaba · Qianwen LICENSE
1185±11
3,188N/AN/A
272
262284
mixtral-8x22b-instruct-v0.1
Mistral · Apache 2.0
1184±9
6,778$0.90 / $0.9065.5K
273
263284
gemma-2-9b-it
Google · Gemma license
1183±8
7,110$0.03 / $0.098.2K
274
262286
01.AI
yi-1.5-34b-chat
01 AI · Apache-2.0
1182±11
2,985N/AN/A
275
263287
mistral-medium
Mistral · Proprietary
1180±11
4,406$2.70 / $8.1032K
276
262291
internlm2_5-20b-chat
InternLM · Other
1180±15
1,387$0 / $032.8K
277
265286
Meta
llama-3.1-8b-instruct
Meta · Llama 3.1 Community
1179±8
7,135$0.02 / $0.03131.1K
278
265292
phi-3-medium-4k-instruct
Microsoft · MIT
1173±10
3,238$0.17 / $0.68N/A
279
265294
gemma-2-9b-it-simpo
Princeton · MIT
1173±15
1,285$0.03 / $0.098.2K
280
265297
Cohere
c4ai-aya-expanse-8b
Cohere · CC-BY-NC-4.0
1168±15
1,307N/AN/A
281
265297
reka-flash-21b-20240226-online
Reka AI · Proprietary
1168±14
2,028N/AN/A
282
272296
Cohere
command-r-plus
Cohere · CC-BY-NC-4.0
1164±8
9,769$2.50 / $10128K
283
272297
qwen1.5-72b-chat
Alibaba · Qianwen LICENSE
1164±10
5,327N/AN/A
284
268300
jamba-1.5-mini
AI21 Labs · Jamba Open
1160±16
1,094$0.20 / $0.40256K
285
265307
granite-3.1-2b-instruct
IBM · Apache 2.0
1159±26
391N/AN/A
286
277300
reka-flash-21b-20240226
Reka AI · Proprietary
1156±11
3,363N/AN/A
287
277300
qwen1.5-32b-chat
Alibaba · Qianwen LICENSE
1155±12
2,649N/AN/A
288
277302
Cohere
command-r-08-2024
Cohere · CC-BY-NC-4.0
1155±14
1,601$0.15 / $0.60128K
289
277303
phi-3-mini-4k-instruct-june-2024
Microsoft · MIT
1152±14
1,568$0.13 / $0.524.1K
290
267312
granite-3.1-8b-instruct
IBM · Apache 2.0
1152±28
382N/AN/A
291
279300
Meta
llama-3-8b-instruct
Meta · Llama 3 Community
1151±8
14,252$0.14 / $0.148.2K
292
278304
phi-3-small-8k-instruct
Microsoft · MIT
1151±13
2,092$0.15 / $0.60N/A
293
274311
zephyr-orpo-141b-A35b-v0.1
HuggingFace · Apache 2.0
1148±22
589N/AN/A
294
281303
mixtral-8x7b-instruct-v0.1
Mistral · Apache 2.0
1147±8
9,663$0.63 / $0.6332K
295
280307
dbrx-instruct-preview
Databricks · DBRX LICENSE
1145±11
4,001$0.60 / $0.6032.8K
296
279311
granite-3.0-8b-instruct
IBM · Apache 2.0
1143±19
873N/AN/A
297
284307
gpt-3.5-turbo-0125
OpenAI · Proprietary
1142±8
8,626$0.50 / $1.5016.4K
298
280310
gpt-3.5-turbo-1106
OpenAI · Proprietary
1141±15
2,134$1 / $216.4K
299
288310
gemma-2-2b-it
Google · Gemma license
1135±8
6,599N/AN/A
300
284314
gemini-pro-dev-api
Google · Proprietary
1132±14
2,274$0.35 / $1.0532.8K
301
284316
gemini-pro
Google · Proprietary
1129±19
993$0.35 / $1.0532.8K
302
288316
Meta
llama-3.2-3b-instruct
Meta · Llama 3.2
1126±16
1,136$0.05 / $0.34131.1K
303
289316
qwen1.5-14b-chat
Alibaba · Qianwen LICENSE
1125±13
2,184$0.30 / $0.30N/A
304
291316
starling-lm-7b-beta
Nexusflow · Apache-2.0
1124±14
1,973N/AN/A
305
295316
Cohere
command-r
Cohere · CC-BY-NC-4.0
1120±9
6,682$0.15 / $0.60128K
306
292322
granite-3.0-2b-instruct
IBM · Apache 2.0
1117±19
908N/AN/A
307
292324
wizardlm-70b
Microsoft · Llama 2 Community
1116±19
903N/AN/A
308
295320
01.AI
yi-34b-chat
01 AI · Yi License
1114±13
2,043$0.90 / $0.904.1K
309
299321
phi-3-mini-4k-instruct
Microsoft · MIT
1111±12
2,564$0.13 / $0.52N/A
310
300322
snowflake-arctic-instruct
Snowflake · Apache 2.0
1109±11
4,793N/AN/A
311
295327
deepseek-llm-67b-chat
DeepSeek · DeepSeek License
1108±23
576N/AN/A
312
297326
tulu-2-dpo-70b
AllenAI/UW · AI2 ImpACT Low-risk
1107±19
888N/AN/A
313
301324
gemma-1.1-7b-it
Google · Gemma license
1107±11
3,039$0.03 / $0.098.2K
314
300324
openchat-3.5-0106
OpenChat · Apache-2.0
1107±14
1,726N/AN/A
315
292333
smollm2-1.7b-instruct
HuggingFace · Apache 2.0
1105±33
271N/AN/A
316
301332
openhermes-2.5-mistral-7b
NousResearch · Apache-2.0
1098±20
697$0.17 / $0.17N/A
317
306330
Meta
llama-2-70b-chat
Meta · Llama 2 Community
1091±10
4,740$0.70 / $2.804.1K
318
306332
phi-3-mini-128k-instruct
Microsoft · MIT
1089±13
2,813$0.13 / $0.52N/A
319
306333
Meta
llama-3.2-1b-instruct
Meta · Llama 3.2
1086±16
1,162$0.03 / $0.20131.1K
320
310333
mistral-7b-instruct-v0.2
Mistral · Apache-2.0
1085±12
2,605$0.20 / $0.2032.8K
321
310334
starling-lm-7b-alpha
UC Berkeley · CC-BY-NC-4.0
1081±16
1,300N/AN/A
322
307336
qwen1.5-7b-chat
Alibaba · Qianwen LICENSE
1080±20
690$0.20 / $0.20N/A
323
306340
dolphin-2.2.1-mistral-7b
Cognitive Computations · Apache-2.0
1077±32
219$0.50 / $0.5016.4K
324
308340
llama2-70b-steerlm-chat
Nvidia · Llama 2 Community
1072±27
440N/AN/A
325
313339
openchat-3.5
OpenChat · Apache-2.0
1071±18
945$0.20 / $0.20N/A
326
315336
vicuna-33b
LMSYS · Non-commercial
1071±12
2,663$0 / $02K
327
313340
qwen-14b-chat
Alibaba · Qianwen LICENSE
1068±24
534N/AN/A
328
315339
gemma-7b-it
Google · Gemma license
1066±16
1,120$0.05 / $0.088.2K
329
316339
Meta
llama-2-13b-chat
Meta · Llama 2 Community
1065±13
2,218$0.25 / $0.254.1K
330
314342
solar-10.7b-instruct-v1.0
Upstage AI · CC-BY-NC-4.0
1064±22
604$0.30 / $0.30N/A
331
315342
nous-hermes-2-mixtral-8x7b-dpo
NousResearch · Apache-2.0
1061±21
628$0.90 / $0.90N/A
332
318343
Meta
codellama-34b-instruct
Meta · Llama 2 Community
1056±19
770$0.35 / $1.4016.4K
333
321345
palm-2
Google · Proprietary
1049±19
901$0.50 / $0.5025.8K
334
322344
gemma-1.1-2b-it
Google · Gemma license
1047±16
1,355N/AN/A
335
316346
mpt-30b-chat
MosaicML · CC-BY-NC-SA-4.0
1047±34
242N/AN/A
336
324345
Meta
llama-2-7b-chat
Meta · Llama 2 Community
1042±14
1,656$0.15 / $0.154.1K
337
324345
zephyr-7b-beta
HuggingFace · MIT
1041±17
1,250$0.15 / $0.1516.4K
338
324346
stripedhyena-nous-7b
Together AI · Apache 2.0
1033±20
676$0.20 / $0.20N/A
339
322347
guanaco-33b
UW · Non-commercial
1033±32
280N/AN/A
340
330345
vicuna-13b
LMSYS · Llama 2 Community
1030±14
2,146$0.30 / $0.30N/A
341
327347
mistral-7b-instruct
Mistral · Apache 2.0
1027±19
974$0.07 / $0.284.1K
342
330347
qwen1.5-4b-chat
Alibaba · Qianwen LICENSE
1026±18
988$0.10 / $0.10N/A
343
333347
olmo-7b-instruct
Ai2 · Apache-2.0
1018±19
848$0.20 / $0.20N/A
344
332347
wizardlm-13b
Microsoft · Llama 2 Community
1017±21
669$0.30 / $0.30N/A
345
334347
gemma-2b-it
Google · Gemma license
1009±22
597$0.10 / $0.10N/A
346
338348
vicuna-7b
LMSYS · Llama 2 Community
994±21
658$0.20 / $0.20N/A
347
340348
chatglm3-6b
Tsinghua · Apache-2.0
989±23
576N/AN/A
348
346355
gpt4all-13b-snoozy
Nomic AI · Non-commercial
941±37
211N/AN/A
349
348355
koala-13b
UC Berkeley · Non-commercial
932±21
751N/AN/A
350
348355
chatglm-6b
Tsinghua · Non-commercial
926±25
525N/AN/A
351
348356
RWKV-4-Raven-14B
RWKV · Apache 2.0
922±24
544N/AN/A
352
348356
mpt-7b-chat
MosaicML · CC-BY-NC-SA-4.0
919±25
471N/AN/A
353
348357
chatglm2-6b
Tsinghua · Apache-2.0
915±35
227N/AN/A
354
348357
alpaca-13b
Stanford · Non-commercial
908±23
652N/AN/A
355
348358
oasst-pythia-12b
OpenAssistant · Apache 2.0
892±22
687N/AN/A
356
351359
dolly-v2-12b
Databricks · MIT
871±29
370N/AN/A
357
353359
fastchat-t5-3b
LMSYS · Apache 2.0
862±26
462N/AN/A
358
356359
Stability
stablelm-tuned-alpha-7b
Stability AI · CC-BY-NC-SA-4.0
839±29
353N/AN/A
359
355359
Meta
llama-13b
Meta · Non-commercial
838±33
252$0.23 / $0.23N/A

Remove Style Control Leaderboard Plots

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Battle Count for Each Combination of Models (without Ties)

USE CASES

  • Chat with AI
  • Build Apps & Websites
  • Write & Edit Text
  • Search the Web
  • Generate Images
  • Generate Videos
  • Chose any model
  • Compare Models Side by Side

LEADERBOARD RANKINGS

  • Overall
  • Agent
  • Text
  • WebDev
  • Image-to-WebDev
  • Text to Image
  • Image Edit
  • Text to Video
  • Image to Video
  • Video Edit
  • Vision
  • Document
  • Search

COMPANY

  • About Us
  • How It Works
  • Blog
  • Careers
  • Changelog
  • Help Center
  • FAQ

LEGAL

  • Terms
  • Privacy
  • Cookies

FOLLOW

  • X
  • LinkedIn
  • YouTube
  • Discord

© Arena Intelligence 2026