Text Arena 🏆 Overall

Overall rankings of AI models on text-to-text tasks spanning math, coding, creative writing, and other open-ended domains.

Jun 10, 2026
6,820,793 votes
366 models
Rank | Rank Spread | Organization · License | Score | Votes | Price | Context
1 | 1-3 | Anthropic · Proprietary | 1501±4 | 42,797 | $5 / $25 | 1M
2 | 1-4 | Anthropic · Proprietary | 1497±4 | 45,808 | $5 / $25 | 1M
3 | 1-5 | Anthropic · Proprietary | 1497±11 | 2,883 | $10 / $50 | 1M
4 | 2-8 | Anthropic · Proprietary | 1489±5 | 28,900 | $5 / $25 | 1M
5 | 4-10 | Google · Proprietary | 1481±4 | 55,403 | $2 / $12 | 1M
6 | 3-13 | Google · Proprietary | 1480±6 | 10,113 | $1.50 / $9 | 1M
7 | 4-10 | Anthropic · Proprietary | 1480±5 | 29,924 | $5 / $25 | 1M
8 | 5-10 | Google · Proprietary | 1480±4 | 41,317 | $2 / $12 | 1M
9 | 4-19 | Alibaba · Proprietary | 1474±10 | 3,744 | $1.25 / $3.75 | 1M
10 | 5-19 | Meta · Proprietary | 1472±6 | 13,511 | N/A | N/A
11 | 8-19 | OpenAI · Proprietary | 1471±4 | 37,120 | $2.50 / $15 | 1.1M
12 | 8-19 | Alibaba · Proprietary | 1470±5 | 21,470 | N/A | N/A
13 | 8-19 | Z.ai · MIT | 1470±6 | 15,954 | $1.40 / $4.40 | 202.8K
14 | 9-19 | Baidu · Proprietary | 1468±5 | 21,878 | N/A | N/A
15 | 9-19 | OpenAI · Proprietary | 1468±5 | 24,620 | $5 / $30 | 1.1M
16 | 9-19 | Google · Proprietary | 1466±4 | 30,718 | $0.50 / $3 | 1M
17 | 9-25 | Anthropic · Proprietary | 1464±7 | 9,190 | $5 / $25 | 1M
18 | 9-25 | Xiaomi · MIT | 1462±5 | 23,364 | $0.43 / $0.87 | 1M
19 | 9-25 | OpenAI · Proprietary | 1462±5 | 25,255 | $5 / $30 | 1.1M
20 | 17-27 | Google · Proprietary | 1457±2 | 124,473 | $1.25 / $10 | 1M
21 | 17-33 | Moonshot · Modified MIT | 1456±5 | 22,595 | $0.95 / $4 | 262.1K
22 | 17-32 | Anthropic · Proprietary | 1455±4 | 35,722 | $3 / $15 | 1M
23 | 17-35 | OpenAI · Proprietary | 1454±4 | 39,393 | $2.50 / $15 | 1.1M
24 | 17-42 | Anthropic · Proprietary | 1454±7 | 9,644 | $5 / $25 | 1M
25 | 20-41 |  | 1452±4 | 38,420 | $2 / $6 | 2M
26 | 20-42 |  | 1452±4 | 37,581 | $2 / $6 | 2M
27 | 21-42 | Anthropic · Proprietary | 1450±3 | 71,063 | $5 / $25 | 200K
28 | 21-45 | DeepSeek · MIT | 1449±5 | 25,141 | $0.43 / $0.87 | 1M
29 | 21-43 | Bytedance · Proprietary | 1448±4 | 46,537 | N/A | N/A
30 | 17-55 |  | 1448±10 | 3,417 | N/A | N/A
31 | 23-46 | Anthropic | 1446±4 | 37,098 | $5 / $25 | 200K
32 | 21-55 | Alibaba · Proprietary | 1446±8 | 5,188 | $1.04 / $6.24 | 262.1K
33 | 21-55 | MiniMax · Proprietary | 1446±7 | 7,782 | $0.60 / $2.40 | N/A
34 | 22-51 |  | 1446±5 | 23,617 | $0.43 / $0.87 | 1M
35 | 23-51 | Z.ai · MIT | 1446±5 | 23,145 | $1 / $3.20 | 202.8K
36 | 24-50 | Moonshot · Modified MIT | 1445±4 | 44,488 | $0.60 / $3 | N/A
37 | 24-51 | Baidu · Proprietary | 1445±4 | 35,227 | N/A | N/A
38 | 25-50 |  | 1445±3 | 62,491 | $0.50 / $3 | 1M
39 | 24-54 | xAI · Proprietary | 1444±5 | 26,782 | N/A | N/A
40 | 24-55 | Baidu · Proprietary | 1442±7 | 9,751 | N/A | N/A
41 | 24-57 | Google · Apache 2.0 | 1441±8 | 5,889 | $0.14 / $0.40 | 262.1K
42 | 28-55 | OpenAI · Proprietary | 1441±4 | 40,830 | $1.25 / $10 | 400K
43 | 29-55 | Alibaba · Apache 2.0 | 1441±4 | 39,840 | $0.39 / $2.34 | 262.1K
44 | 29-55 | Z.ai · MIT | 1440±4 | 35,655 | $0.43 / $1.74 | 202.8K
45 | 30-57 | Alibaba · Proprietary | 1439±4 | 27,716 | $0.78 / $3.90 | 262.1K
46 | 24-70 |  | 1438±10 | 3,530 | N/A | N/A
47 | 31-57 | OpenAI · Proprietary | 1438±4 | 34,387 | $1.75 / $14 | 128K
48 | 36-57 | Anthropic · Proprietary | 1438±3 | 80,893 | $3 / $15 | 200K
49 | 36-57 | xAI · Proprietary | 1438±3 | 65,495 | N/A | N/A
50 | 31-60 | Alibaba · Proprietary | 1437±5 | 25,898 | $0.33 / $1.95 | 1M
51 | 36-60 | xAI · Proprietary | 1437±3 | 67,612 | N/A | N/A
52 | 33-60 | Xiaomi · Proprietary | 1436±5 | 24,482 | $1 / $3 | 1M
53 | 31-64 | Z.ai · MIT | 1436±6 | 12,121 | $0.40 / $1.75 | 202.8K
54 | 31-76 | Google · Apache 2.0 | 1434±8 | 5,802 | N/A | N/A
55 | 44-64 | Anthropic | 1433±3 | 82,387 | $3 / $15 | 200K
56 | 49-76 | Mistral · Apache 2.0 | 1430±4 | 43,471 | $0.50 / $1.50 | N/A
57 | 37-87 | Baidu · Proprietary | 1430±9 | 4,707 | N/A | N/A
58 | 44-79 | DeepSeek · MIT | 1430±5 | 24,760 | $0.10 / $0.20 | 1M
59 | 49-79 | Z.ai · MIT | 1429±5 | 24,311 | $0.60 / $2.20 | 131.1K
60 | 52-77 | OpenAI · Proprietary | 1429±3 | 82,455 | $5 / $15 | 128K
61 | 49-84 | DeepSeek · MIT | 1428±6 | 18,468 | $0.50 / $2.15 | 163.8K
62 | 52-88 | Xiaomi · MIT | 1426±5 | 23,774 | $0.14 / $0.28 | 1M
63 | 55-84 | Mistral · Proprietary | 1425±3 | 93,923 | $2.70 / $8.10 | 32K
64 | 54-89 | Meituan · Proprietary | 1425±5 | 28,117 | N/A | N/A
65 | 54-88 | xAI · Proprietary | 1425±4 | 32,906 | $3 / $15 | 131.1K
66 | 55-88 | DeepSeek · MIT | 1424±4 | 47,227 | $0.23 / $0.34 | 131.1K
67 | 52-95 |  | 1424±7 | 9,068 | $0.27 / $0.41 | 163.8K
68 | 54-98 | DeepSeek · MIT | 1423±6 | 11,932 | $0.27 / $0.41 | 163.8K
69 | 58-94 | OpenAI · Proprietary | 1422±4 | 43,470 | $1.25 / $10 | 400K
70 | 54-98 | Mistral · Modified MIT | 1422±7 | 9,535 | $1.50 / $7.50 | 262.1K
71 | 54-98 | Meituan · MIT | 1422±6 | 11,399 | $0.20 / $0.80 | 131.1K
72 | 57-97 |  | 1422±5 | 24,641 | $0.10 / $0.20 | 1M
73 | 55-98 | Alibaba · Apache 2.0 | 1421±6 | 11,503 | $0.20 / $0.88 | 262.1K
74 | 55-100 | Moonshot · Modified MIT | 1420±7 | 8,173 | $0.38 / $2.02 | 262.1K
75 | 55-101 | Xiaomi · Proprietary | 1420±7 | 9,310 | $0.40 / $2 | 262.1K
76 | 52-106 |  | 1420±10 | 3,465 | $0.27 / $0.95 | 163.8K
77 | 54-106 |  | 1420±10 | 3,681 | N/A | N/A
78 | 60-98 | DeepSeek · MIT | 1420±4 | 41,011 | $0.23 / $0.34 | 131.1K
79 | 62-97 | Alibaba · Apache 2.0 | 1419±3 | 97,144 | $0.26 / $1.06 | N/A
80 | 58-101 | DeepSeek · MIT | 1419±6 | 14,960 | $1.23 / $4.94 | N/A
81 | 60-101 | OpenAI · Proprietary | 1419±5 | 26,225 | $5 / $30 | 1.1M
82 | 60-100 | Alibaba · Apache 2.0 | 1419±5 | 22,879 | $0.09 / $1.10 | 262.1K
83 | 62-98 | Anthropic | 1418±3 | 49,804 | $15 / $75 | 200K
84 | 55-110 | DeepSeek · MIT | 1418±10 | 3,705 | $0.27 / $0.95 | 163.8K
85 | 62-101 | Alibaba · Apache 2.0 | 1418±4 | 28,441 | $0.26 / $2.08 | 262.1K
86 | 66-100 | Anthropic · Proprietary | 1417±3 | 77,352 | $15 / $75 | 200K
87 | 60-104 | OpenAI · Proprietary | 1417±6 | 14,547 | $75 / $150 | 128K
88 | 67-98 | Google · Proprietary | 1417±2 | 124,427 | $0.30 / $2.50 | 1M
89 | 60-106 | DeepSeek · MIT | 1417±7 | 11,736 | $1.23 / $4.94 | N/A
90 | 63-104 |  | 1417±4 | 25,395 | N/A | N/A
91 | 67-104 | OpenAI · Proprietary | 1416±4 | 47,930 | $1.75 / $14 | 400K
92 | 67-104 | Google · Proprietary | 1416±4 | 44,683 | $0.25 / $1.50 | 1M
93 | 69-106 | Moonshot · Modified MIT | 1414±3 | 61,974 | $1.15 / $8 | 262.1K
94 | 67-114 | Alibaba · Apache 2.0 | 1414±7 | 8,991 | $0.10 / $0.10 | 262.1K
95 | 67-114 | Alibaba · Proprietary | 1413±6 | 9,158 | $0.78 / $3.90 | 262.1K
96 | 69-111 | OpenAI · Proprietary | 1412±4 | 35,690 | $0.75 / $4.50 | 400K
97 | 78-114 | OpenAI · Proprietary | 1411±4 | 55,819 | $1.75 / $14 | 400K
98 | 78-114 |  | 1411±4 | 46,565 | $0.10 / $0.30 | 262.1K
99 | 81-114 | xAI · Proprietary | 1410±4 | 41,393 | $3 / $15 | 256K
100 | 85-114 | OpenAI · Proprietary | 1409±4 | 59,750 | $2 / $8 | 200K
101 | 71-116 | xAI · Proprietary | 1409±8 | 6,815 | $3 / $15 | 256K
102 | 89-114 | xAI · Proprietary | 1408±3 | 56,722 | $0.20 / $0.50 | 2M
103 | 85-114 | Alibaba · Apache 2.0 | 1408±4 | 27,301 | $0.20 / $1.56 | 262.1K
104 | 89-114 |  | 1407±4 | 32,905 | $0.30 / $2.50 | 1M
105 | 68-122 | Tencent · Proprietary | 1406±12 | 2,218 | N/A | N/A
106 | 85-120 | Tencent · tencent-hunyuan-community | 1405±8 | 6,615 | $0.29 / $1.17 | 262.1K
107 | 93-117 | OpenAI · Proprietary | 1405±4 | 31,932 | $1.25 / $10 | 400K
108 | 93-117 | StepFun · Apache 2.0 | 1405±4 | 41,780 | $0.09 / $0.30 | 262.1K
109 | 93-119 | xAI · Proprietary | 1404±5 | 24,424 | $1.25 / $2.50 | 1M
110 | 95-119 | OpenAI · Proprietary | 1404±4 | 31,576 | $1.25 / $10 | 128K
111 | 95-119 | MiniMax · Modified MIT | 1403±5 | 31,351 | $0.25 / $1 | 204.8K
112 | 95-122 | Alibaba · Apache 2.0 | 1401±7 | 7,938 | $0.26 / $2.60 | 131.1K
113 | 94-123 | Tencent · Proprietary | 1400±9 | 4,706 | N/A | N/A
114 | 93-123 |  | 1399±10 | 3,408 | N/A | N/A
115 | 104-122 | Alibaba · Proprietary | 1398±4 | 37,586 | N/A | N/A
116 | 104-122 | xAI · Proprietary | 1398±5 | 18,717 | $0.20 / $0.50 | 2M
117 | 107-123 | Alibaba · Apache 2.0 | 1396±4 | 29,123 | $0.14 / $1 | 262.1K
118 | 105-123 |  | 1395±6 | 10,963 | $0.10 / $0.30 | 262.1K
119 | 107-124 |  | 1394±6 | 11,474 | N/A | N/A
120 | 110-123 | Alibaba · Apache 2.0 | 1394±4 | 38,210 | $0.46 / $1.82 | 131.1K
121 | 111-123 | Anthropic · Proprietary | 1392±3 | 87,371 | $1 / $5 | 200K
122 | 111-127 | MiniMax · MIT | 1391±5 | 17,137 | $0.29 / $0.95 | 204.8K
123 | 115-129 | OpenAI · Proprietary | 1388±4 | 32,965 | $1.75 / $14 | 128K
124 | 121-132 | Alibaba · Apache 2.0 | 1384±5 | 23,730 | $0.05 / $0.19 | 131.1K
125 | 122-133 | Z.ai · MIT | 1383±4 | 31,075 | $0.13 / $0.85 | 131.1K
126 | 123-134 | OpenAI · Proprietary | 1382±4 | 50,990 | $2 / $8 | 1M
127 | 122-140 | Moonshot · Modified MIT | 1380±6 | 11,778 | $0.60 / $2.50 | 262.1K
128 | 124-136 |  | 1379±3 | 47,235 | $0.10 / $0.40 | 1M
129 | 123-142 | Nvidia · NVIDIA Open Model | 1378±7 | 7,539 | N/A | N/A
130 | 124-147 | Tencent · Proprietary | 1376±6 | 10,725 | N/A | N/A
131 | 124-142 | Anthropic · Proprietary | 1375±4 | 36,895 | $15 / $75 | 200K
132 | 125-142 | DeepSeek · MIT | 1375±4 | 45,498 | $3 / $4.50 | 32.8K
133 | 122-152 | Z.ai · MIT | 1375±11 | 2,805 | $0.30 / $0.90 | 131.1K
134 | 126-147 | OpenAI · Proprietary | 1374±5 | 27,024 | $0.25 / $2 | 400K
135 | 127-148 | OpenAI · Proprietary | 1373±4 | 34,794 | $0.20 / $1.25 | 400K
136 | 127-149 | DeepSeek · MIT | 1373±5 | 18,524 | $0.70 / $2.50 | 163.8K
137 | 128-151 | Moonshot · Modified MIT | 1371±5 | 27,639 | $0.60 / $2.50 | 131.1K
138 | 128-151 | Mistral · Proprietary | 1369±5 | 33,224 | $0.40 / $2 | 131.1K
139 | 129-152 |  | 1368±4 | 32,904 | $0.10 / $0.40 | 1M
140 | 128-153 | Alibaba · Apache 2.0 | 1368±6 | 13,696 | $0.10 / $0.78 | 262.1K
141 | 129-154 | xAI · Proprietary | 1367±5 | 16,966 | $0.25 / $1.27 | N/A
142 | 132-153 | Alibaba · Proprietary | 1367±4 | 32,620 | N/A | N/A
143 | 132-154 | OpenAI · Proprietary | 1366±4 | 27,807 | $15 / $60 | 200K
144 | 132-155 | Alibaba · Apache 2.0 | 1366±5 | 26,267 | $0.46 / $1.82 | 131.1K
145 | 132-155 | OpenAI · Apache 2.0 | 1365±4 | 30,644 | $0.04 / $0.18 | 131.1K
146 | 135-155 | Anthropic · Proprietary | 1364±4 | 44,206 | $15 / $75 | 200K
147 | 128-161 |  | 1364±11 | 2,839 | N/A | N/A
148 | 134-159 | Amazon · Proprietary | 1363±6 | 12,232 | $0.30 / $2.50 | 1M
149 | 132-160 | Ant Group · MIT | 1363±7 | 7,010 | N/A | N/A
150 | 136-158 | xAI · Proprietary | 1363±5 | 22,713 | $0.30 / $0.50 | 131.1K
151 | 140-160 | MiniMax · Modified MIT | 1359±4 | 41,149 | $0.15 / $0.90 | 204.8K
152 | 142-161 | Google · Gemma | 1358±4 | 47,540 | $0.08 / $0.16 | 131.1K
153 | 136-167 | Inception AI · Proprietary | 1356±10 | 3,121 | $0.25 / $0.75 | 128K
154 | 138-164 | Prime Intellect · MIT | 1356±8 | 5,334 | $0.20 / $1.10 | 131.1K
155 | 144-162 | Alibaba · Apache 2.0 | 1356±5 | 25,731 | $0.40 / $1.60 | 262.1K
156 | 147-162 | Google · Proprietary | 1354±4 | 43,755 | $0.10 / $0.40 | 1M
157 | 147-164 | Z.ai · MIT | 1353±6 | 11,735 | $0.06 / $0.40 | 202.8K
158 | 147-164 | OpenAI · Proprietary | 1353±5 | 31,122 | $15 / $60 | N/A
159 | 149-164 | OpenAI · Proprietary | 1353±4 | 45,448 | $1.10 / $4.40 | 200K
160 | 148-171 | StepFun · Apache 2.0 | 1349±7 | 6,545 | $0.57 / $1.42 | 65.5K
161 | 151-169 | Nvidia · NVIDIA Open Model | 1349±5 | 15,514 | $0.06 / $0.24 | 262.1K
162 | 153-169 | Anthropic | 1348±4 | 35,109 | $3 / $15 | 1M
163 | 159-177 | MiniMax · Apache 2.0 | 1342±4 | 35,195 | $0.40 / $2.20 | 1M
164 | 155-180 | MiniMax · Apache 2.0 | 1342±8 | 6,870 | $0.26 / $1 | 204.8K
165 | 159-178 | Arcee AI · Apache 2.0 | 1342±5 | 29,174 | $0.22 / $0.85 | 262.1K
166 | 160-178 | OpenAI · Proprietary | 1340±4 | 39,329 | $0.40 / $1.60 | 1M
167 | 155-184 | Alibaba · Apache 2.0 | 1340±9 | 3,926 | $0.08 / $0.28 | 131.1K
168 | 160-180 | Mistral · Apache 2.0 | 1339±5 | 17,709 | $0.10 / $0.30 | 32K
169 | 162-180 | Arcee AI · Apache 2.0 | 1339±4 | 30,022 | $0.15 / $0.45 | 131K
170 | 162-182 | Anthropic · Proprietary | 1338±4 | 40,313 | $3 / $15 | 1M
171 | 159-185 |  | 1338±10 | 3,345 | $0.10 / $0.40 | 131.1K
172 | 163-183 | OpenAI · Proprietary | 1337±5 | 18,589 | $1.10 / $4.40 | 200K
173 | 163-185 | StepFun · Proprietary | 1335±7 | 9,040 | N/A | N/A
174 | 162-186 | Google · Gemma | 1334±9 | 3,829 | $0.05 / $0.15 | 131.1K
175 | 163-186 | Z.ai · MIT | 1334±8 | 4,958 | $0.60 / $1.80 | 65.5K
176 | 164-185 | DeepSeek · DeepSeek | 1333±5 | 21,770 | $1.14 / $4.56 | N/A
177 | 163-186 | Ant Group · MIT | 1332±7 | 7,144 | N/A | N/A
178 | 166-185 | Cohere · CC-BY-NC-4.0 | 1331±3 | 56,264 | $2.50 / $10 | 256K
179 | 163-189 | Z.ai · Proprietary | 1331±8 | 5,760 | N/A | N/A
180 | 169-186 |  | 1330±4 | 24,955 | $0.07 / $0.30 | 1M
181 | 169-186 | Alibaba · Apache 2.0 | 1329±4 | 25,392 | $0.50 / $1 | 16.4K
182 | 166-193 | Alibaba · Proprietary | 1326±8 | 5,819 | $0.40 / $1.20 | 131.1K
183 | 172-195 | StepFun · Proprietary | 1321±9 | 4,833 | N/A | N/A
184 | 176-195 | OpenAI · Proprietary | 1320±7 | 8,268 | $0.05 / $0.40 | 400K
185 | 170-197 | Tencent · Proprietary | 1320±12 | 2,220 | N/A | N/A
186 | 171-197 | Nvidia · Nvidia Open Model | 1319±12 | 2,549 | $0.60 / $1.80 | 131.1K
187 | 181-194 | Google · Proprietary | 1319±3 | 55,606 | $3.50 / $10.50 | 2.1M
188 | 181-194 | OpenAI · Proprietary | 1319±4 | 57,342 | $1.10 / $4.40 | 200K
189 | 182-195 | OpenAI · Proprietary | 1317±4 | 51,981 | $1.10 / $4.40 | N/A
190 | 182-195 | Alibaba · Apache 2.0 | 1317±5 | 26,485 | $0.12 / $0.50 | 131.1K
191 | 182-196 | Anthropic | 1314±4 | 38,824 | $3 / $15 | 200K
192 | 183-199 | Ai2 · Apache 2.0 | 1312±6 | 12,221 | $0.20 / $0.60 | 65.5K
193 | 181-205 | Tencent · Proprietary | 1311±11 | 2,290 | N/A | N/A
194 | 182-207 |  | 1308±12 | 2,218 | N/A | N/A
195 | 189-204 | Google · Gemma | 1306±5 | 22,592 | $0.06 / $0.12 | 32.8K
196 | 190-204 | xAI · Proprietary | 1305±4 | 63,498 | $2 / $10 | 131.1K
197 | 192-207 | 01 AI · Proprietary | 1302±5 | 27,332 | N/A | N/A
198 | 193-207 | OpenAI · Proprietary | 1301±3 | 112,881 | $5 / $15 | 128K
199 | 193-211 | Alibaba · Proprietary | 1299±6 | 10,187 | N/A | N/A
200 | 193-209 | Anthropic · Proprietary | 1299±4 | 43,183 | $3 / $15 | 200K
201 | 192-215 | Ai2 · Apache 2.0 | 1299±8 | 5,945 | $0.15 / $0.50 | 65.5K
202 | 195-210 | Anthropic · Proprietary | 1298±3 | 88,340 | $3 / $15 | 200K
203 | 185-231 | Ai2 · Apache 2.0 | 1294±21 | 803 | $0.20 / $0.20 | 36.9K
204 | 193-219 | DeepSeek · DeepSeek | 1294±8 | 6,795 | N/A | N/A
205 | 193-225 | IBM · Apache 2.0 | 1292±10 | 4,032 | $0.05 / $0.10 | 131.1K
206 | 199-218 | NexusFlow · NexusFlow | 1292±5 | 24,739 | N/A | N/A
207 | 195-226 | Google · Gemma | 1291±9 | 4,171 | $0.05 / $0.10 | 131.1K
208 | 200-221 | Z.ai · Proprietary | 1290±5 | 26,126 | $0.44 / $1.76 | 204.8K
209 | 196-227 | Tencent · Proprietary | 1288±10 | 3,738 | N/A | N/A
210 | 201-226 | OpenAI · Apache 2.0 | 1288±6 | 10,632 | $0.03 / $0.14 | 131.1K
211 | 202-225 |  | 1288±4 | 39,983 | $0.63 / $1.80 | 131.1K
212 | 202-225 | Google · Proprietary | 1287±4 | 34,902 | $0.07 / $0.30 | 1M
213 | 203-225 | OpenAI · Proprietary | 1287±3 | 68,709 | $0.15 / $0.60 | 128K
214 | 202-229 | OpenAI · Proprietary | 1285±8 | 6,103 | $0.10 / $0.40 | 1M
215 | 203-226 | Meta · Llama 3.1 Community | 1284±4 | 41,375 | $4 / $4 | 32.8K
216 | 202-231 |  | 1283±8 | 7,140 | $1.20 / $1.20 | 131.1K
217 | 204-227 | OpenAI · Proprietary | 1283±4 | 45,499 | $2.50 / $10 | 128K
218 | 199-236 | Inception AI · Proprietary | 1282±14 | 1,954 | $0.25 / $0.75 | 128K
219 | 203-229 | Alibaba · Qwen | 1282±6 | 16,478 | $1.60 / $6.40 | 32.8K
220 | 205-227 | Meta · Llama 3.1 Community | 1282±4 | 59,656 | $4 / $4 | 32.8K
221 | 206-227 | Anthropic · Proprietary | 1281±3 | 82,419 | $3 / $15 | 200K
222 | 205-229 |  | 1281±5 | 30,296 | $0.40 / $0.70 | 8.2K
223 | 206-229 | xAI · Proprietary | 1281±4 | 52,567 | $2 / $10 | 131.1K
224 | 206-231 | Google · Proprietary | 1279±5 | 50,148 | N/A | N/A
225 | 210-231 |  | 1278±5 | 33,217 | $0.10 / $0.30 | 32K
226 | 213-233 | Meta · Llama-3.3 | 1275±3 | 54,738 | $0.10 / $0.32 | 131.1K
227 | 206-241 | Tencent · Proprietary | 1274±10 | 3,904 | N/A | N/A
228 | 217-236 | Google · Proprietary | 1274±4 | 79,138 | $3.50 / $10.50 | 2.1M
229 | 221-236 | OpenAI · Proprietary | 1272±4 | 98,114 | $10 / $30 | 128K
230 | 221-237 | DeepSeek · DeepSeek | 1271±5 | 24,572 | N/A | N/A
231 | 217-242 | Ai2 · Apache 2.0 | 1270±7 | 8,505 | $0.15 / $0.50 | 65.5K
232 | 225-240 | Alibaba · Qwen | 1269±4 | 39,406 | $1.20 / $1.20 | N/A
233 | 226-242 | Mistral · Mistral Research | 1266±4 | 45,459 | $2 / $6 | 131.1K
234 | 226-242 | Mistral · MRL | 1265±4 | 28,073 | $2 / $6 | 128K
235 | 226-243 | NexusFlow · CC-BY-NC-4.0 | 1265±6 | 19,621 | N/A | N/A
236 | 229-243 | OpenAI · Proprietary | 1264±4 | 100,105 | $10 / $30 | 128K
237 | 225-245 | Tencent · Proprietary | 1263±9 | 5,372 | N/A | N/A
238 | 230-244 | OpenAI · Proprietary | 1262±4 | 93,439 | $10 / $30 | 128K
239 | 230-243 | Anthropic · Proprietary | 1262±3 | 194,909 | $15 / $75 | 200K
240 | 231-244 | Meta · Llama 3.1 Community | 1261±4 | 55,240 | $0.40 / $0.40 | 131.1K
241 | 232-245 | Amazon · Proprietary | 1259±5 | 24,745 | $0.80 / $3.20 | 300K
242 | 230-246 | Ai2 · Llama 3.1 | 1256±10 | 2,846 | N/A | N/A
243 | 238-245 | Anthropic · Proprietary | 1255±3 | 69,981 | $0.80 / $4 | 200K
244 | 235-246 | Mistral · Proprietary | 1254±6 | 11,637 | $2 / $5 | 40K
245 | 240-248 | Reka AI · Proprietary | 1248±7 | 7,312 | N/A | N/A
246 | 243-256 | IBM · Apache 2.0 | 1241±8 | 5,684 | N/A | N/A
247 | 245-252 | Google · Proprietary | 1239±4 | 62,833 | $0.07 / $0.30 | 1M
248 | 245-258 | AI21 Labs · Jamba Open | 1237±7 | 8,662 | $2 / $8 | 256K
249 | 246-260 | Mistral · Apache 2.0 | 1234±6 | 14,681 | $0.05 / $0.08 | 32.8K
250 | 247-260 | Google · Gemma license | 1231±3 | 75,754 | $0.65 / $0.65 | 8.2K
251 | 246-262 | Alibaba · Apache 2.0 | 1230±8 | 5,432 | $0.87 / $0.87 | 32K
252 | 246-262 | Cohere · CC-BY-NC-4.0 | 1229±7 | 9,866 | $2.50 / $10 | 128K
253 | 247-262 | Amazon · Proprietary | 1229±5 | 19,372 | $0.06 / $0.24 | 300K
254 | 246-264 |  | 1228±10 | 3,749 | N/A | N/A
255 | 247-264 | Princeton · MIT | 1227±7 | 10,072 | $0.03 / $0.09 | 8.2K
256 | 247-264 | Z.ai · Proprietary | 1226±7 | 9,788 | N/A | N/A
257 | 248-263 | Google · Proprietary | 1226±4 | 35,558 | $0.07 / $0.30 | 1M
258 | 248-264 | Nvidia · NVIDIA Open Model | 1225±5 | 19,659 | N/A | N/A
259 | 249-264 | Cohere · CC-BY-NC-4.0 | 1224±5 | 27,124 | N/A | N/A
260 | 251-264 | Meta · Llama 3 Community | 1221±4 | 156,876 | $0.51 / $0.74 | 8.2K
261 | 253-264 | Anthropic · Proprietary | 1218±4 | 109,284 | $3 / $15 | 200K
262 | 251-267 | Reka AI · Proprietary | 1218±7 | 7,536 | N/A | N/A
263 | 249-270 | Ai2 · Apache-2.0 | 1218±11 | 3,334 | $0.05 / $0.20 | 128K
264 | 255-266 | Microsoft · MIT | 1217±5 | 24,126 | $0.07 / $0.14 | 16.4K
265 | 262-271 | Amazon · Proprietary | 1208±5 | 19,364 | $0.04 / $0.14 | 128K
266 | 263-271 | Google · Gemma license | 1207±4 | 54,611 | $0.03 / $0.09 | 8.2K
267 | 264-271 | OpenAI · Proprietary | 1206±5 | 54,173 | $30 / $60 | 8.2K
268 | 264-272 | Cohere · CC-BY-NC-4.0 | 1204±4 | 77,554 | $2.50 / $10 | 128K
269 | 264-273 | Alibaba · Qianwen LICENSE | 1203±5 | 37,325 | $0.90 / $0.90 | 32.8K
270 | 262-278 | Tencent · Proprietary | 1202±12 | 2,728 | N/A | N/A
271 | 269-277 | Anthropic · Proprietary | 1195±4 | 117,701 | $0.25 / $1.25 | 200K
272 | 265-279 | Ai2 · Llama 3.1 | 1193±11 | 2,896 | N/A | N/A
273 | 270-279 | DeepSeek · DeepSeek License | 1191±6 | 15,147 | $0.14 / $0.28 | 128K
274 | 268-279 | Mistral · MRL | 1191±9 | 4,781 | $0.10 / $0.10 | 131.1K
275 | 270-280 | Cohere · CC-BY-NC-4.0 | 1187±7 | 10,140 | $0.15 / $0.60 | 128K
276 | 270-281 | AI21 Labs · Jamba Open | 1187±7 | 8,858 | $0.20 / $0.40 | 256K
277 | 271-279 | Meta · Llama 3.1 Community | 1187±4 | 49,605 | $0.02 / $0.03 | 131.1K
278 | 272-279 | OpenAI · Proprietary | 1186±4 | 88,723 | $30 / $60 | 8.2K
279 | 270-281 | Cohere · CC-BY-NC-4.0 | 1185±7 | 9,818 | N/A | N/A
280 | 277-284 | Mistral · Proprietary | 1177±5 | 62,436 | $4 / $12 | 32K
281 | 278-288 | Alibaba · Qianwen LICENSE | 1175±6 | 26,195 | N/A | N/A
282 | 280-288 | 01 AI · Apache-2.0 | 1173±5 | 24,146 | N/A | N/A
283 | 280-291 | Reka AI · Proprietary | 1171±7 | 15,450 | N/A | N/A
284 | 281-291 | Alibaba · Qianwen LICENSE | 1166±5 | 39,302 | N/A | N/A
285 | 281-291 | Meta · Llama 3 Community | 1166±4 | 104,642 | $0.14 / $0.14 | 8.2K
286 | 281-293 | Mistral · Proprietary | 1165±5 | 34,550 | $2.70 / $8.10 | 32K
287 | 281-293 | Reka AI · Proprietary | 1165±6 | 24,806 | N/A | N/A
288 | 283-293 | Cohere · CC-BY-NC-4.0 | 1163±5 | 54,036 | $0.15 / $0.60 | 128K
289 | 283-293 | Mistral · Apache 2.0 | 1162±5 | 51,416 | $0.90 / $0.90 | 65.5K
290 | 280-295 | Alibaba · Apache 2.0 | 1162±11 | 3,231 | $0.50 / $1 | 16.4K
291 | 283-295 | InternLM · Other | 1159±7 | 9,901 | $0 / $0 | 32.8K
292 | 286-295 | Google · Gemma license | 1156±4 | 46,616 | N/A | N/A
293 | 286-299 | IBM · Apache 2.0 | 1150±11 | 3,090 | N/A | N/A
294 | 290-298 | Google · Proprietary | 1149±7 | 18,354 | $0.35 / $1.05 | 32.8K
295 | 290-303 | HuggingFace · Apache 2.0 | 1144±11 | 4,652 | N/A | N/A
296 | 293-303 | Microsoft · MIT | 1138±5 | 25,055 | $0.17 / $0.68 | N/A
297 | 293-303 | Alibaba · Qianwen LICENSE | 1137±6 | 21,741 | N/A | N/A
298 | 294-307 | Nexusflow · Apache-2.0 | 1132±7 | 16,056 | N/A | N/A
299 | 295-306 | Mistral · Apache 2.0 | 1132±4 | 73,503 | $0.63 / $0.63 | 32K
300 | 293-309 | Google · Proprietary | 1131±11 | 6,390 | $0.35 / $1.05 | 32.8K
301 | 295-308 | 01 AI · Yi License | 1129±7 | 15,483 | $0.90 / $0.90 | 4.1K
302 | 295-309 | Alibaba · Qianwen LICENSE | 1128±7 | 17,839 | $0.30 / $0.30 | N/A
303 | 295-311 | IBM · Apache 2.0 | 1128±11 | 3,188 | N/A | N/A
304 | 298-309 | OpenAI · Proprietary | 1125±5 | 66,207 | $0.50 / $1.50 | 16.4K
305 | 298-315 | AllenAI/UW · AI2 ImpACT Low-risk | 1121±10 | 6,535 | N/A | N/A
306 | 298-316 | Microsoft · Llama 2 Community | 1120±9 | 8,214 | N/A | N/A
307 | 299-314 | Databricks · DBRX LICENSE | 1119±6 | 32,191 | $0.60 / $0.60 | 32.8K
308 | 301-316 | Meta · Llama 2 Community | 1115±5 | 38,492 | $0.70 / $2.80 | 4.1K
309 | 300-321 | NousResearch · Apache-2.0 | 1112±12 | 3,777 | $0.90 / $0.90 | N/A
310 | 305-319 | Microsoft · MIT | 1110±6 | 17,766 | $0.15 / $0.60 | N/A
311 | 304-321 | Meta · Llama 3.2 | 1110±8 | 7,936 | $0.05 / $0.34 | 131.1K
312 | 305-322 | UC Berkeley · CC-BY-NC-4.0 | 1108±8 | 10,224 | N/A | N/A
313 | 305-322 | OpenChat · Apache-2.0 | 1107±8 | 12,637 | N/A | N/A
314 | 306-322 | LMSYS · Non-commercial | 1105±6 | 22,479 | $0 / $0 | 2K
315 | 304-325 | DeepSeek · DeepSeek License | 1105±12 | 4,932 | N/A | N/A
316 | 309-325 | Snowflake · Apache 2.0 | 1101±6 | 32,832 | N/A | N/A
317 | 307-330 | Nvidia · Llama 2 Community | 1098±12 | 3,585 | N/A | N/A
318 | 309-328 | OpenChat · Apache-2.0 | 1097±10 | 7,968 | $0.20 / $0.20 | N/A
319 | 309-328 | IBM · Apache 2.0 | 1097±9 | 6,638 | N/A | N/A
320 | 310-330 | OpenAI · Proprietary | 1094±9 | 16,619 | $1 / $2 | 16.4K
321 | 312-328 | Google · Gemma license | 1094±6 | 23,893 | $0.03 / $0.09 | 8.2K
322 | 310-330 | NousResearch · Apache-2.0 | 1093±10 | 5,006 | $0.17 / $0.17 | N/A
323 | 315-330 | Mistral · Apache-2.0 | 1090±7 | 19,402 | $0.20 / $0.20 | 32.8K
324 | 317-333 | Meta · Llama 2 Community | 1085±7 | 19,174 | $0.25 / $0.25 | 4.1K
325 | 317-336 | Alibaba · Qianwen LICENSE | 1084±10 | 4,737 | $0.20 / $0.20 | N/A
326 | 315-337 | Upstage AI · CC-BY-NC-4.0 | 1083±13 | 4,155 | $0.30 / $0.30 | N/A
327 | 315-338 | Cognitive Computations · Apache-2.0 | 1081±15 | 1,679 | $0.50 / $0.50 | 16.4K
328 | 320-336 |  | 1080±6 | 12,297 | $0.13 / $0.52 | 4.1K
329 | 317-336 | IBM · Apache 2.0 | 1080±8 | 6,837 | N/A | N/A
330 | 320-337 | Microsoft · Llama 2 Community | 1077±9 | 7,044 | $0.30 / $0.30 | N/A
331 | 324-337 | Microsoft · MIT | 1073±6 | 20,118 | $0.13 / $0.52 | N/A
332 | 324-342 | HuggingFace · MIT | 1070±9 | 11,118 | $0.15 / $0.15 | 16.4K
333 | 324-344 | MosaicML · CC-BY-NC-SA-4.0 | 1069±12 | 2,572 | N/A | N/A
334 | 325-344 | Meta · Llama 2 Community | 1066±9 | 7,366 | $0.35 / $1.40 | 16.4K
335 | 325-347 | HuggingFace · MIT | 1058±16 | 1,785 | N/A | N/A
336 | 332-345 | LMSYS · Llama 2 Community | 1058±7 | 19,367 | $0.30 / $0.30 | N/A
337 | 325-348 | Meta · Llama 2 Community | 1057±18 | 1,143 | $0.70 / $2.80 | 16.4K
338 | 331-346 | Google · Gemma license | 1056±10 | 8,925 | $0.05 / $0.08 | 8.2K
339 | 332-346 | Meta · Llama 3.2 | 1055±8 | 8,045 | $0.03 / $0.20 | 131.1K
340 | 328-348 | TII · Falcon-180B TII License | 1054±17 | 1,295 | N/A | N/A
341 | 333-346 | Meta · Llama 2 Community | 1053±7 | 14,148 | $0.15 / $0.15 | 4.1K
342 | 332-347 | UW · Non-commercial | 1053±12 | 2,921 | N/A | N/A
343 | 332-347 | Alibaba · Qianwen LICENSE | 1051±11 | 4,964 | N/A | N/A
344 | 333-346 | Microsoft · MIT | 1050±7 | 20,685 | $0.13 / $0.52 | N/A
345 | 335-351 | HuggingFace · Apache 2.0 | 1042±14 | 2,199 | N/A | N/A
346 | 336-351 | Together AI · Apache 2.0 | 1039±11 | 5,182 | $0.20 / $0.20 | N/A
347 | 340-351 | Ai2 · Apache-2.0 | 1032±11 | 6,328 | $0.20 / $0.20 | N/A
348 | 343-351 | LMSYS · Llama 2 Community | 1031±9 | 6,923 | $0.20 / $0.20 | N/A
349 | 345-351 | Google · Proprietary | 1027±9 | 8,554 | $0.50 / $0.50 | 25.8K
350 | 345-351 | Mistral · Apache 2.0 | 1024±9 | 8,977 | $0.07 / $0.28 | 4.1K
351 | 345-351 | Google · Gemma license | 1022±8 | 10,854 | N/A | N/A
352 | 352-354 | Google · Gemma license | 1002±12 | 4,780 | $0.10 / $0.10 | N/A
353 | 352-354 | Alibaba · Qianwen LICENSE | 997±9 | 7,597 | $0.10 / $0.10 | N/A
354 | 352-355 | UC Berkeley · Non-commercial | 990±10 | 6,965 | N/A | N/A
355 | 354-357 | Tsinghua · Apache-2.0 | 972±12 | 4,658 | N/A | N/A
356 | 355-360 | Nomic AI · Non-commercial | 956±15 | 1,743 | N/A | N/A
357 | 355-360 | MosaicML · CC-BY-NC-SA-4.0 | 955±12 | 3,924 | N/A | N/A
358 | 356-360 | RWKV · Apache 2.0 | 948±11 | 4,845 | N/A | N/A
359 | 356-362 | Tsinghua · Apache-2.0 | 939±14 | 2,658 | N/A | N/A
360 | 356-362 | Stanford · Non-commercial | 933±11 | 5,745 | N/A | N/A
361 | 359-363 | Tsinghua · Non-commercial | 919±12 | 4,914 | N/A | N/A
362 | 359-363 | OpenAssistant · Apache 2.0 | 916±11 | 6,310 | N/A | N/A
363 | 361-363 | LMSYS · Apache 2.0 | 894±12 | 4,203 | N/A | N/A
364 | 364-365 | Stability AI · CC-BY-NC-SA-4.0 | 867±13 | 3,287 | N/A | N/A
365 | 364-366 | Databricks · MIT | 851±13 | 3,412 | N/A | N/A
366 | 365-366 | Meta · Non-commercial | 834±16 | 2,391 | $0.23 / $0.23 | N/A

Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Battle Count for Each Combination of Models (without Ties)
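
The plot titles above mention confidence intervals obtained via bootstrapping and win fractions over non-tied battles. As a rough illustration only, the Python sketch below applies a percentile bootstrap to one model's win rate. The page does not describe how the leaderboard actually computes model strength, so the plain win rate here is a stand-in, and the function name `bootstrap_win_rate_ci`, the 0/1 outcome encoding, and the sample data are all hypothetical.

```python
import random


def bootstrap_win_rate_ci(outcomes, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile-bootstrap interval for a win rate over non-tied battles.

    outcomes: list of 1 (model A won) or 0 (model B won). This encoding is
    an assumption made for illustration, not the leaderboard's data format.
    Returns (point_estimate, lower_bound, upper_bound).
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    n = len(outcomes)
    rates = []
    for _ in range(n_resamples):
        # Resample the battles with replacement and recompute the statistic.
        sample = [outcomes[rng.randrange(n)] for _ in range(n)]
        rates.append(sum(sample) / n)
    rates.sort()
    # Read the interval off the empirical percentiles of the resampled rates.
    lo_idx = int((alpha / 2) * n_resamples)
    hi_idx = min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))
    return sum(outcomes) / n, rates[lo_idx], rates[hi_idx]


# Hypothetical data: model A won 60 of 100 non-tied battles against model B.
battles = [1] * 60 + [0] * 40
point, low, high = bootstrap_win_rate_ci(battles)
print(f"win rate {point:.2f}, 95% interval [{low:.2f}, {high:.2f}]")
```

With more battles the resampled rates cluster more tightly, which is consistent with the table, where rows with many votes tend to show a smaller ± value than rows with few votes.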