Text Arena💻Coding

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Jun 5, 2026
1,302,812 votes
360 models
Rank Spread
1
12
Anthropic
Anthropic · Proprietary
1536±7
9,727$5 / $251M
2
12
Anthropic
Anthropic · Proprietary
1535±7
11,224$5 / $251M
3
38
Anthropic
Anthropic · Proprietary
1518±8
7,038$5 / $251M
4
39
Anthropic
Anthropic · Proprietary
1517±8
7,375$5 / $251M
5
326
Anthropic
Anthropic · Proprietary
1504±15
1,565$5 / $251M
6
420
Anthropic
1503±7
7,621$5 / $25200K
7
325
Z.ai · MIT
1500±10
4,196$1.40 / $4.40202.8K
8
329
Anthropic
Anthropic · Proprietary
1500±15
1,686$5 / $251M
9
522
Anthropic
Anthropic · Proprietary
1499±7
8,620$3 / $151M
10
522
Anthropic
Anthropic · Proprietary
1498±6
17,039$5 / $25200K
11
526
OpenAI · Proprietary
1497±9
5,734$5 / $301.1M
12
335
Alibaba · Proprietary
1497±18
1,136$1.25 / $3.751M
13
528
Xiaomi · MIT
1495±9
5,426$0.43 / $0.871M
14
526
OpenAI · Proprietary
1495±7
8,625$2.50 / $151.1M
15
534
Google · Proprietary
1491±12
2,862$1.50 / $91M
16
531
Baidu · Proprietary
1490±9
5,094N/AN/A
17
529
Google · Proprietary
1490±6
13,244$2 / $121M
18
629
Anthropic
1489±5
19,003$3 / $15200K
19
545
MiniMax · Proprietary
1486±17
1,320$0.60 / $2.40N/A
20
637
Alibaba · Proprietary
1485±8
5,827N/AN/A
21
832
Anthropic
Anthropic · Proprietary
1485±5
18,877$3 / $15200K
22
551
1485±20
841N/AN/A
23
542
Moonshot · Modified MIT
1484±14
1,800$0.40 / $1.90262.1K
24
937
Google · Proprietary
1483±7
8,573$2 / $121M
25
840
Moonshot · Modified MIT
1483±9
5,269$0.95 / $4262.1K
26
842
Meta
Meta · Proprietary
1480±10
3,566N/AN/A
27
1240
Anthropic
1480±6
9,843$15 / $75200K
28
1240
OpenAI · Proprietary
1479±7
9,532$2.50 / $151.1M
29
1344
Xiaomi · Proprietary
1477±8
6,640$1 / $31M
30
1644
Moonshot · Modified MIT
1476±6
10,784$0.60 / $3N/A
31
1650
OpenAI · Proprietary
1475±8
6,021$5 / $301.1M
32
2046
Anthropic
Anthropic · Proprietary
1473±5
15,530$15 / $75200K
33
1950
Bytedance
Bytedance · Proprietary
1473±6
11,535N/AN/A
34
1851
Meituan · Proprietary
1472±7
7,584N/AN/A
35
2053
DeepSeek · MIT
1470±8
6,330$0.43 / $0.871M
36
2253
Alibaba · Apache 2.0
1468±7
10,036$0.39 / $2.34262.1K
37
1862
Meituan · MIT
1467±13
2,233$0.20 / $0.80131.1K
38
2258
Xiaomi · MIT
1467±8
5,874$0.14 / $0.281M
39
1774
Alibaba · Proprietary
1465±16
1,460$1.04 / $6.24262.1K
40
2559
Alibaba · Proprietary
1464±8
6,769$0.33 / $1.951M
41
2562
Google · Proprietary
1463±8
6,384$0.50 / $31M
42
2761
1463±7
9,074$2 / $62M
43
2281
Xiaomi · Proprietary
1460±14
1,815$0.40 / $2262.1K
44
3166
1460±7
9,236$2 / $62M
45
3169
Z.ai · MIT
1459±8
5,667$1 / $3.20202.8K
46
3172
1458±9
5,801$0.43 / $0.871M
47
3376
Alibaba · Proprietary
1457±8
5,366$0.78 / $3.90262.1K
48
3575
Baidu · Proprietary
1456±7
8,416N/AN/A
49
3084
Z.ai · MIT
1456±12
2,411$0.40 / $1.75202.8K
50
2996
Google · Apache 2.0
1454±15
1,363$0.14 / $0.40262.1K
51
3780
DeepSeek · MIT
1453±7
8,375$0.23 / $0.34131.1K
52
3778
Moonshot · Modified MIT
1453±6
14,542$1.15 / $8262.1K
53
3979
Google · Proprietary
1452±4
26,215$1.25 / $101M
54
3782
OpenAI · Proprietary
1452±7
8,210$1.25 / $10400K
55
3785
DeepSeek · MIT
1452±8
6,163$0.10 / $0.201M
56
3980
Anthropic
Anthropic · Proprietary
1451±5
19,804$1 / $5200K
57
3599
Mistral · Modified MIT
1450±14
1,810$1.50 / $7.50262.1K
58
3887
Z.ai · MIT
1450±7
7,481$0.43 / $1.74202.8K
59
27107
1449±21
788N/AN/A
60
4092
MiniMax · Modified MIT
1448±7
7,987$0.27 / $1.08204.8K
61
4291
DeepSeek · MIT
1448±6
10,431$0.23 / $0.34131.1K
62
4192
OpenAI · Proprietary
1448±7
8,804$1.75 / $14128K
63
31111
1446±21
735N/AN/A
64
4299
xAI · Proprietary
1445±8
6,820N/AN/A
65
4493
Alibaba · Apache 2.0
1445±5
21,022$0.26 / $1.06N/A
66
4497
xAI · Proprietary
1444±6
14,783N/AN/A
67
37105
Google · Apache 2.0
1444±15
1,367N/AN/A
68
4699
Mistral · Apache 2.0
1443±6
9,770$0.50 / $1.50N/A
69
4899
xAI · Proprietary
1443±6
15,318N/AN/A
70
4799
OpenAI · Proprietary
1443±6
11,437$1.75 / $14400K
71
4499
Anthropic
Anthropic · Proprietary
1442±8
6,676$15 / $75200K
72
45100
Alibaba · Apache 2.0
1441±9
4,791$0.09 / $1.10262.1K
73
4899
OpenAI · Proprietary
1441±7
8,409$0.75 / $4.50400K
74
42107
Alibaba · Apache 2.0
1440±13
2,314$0.20 / $0.88262.1K
75
5299
1440±6
11,686$0.10 / $0.30262.1K
76
43108
Alibaba · Proprietary
1440±13
2,041$0.78 / $3.90262.1K
77
54100
1438±6
14,459$0.50 / $31M
78
43111
1438±13
1,920$0.27 / $0.41163.8K
79
56105
OpenAI · Proprietary
1436±7
9,126$1.25 / $10400K
80
54107
1436±8
6,073$0.10 / $0.201M
81
57105
Stepfun
StepFun · Apache 2.0
1435±7
9,448$0.09 / $0.30262.1K
82
61104
Mistral · Proprietary
1435±5
20,831$2.70 / $8.1032K
83
56108
OpenAI · Proprietary
1435±8
6,358$1.25 / $10400K
84
57107
Alibaba · Apache 2.0
1435±7
7,490$0.26 / $2.08262.1K
85
53111
DeepSeek · MIT
1434±12
2,501$0.27 / $0.41163.8K
86
58110
OpenAI · Proprietary
1434±8
7,571$5 / $301.1M
87
57111
Z.ai · MIT
1433±9
4,773$0.60 / $2.20131.1K
88
62108
OpenAI · Proprietary
1433±6
12,897$1.75 / $14400K
89
60111
xAI · Proprietary
1432±8
5,402$3 / $15131.1K
90
61111
1432±8
5,318N/AN/A
91
50123
xAI · Proprietary
1430±16
1,249$3 / $15256K
92
55119
Tencent
Tencent · tencent-hunyuan-community
1430±14
1,834$0.29 / $1.17262.1K
93
43133
1429±23
636$0.27 / $0.95163.8K
94
44130
1429±21
704N/AN/A
95
57123
Alibaba · Apache 2.0
1428±14
1,626$0.26 / $2.60131.1K
96
63119
DeepSeek · MIT
1427±11
2,729$0.50 / $2.15163.8K
97
61123
Baidu · Proprietary
1426±13
1,955N/AN/A
98
73118
Alibaba · Apache 2.0
1424±7
7,228$0.20 / $1.56262.1K
99
63129
Alibaba · Apache 2.0
1424±15
1,612$0.10 / $0.10262.1K
100
77114
Google · Proprietary
1423±4
25,609$0.30 / $2.501M
101
73120
xAI · Proprietary
1423±9
5,799$1.25 / $2.501M
102
71123
MiniMax · MIT
1423±10
3,426$0.29 / $0.95204.8K
103
49141
Tencent
Tencent · Proprietary
1421±27
437N/AN/A
104
77126
xAI · Proprietary
1419±9
3,956$0.20 / $0.502M
105
73134
DeepSeek · MIT
1418±13
1,905$1.23 / $4.94N/A
106
74133
1418±12
2,444$0.10 / $0.30262.1K
107
84129
Alibaba · Apache 2.0
1418±9
4,663$0.05 / $0.19131.1K
108
91126
OpenAI · Proprietary
1415±5
15,868$5 / $15128K
109
84135
DeepSeek · MIT
1414±11
2,625$1.23 / $4.94N/A
110
73139
Baidu · Proprietary
1413±19
916N/AN/A
111
91134
Anthropic
1413±8
6,414$3 / $151M
112
85136
1413±12
2,292N/AN/A
113
91135
Alibaba · Apache 2.0
1412±9
4,852$0.40 / $1.60262.1K
114
92133
xAI · Proprietary
1412±6
13,198$0.20 / $0.502M
115
92135
Alibaba · Apache 2.0
1411±7
7,669$0.14 / $1262.1K
116
93136
xAI · Proprietary
1409±7
8,157$3 / $15256K
117
96136
OpenAI · Proprietary
1408±6
11,749$2 / $8200K
118
96136
Alibaba · Proprietary
1407±7
9,520N/AN/A
119
95137
OpenAI · Proprietary
1407±8
5,500$0.25 / $2400K
120
96137
OpenAI · Proprietary
1407±7
8,449$1.75 / $14128K
121
81149
DeepSeek · MIT
1407±21
778$0.27 / $0.95163.8K
122
92141
Nvidia · NVIDIA Open Model
1405±14
1,766N/AN/A
123
102139
Anthropic
Anthropic · Proprietary
1402±7
7,900$15 / $75200K
124
102139
OpenAI · Proprietary
1402±7
8,396$0.20 / $1.25400K
125
102139
1402±7
6,842$0.30 / $2.501M
126
104141
OpenAI · Proprietary
1400±8
5,989$1.25 / $10128K
127
105141
Google · Proprietary
1400±7
10,841$0.25 / $1.501M
128
100146
Moonshot · Modified MIT
1400±12
2,243$0.60 / $2.50262.1K
129
105141
Alibaba · Apache 2.0
1399±7
6,977$0.46 / $1.82131.1K
130
100152
OpenAI · Proprietary
1397±13
1,939$75 / $150128K
131
110145
Z.ai · MIT
1396±8
6,106$0.13 / $0.85131.1K
132
92174
Z.ai · MIT
1392±25
534$0.30 / $0.90131.1K
133
100168
Inception AI · Proprietary
1392±20
768$0.25 / $0.75128K
134
113157
Alibaba · Apache 2.0
1391±11
2,676$0.10 / $0.78262.1K
135
108161
Ant Group · MIT
1391±15
1,528N/AN/A
136
119152
OpenAI · Proprietary
1390±7
9,316$2 / $81M
137
117161
Amazon · Proprietary
1388±12
2,516$0.30 / $2.501M
138
105174
Tencent
Tencent · Proprietary
1387±20
805N/AN/A
139
123158
Mistral · Proprietary
1386±8
5,900$0.40 / $2131.1K
140
119160
Alibaba · Apache 2.0
1386±9
4,341$0.46 / $1.82131.1K
141
123168
Z.ai · MIT
1383±11
2,690$0.06 / $0.40202.8K
142
129168
OpenAI · Apache 2.0
1380±8
6,494$0.04 / $0.18131.1K
143
130168
Anthropic
Anthropic · Proprietary
1380±7
7,397$3 / $151M
144
128170
Nvidia · NVIDIA Open Model
1379±10
3,277$0.06 / $0.24262.1K
145
128174
OpenAI · Proprietary
1379±12
2,596$1.10 / $4.40200K
146
131168
Arcee AI · Apache 2.0
1378±8
7,347$0.15 / $0.45131K
147
131168
MiniMax · Modified MIT
1378±7
10,571$0.15 / $0.90204.8K
148
130169
Moonshot · Modified MIT
1378±8
5,243$0.60 / $2.50131.1K
149
131176
xAI · Proprietary
1376±10
3,296$0.25 / $1.27N/A
150
128177
MiniMax · Apache 2.0
1374±15
1,544$0.26 / $1204.8K
151
133176
1374±8
6,002$0.10 / $0.401M
152
133177
DeepSeek · MIT
1372±12
2,317$0.70 / $2.50163.8K
153
135176
1371±6
9,679$0.10 / $0.401M
154
138176
DeepSeek · MIT
1369±7
8,367$3 / $4.5032.8K
155
138176
OpenAI · Proprietary
1369±7
8,720$1.10 / $4.40200K
156
130180
Prime Intellect · MIT
1368±19
972$0.20 / $1.10131.1K
157
135178
OpenAI · Proprietary
1367±10
3,973$15 / $60200K
158
138177
OpenAI · Proprietary
1367±7
6,919$0.40 / $1.601M
159
136178
OpenAI · Proprietary
1367±9
5,123$15 / $60N/A
160
128187
1366±24
552N/AN/A
161
138178
xAI · Proprietary
1366±9
4,255$0.30 / $0.50131.1K
162
133180
Ant Group · MIT
1366±15
1,540N/AN/A
163
133181
Stepfun
StepFun · Apache 2.0
1366±16
1,232$0.57 / $1.4265.5K
164
138180
Mistral · Apache 2.0
1363±10
3,359$0.10 / $0.3032K
165
144178
OpenAI · Proprietary
1363±7
8,478$1.10 / $4.40N/A
166
146178
OpenAI · Proprietary
1362±6
9,461$1.10 / $4.40200K
167
145180
Anthropic
1361±8
6,191$3 / $15200K
168
138182
Tencent
Tencent · Proprietary
1361±14
1,776N/AN/A
169
146180
Alibaba · Proprietary
1360±8
5,102N/AN/A
170
146180
Arcee AI · Apache 2.0
1360±8
7,789$0.22 / $0.85262.1K
171
149180
MiniMax · Apache 2.0
1359±8
6,486$0.40 / $2.201M
172
133191
Alibaba · Apache 2.0
1358±24
513$0.08 / $0.28131.1K
173
134191
1357±22
659$0.10 / $0.40131.1K
174
149188
OpenAI · Proprietary
1352±15
1,684$0.05 / $0.40400K
175
157185
Google · Proprietary
1352±7
6,995$0.10 / $0.401M
176
146195
Z.ai · MIT
1350±18
991$0.60 / $1.8065.5K
177
154188
Ai2 · Apache 2.0
1349±12
2,512$0.20 / $0.6065.5K
178
138206
Tencent
Tencent · Proprietary
1343±31
275N/AN/A
179
170188
Anthropic
Anthropic · Proprietary
1343±5
14,964$3 / $15200K
180
162200
Stepfun
StepFun · Proprietary
1340±15
1,505N/AN/A
181
171195
Anthropic
Anthropic · Proprietary
1339±7
7,145$3 / $15200K
182
171196
Alibaba · Apache 2.0
1338±9
4,528$0.12 / $0.50131.1K
183
172199
Alibaba · Apache 2.0
1335±9
4,045$0.50 / $116.4K
184
176201
Cohere
Cohere · CC-BY-NC-4.0
1331±6
10,221$2.50 / $10256K
185
171208
Alibaba · Proprietary
1329±18
893$0.40 / $1.20131.1K
186
162217
Inception AI · Proprietary
1326±29
394$0.25 / $0.75128K
187
176205
DeepSeek · DeepSeek
1326±10
3,280$1.14 / $4.56N/A
188
180205
Google · Gemma
1323±7
8,076$0.08 / $0.16131.1K
189
178206
1323±10
3,474$0.07 / $0.301M
190
173212
Ai2 · Apache 2.0
1321±18
1,054$0.15 / $0.5065.5K
191
169225
Tencent
Tencent · Proprietary
1320±30
299N/AN/A
192
178208
Mistral · Proprietary
1319±12
2,248$2 / $540K
193
176217
Stepfun
StepFun · Proprietary
1317±20
737N/AN/A
194
181213
Alibaba · Proprietary
1315±14
1,553N/AN/A
195
178222
IBM · Apache 2.0
1314±20
1,025$0.05 / $0.10131.1K
196
184212
01.AI
01 AI · Proprietary
1313±10
4,316N/AN/A
197
184211
NexusFlow · NexusFlow
1313±9
4,019N/AN/A
198
172228
Nvidia · Nvidia Open Model
1312±30
367$0.60 / $1.80131.1K
199
184212
1310±8
6,137$0.10 / $0.3032K
200
181222
DeepSeek · DeepSeek
1310±17
1,079N/AN/A
201
184218
OpenAI · Apache 2.0
1309±13
2,167$0.03 / $0.14131.1K
202
178228
Tencent
Tencent · Proprietary
1307±24
519N/AN/A
203
186215
Anthropic
Anthropic · Proprietary
1307±7
13,607$3 / $15200K
204
183226
Tencent
Tencent · Proprietary
1306±19
963N/AN/A
205
182226
OpenAI · Proprietary
1306±19
807$0.10 / $0.401M
206
188218
1303±7
6,998$0.63 / $1.80131.1K
207
188223
DeepSeek · DeepSeek
1302±9
4,252N/AN/A
208
190224
OpenAI · Proprietary
1298±6
19,526$5 / $15128K
209
181239
1296±31
286N/AN/A
210
194225
Google · Proprietary
1295±7
9,175$3.50 / $10.502.1M
211
195226
Alibaba · Qwen
1293±8
6,688$1.20 / $1.20N/A
212
196228
Meta
Meta · Llama 3.1 Community
1292±7
6,249$4 / $432.8K
213
195228
Z.ai · Proprietary
1291±9
4,449$0.44 / $1.76204.8K
214
198228
OpenAI · Proprietary
1290±6
10,927$0.15 / $0.60128K
215
196231
Alibaba · Qwen
1289±11
2,756$1.60 / $6.4032.8K
216
191235
Ai2 · Apache 2.0
1288±15
1,566$0.15 / $0.5065.5K
217
200228
xAI · Proprietary
1288±7
10,368$2 / $10131.1K
218
190237
Z.ai · Proprietary
1287±18
894N/AN/A
219
202228
Anthropic
Anthropic · Proprietary
1287±6
11,249$0.80 / $4200K
220
200231
1286±9
5,256$0.40 / $0.708.2K
221
204231
Meta
Meta · Llama 3.1 Community
1284±7
9,714$4 / $432.8K
222
203232
OpenAI · Proprietary
1284±8
7,318$2.50 / $10128K
223
190243
Google · Gemma
1282±23
543$0.05 / $0.15131.1K
224
209237
Mistral · Mistral Research
1277±8
7,589$2 / $6131.1K
225
200244
Alibaba · Apache 2.0
1276±18
873$0.87 / $0.8732K
226
209238
Mistral · MRL
1276±9
4,212$2 / $6128K
227
206245
1272±15
1,312$1.20 / $1.20131.1K
228
216242
Amazon · Proprietary
1270±9
3,853$0.80 / $3.20300K
229
200248
Tencent
Tencent · Proprietary
1270±24
549N/AN/A
230
216243
Google · Gemma
1270±10
3,532$0.06 / $0.1232.8K
231
219241
xAI · Proprietary
1269±7
8,652$2 / $10131.1K
232
220241
Meta
Meta · Llama-3.3
1269±6
8,747$0.10 / $0.32131.1K
233
220242
OpenAI · Proprietary
1269±7
17,104$10 / $30128K
234
216245
NexusFlow · CC-BY-NC-4.0
1267±11
3,122N/AN/A
235
220243
Google · Proprietary
1267±7
12,747$3.50 / $10.502.1M
236
221243
Anthropic
Anthropic · Proprietary
1265±6
33,748$15 / $75200K
237
221247
Google · Proprietary
1262±8
5,892$0.07 / $0.301M
238
223247
Meta
Meta · Llama 3.1 Community
1260±7
9,389$0.40 / $0.40131.1K
239
225248
Google · Proprietary
1255±9
8,138N/AN/A
240
227248
OpenAI · Proprietary
1255±7
15,605$10 / $30128K
241
225254
DeepSeek · DeepSeek License
1252±12
2,671$0.14 / $0.28128K
242
234252
OpenAI · Proprietary
1250±8
15,289$10 / $30128K
243
224255
IBM · Apache 2.0
1249±17
1,268N/AN/A
244
233255
Mistral · Apache 2.0
1246±12
2,083$0.05 / $0.0832.8K
245
238256
Amazon · Proprietary
1239±10
3,060$0.06 / $0.24300K
246
241256
Google · Proprietary
1236±8
10,680$0.07 / $0.301M
247
229264
Ai2 · Llama 3.1
1236±24
450N/AN/A
248
242258
Microsoft · MIT
1232±10
3,305$0.07 / $0.1416.4K
249
236265
Tencent
Tencent · Proprietary
1231±24
497N/AN/A
250
236265
Google · Gemma
1231±23
605$0.05 / $0.10131.1K
251
241262
Reka AI · Proprietary
1231±15
1,216N/AN/A
252
242262
Z.ai · Proprietary
1228±14
1,718N/AN/A
253
241262
AI21 Labs · Jamba Open
1228±14
1,440$2 / $8256K
254
241271
1223±21
665N/AN/A
255
245262
Anthropic
Anthropic · Proprietary
1223±7
18,888$3 / $15200K
256
247265
Amazon · Proprietary
1218±10
2,981$0.04 / $0.14128K
257
247264
Google · Proprietary
1218±8
6,069$0.07 / $0.301M
258
248270
Google · Gemma license
1211±6
12,088$0.65 / $0.658.2K
259
243277
Ai2 · Apache-2.0
1211±27
427$0.05 / $0.20128K
260
248272
Nvidia · NVIDIA Open Model
1210±11
3,254N/AN/A
261
248272
OpenAI · Proprietary
1209±9
8,306$30 / $608.2K
262
252272
Meta
Meta · Llama 3 Community
1207±7
28,126$0.51 / $0.748.2K
263
248276
Mistral · MRL
1202±19
838$0.10 / $0.10131.1K
264
257275
Anthropic
Anthropic · Proprietary
1200±7
20,898$0.25 / $1.25200K
265
257276
Cohere
Cohere · CC-BY-NC-4.0
1197±9
4,685N/AN/A
266
257276
Alibaba · Qianwen LICENSE
1196±9
6,249$0.90 / $0.9032.8K
267
258276
Meta
Meta · Llama 3.1 Community
1195±7
8,582$0.02 / $0.03131.1K
268
257282
Princeton · MIT
1191±15
1,471$0.03 / $0.098.2K
269
257284
Reka AI · Proprietary
1191±15
1,207N/AN/A
270
262280
OpenAI · Proprietary
1188±8
13,719$30 / $608.2K
271
259286
Cohere
Cohere · CC-BY-NC-4.0
1188±14
1,675$2.50 / $10128K
272
252290
IBM · Apache 2.0
1187±26
478N/AN/A
273
262286
Alibaba · Qianwen LICENSE
1184±10
4,763N/AN/A
274
254291
Ai2 · Llama 3.1
1184±24
476N/AN/A
275
263284
Mistral · Proprietary
1183±8
10,418$4 / $1232K
276
262290
AI21 Labs · Jamba Open
1179±15
1,352$0.20 / $0.40256K
277
268288
Google · Gemma license
1174±7
8,921$0.03 / $0.098.2K
278
268290
Cohere
Cohere · CC-BY-NC-4.0
1172±8
13,937$2.50 / $10128K
279
267291
Cohere
Cohere · CC-BY-NC-4.0
1170±13
1,783$0.15 / $0.60128K
280
268291
01.AI
01 AI · Apache-2.0
1170±10
3,841N/AN/A
281
272291
Mistral · Apache 2.0
1166±9
8,780$0.90 / $0.9065.5K
282
270291
Alibaba · Qianwen LICENSE
1166±10
6,370N/AN/A
283
269292
Reka AI · Proprietary
1165±13
2,879N/AN/A
284
274292
Mistral · Proprietary
1163±10
5,149$2.70 / $8.1032K
285
270294
Cohere
Cohere · CC-BY-NC-4.0
1161±14
1,567N/AN/A
286
274294
InternLM · Other
1160±14
1,684$0 / $032.8K
287
269300
Alibaba · Apache 2.0
1156±24
566$0.50 / $116.4K
288
275295
Alibaba · Qianwen LICENSE
1155±11
3,930N/AN/A
289
275295
Reka AI · Proprietary
1154±11
4,748N/AN/A
290
278295
Meta
Meta · Llama 3 Community
1152±8
18,374$0.14 / $0.148.2K
291
272305
IBM · Apache 2.0
1150±24
508N/AN/A
292
283303
Nexusflow · Apache-2.0
1143±13
2,948N/AN/A
293
285305
Alibaba · Qianwen LICENSE
1138±13
3,208$0.30 / $0.30N/A
294
287303
OpenAI · Proprietary
1137±8
11,130$0.50 / $1.5016.4K
295
290306
Databricks · DBRX LICENSE
1132±11
5,502$0.60 / $0.6032.8K
296
290307
Microsoft · MIT
1131±10
3,973$0.17 / $0.68N/A
297
285312
HuggingFace · Apache 2.0
1130±20
831N/AN/A
298
290308
Cohere
Cohere · CC-BY-NC-4.0
1128±9
9,645$0.15 / $0.60128K
299
290308
Mistral · Apache 2.0
1127±8
11,784$0.63 / $0.6332K
300
290319
AllenAI/UW · AI2 ImpACT Low-risk
1117±21
805N/AN/A
301
291318
OpenAI · Proprietary
1116±16
2,121$1 / $216.4K
302
293318
OpenChat · Apache-2.0
1113±14
2,005N/AN/A
303
291319
IBM · Apache 2.0
1113±18
1,108N/AN/A
304
296314
Google · Gemma license
1113±8
7,298N/AN/A
305
295318
01.AI
01 AI · Yi License
1112±13
2,345$0.90 / $0.904.1K
306
291328
Google · Proprietary
1108±23
678$0.35 / $1.0532.8K
307
293327
Alibaba · Qianwen LICENSE
1108±20
772$0.20 / $0.20N/A
308
299325
Google · Proprietary
1105±14
2,681$0.35 / $1.0532.8K
309
299325
Microsoft · MIT
1102±12
3,219$0.15 / $0.60N/A
310
299328
Meta
Meta · Llama 3.2
1098±16
1,351$0.05 / $0.34131.1K
311
299328
UC Berkeley · CC-BY-NC-4.0
1098±16
1,397N/AN/A
312
297331
DeepSeek · DeepSeek License
1096±24
649N/AN/A
313
300329
1094±14
1,841$0.13 / $0.524.1K
314
301328
Microsoft · MIT
1093±12
3,449$0.13 / $0.52N/A
315
300331
IBM · Apache 2.0
1091±17
1,134N/AN/A
316
301329
Snowflake · Apache 2.0
1090±11
5,734N/AN/A
317
306331
Google · Gemma license
1084±10
4,332$0.03 / $0.098.2K
318
306331
Mistral · Apache-2.0
1082±12
3,114$0.20 / $0.2032.8K
319
301334
Microsoft · Llama 2 Community
1082±20
988N/AN/A
320
301335
NousResearch · Apache-2.0
1080±23
575$0.90 / $0.90N/A
321
308331
Meta
Meta · Llama 2 Community
1079±10
5,717$0.70 / $2.804.1K
322
306332
LMSYS · Non-commercial
1079±13
2,866$0 / $02K
323
306335
OpenChat · Apache-2.0
1075±19
971$0.20 / $0.20N/A
324
304339
Alibaba · Qianwen LICENSE
1072±24
599N/AN/A
325
309335
Meta
Meta · Llama 3.2
1071±16
1,346$0.03 / $0.20131.1K
326
306339
Upstage AI · CC-BY-NC-4.0
1067±27
482$0.30 / $0.30N/A
327
308339
NousResearch · Apache-2.0
1065±23
589$0.17 / $0.17N/A
328
315338
Meta
Meta · Llama 2 Community
1063±13
2,626$0.25 / $0.254.1K
329
306348
HuggingFace · MIT
1053±40
201N/AN/A
330
321342
Google · Gemma license
1049±16
1,381$0.05 / $0.088.2K
331
313347
HuggingFace · Apache 2.0
1048±32
352N/AN/A
332
320344
Meta
Meta · Llama 2 Community
1046±20
853$0.35 / $1.4016.4K
333
321344
HuggingFace · MIT
1045±18
1,250$0.15 / $0.1516.4K
334
325344
LMSYS · Llama 2 Community
1041±14
2,389$0.30 / $0.30N/A
335
325344
Microsoft · MIT
1040±13
3,886$0.13 / $0.52N/A
336
315348
MosaicML · CC-BY-NC-SA-4.0
1040±35
258N/AN/A
337
322348
Microsoft · Llama 2 Community
1035±22
735$0.30 / $0.30N/A
338
326345
Google · Gemma license
1035±14
1,963N/AN/A
339
325348
Nvidia · Llama 2 Community
1025±28
467N/AN/A
340
329348
Mistral · Apache 2.0
1019±20
1,032$0.07 / $0.284.1K
341
329348
Ai2 · Apache-2.0
1016±22
772$0.20 / $0.20N/A
342
329349
LMSYS · Llama 2 Community
1011±23
726$0.20 / $0.20N/A
343
330349
Google · Gemma license
1010±22
742$0.10 / $0.10N/A
344
335349
Meta
Meta · Llama 2 Community
1002±14
1,956$0.15 / $0.154.1K
345
334349
Together AI · Apache 2.0
1002±22
704$0.20 / $0.20N/A
346
335349
Alibaba · Qianwen LICENSE
999±17
1,283$0.10 / $0.10N/A
347
330350
UW · Non-commercial
996±35
263N/AN/A
348
336349
Google · Proprietary
995±21
917$0.50 / $0.5025.8K
349
342353
Tsinghua · Apache-2.0
965±26
535N/AN/A
350
348355
UC Berkeley · Non-commercial
945±24
747N/AN/A
351
349356
RWKV · Apache 2.0
928±27
505N/AN/A
352
349356
Tsinghua · Non-commercial
920±26
551N/AN/A
353
349356
MosaicML · CC-BY-NC-SA-4.0
914±31
397N/AN/A
354
350356
Tsinghua · Apache-2.0
902±35
293N/AN/A
355
350356
OpenAssistant · Apache 2.0
901±25
714N/AN/A
356
351356
Stability
Stability AI · CC-BY-NC-SA-4.0
887±32
363N/AN/A
357
357359
Stanford · Non-commercial
799±27
626N/AN/A
358
357359
Databricks · MIT
777±33
396N/AN/A
359
357359
LMSYS · Apache 2.0
764±30
428N/AN/A
360
360360
Meta
Meta · Non-commercial
683±38
304$0.23 / $0.23N/A

Remove Style Control Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Battle Count for Each Combination of Models (without Ties)