Text Arena🧮Math

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Jun 5, 2026
608,143 votes
354 models
Rank Spread
1
115
Google · Proprietary
1522±26
558$1.50 / $91M
2
114
Anthropic
Anthropic · Proprietary
1512±13
2,168$5 / $251M
3
114
Anthropic
Anthropic · Proprietary
1510±12
2,455$5 / $251M
4
119
OpenAI · Proprietary
1505±14
1,911$2.50 / $151.1M
5
144
Alibaba · Proprietary
1497±40
220$1.25 / $3.751M
6
123
Google · Proprietary
1496±11
2,911$2 / $121M
7
129
Anthropic
Anthropic · Proprietary
1490±16
1,423$5 / $251M
8
130
Anthropic
Anthropic · Proprietary
1488±16
1,395$5 / $251M
9
159
MiniMax · Proprietary
1487±39
236$0.60 / $2.40N/A
10
136
Xiaomi · MIT
1484±18
1,035$0.43 / $0.871M
11
138
Baidu · Proprietary
1482±18
1,033N/AN/A
12
341
Moonshot · Modified MIT
1479±18
1,062$0.95 / $4262.1K
13
159
Anthropic
Anthropic · Proprietary
1479±31
317$5 / $251M
14
536
Google · Proprietary
1476±11
2,655$2 / $121M
15
443
OpenAI · Proprietary
1476±18
1,200$5 / $301.1M
16
447
Z.ai · MIT
1475±20
915$1.40 / $4.40202.8K
17
544
OpenAI · Proprietary
1474±17
1,200$5 / $301.1M
18
541
Google · Proprietary
1474±13
2,004$0.50 / $31M
19
544
Alibaba · Proprietary
1474±16
1,314N/AN/A
20
181
Anthropic
Anthropic · Proprietary
1472±33
282$5 / $251M
21
642
Moonshot · Modified MIT
1471±12
2,467$0.60 / $3N/A
22
181
Alibaba · Proprietary
1469±30
349$1.04 / $6.24262.1K
23
482
Google · Apache 2.0
1466±28
372N/AN/A
24
483
Google · Apache 2.0
1464±28
398$0.14 / $0.40262.1K
25
676
1461±18
1,056$0.43 / $0.871M
26
855
Anthropic
Anthropic · Proprietary
1460±9
4,280$5 / $25200K
27
769
1459±14
1,980$2 / $62M
28
864
Anthropic
1459±12
2,267$5 / $25200K
29
872
Anthropic
Anthropic · Proprietary
1457±14
1,926$3 / $151M
30
685
Meta
Meta · Proprietary
1457±20
825N/AN/A
31
881
OpenAI · Proprietary
1452±14
1,994$2.50 / $151.1M
32
887
Alibaba · Proprietary
1452±16
1,332$0.33 / $1.951M
33
1182
Alibaba · Apache 2.0
1451±12
2,267$0.39 / $2.34262.1K
34
1573
Google · Proprietary
1450±7
7,598$1.25 / $101M
35
1381
1450±10
3,427$0.50 / $31M
36
1087
Alibaba · Proprietary
1450±15
1,525$0.78 / $3.90262.1K
37
1089
Xiaomi · Proprietary
1449±15
1,585$1 / $31M
38
6113
Mistral · Modified MIT
1448±32
314$1.50 / $7.50262.1K
39
1881
Anthropic
1448±9
4,862$3 / $15200K
40
1992
OpenAI · Proprietary
1443±12
2,499$1.25 / $10400K
41
8113
Moonshot · Modified MIT
1442±25
513$0.40 / $1.90262.1K
42
6127
Xiaomi · Proprietary
1442±35
289$0.40 / $2262.1K
43
14102
Alibaba · Apache 2.0
1442±17
1,212$0.09 / $1.10262.1K
44
1993
OpenAI · Proprietary
1442±11
2,943$1.75 / $14400K
45
11110
Meituan · MIT
1441±22
688$0.20 / $0.80131.1K
46
6131
1441±39
207N/AN/A
47
18105
DeepSeek · MIT
1439±17
1,191$0.10 / $0.201M
48
20101
Bytedance
Bytedance · Proprietary
1438±12
2,513N/AN/A
49
11114
Alibaba · Proprietary
1438±24
584$0.78 / $3.90262.1K
50
19102
Baidu · Proprietary
1438±13
2,126N/AN/A
51
19102
1437±14
1,958$2 / $62M
52
19105
xAI · Proprietary
1436±15
1,563N/AN/A
53
19110
Xiaomi · MIT
1436±17
1,114$0.14 / $0.281M
54
22102
DeepSeek · MIT
1436±11
2,980$0.23 / $0.34131.1K
55
20109
Z.ai · MIT
1435±16
1,378$1 / $3.20202.8K
56
22108
OpenAI · Proprietary
1434±13
2,038$1.75 / $14128K
57
19112
DeepSeek · MIT
1434±17
1,263$0.43 / $0.871M
58
20109
Alibaba · Apache 2.0
1434±15
1,620$0.20 / $1.56262.1K
59
23108
Z.ai · MIT
1433±13
2,107$0.43 / $1.74202.8K
60
27105
Moonshot · Modified MIT
1432±10
3,746$1.15 / $8262.1K
61
23112
Meituan · Proprietary
1432±14
1,704N/AN/A
62
23112
1432±14
1,584N/AN/A
63
27102
Alibaba · Apache 2.0
1432±8
5,893$0.26 / $1.06N/A
64
25109
Google · Proprietary
1431±12
2,401$0.25 / $1.501M
65
27108
Anthropic
1430±11
3,026$15 / $75200K
66
26120
Z.ai · MIT
1427±16
1,425$0.60 / $2.20131.1K
67
27120
Alibaba · Apache 2.0
1427±14
1,729$0.26 / $2.08262.1K
68
23127
1427±20
806N/AN/A
69
19135
1426±27
480$0.27 / $0.41163.8K
70
27124
OpenAI · Proprietary
1425±16
1,460$5 / $301.1M
71
22129
Alibaba · Apache 2.0
1425±23
704$0.20 / $0.88262.1K
72
38114
OpenAI · Proprietary
1425±10
3,730$2 / $8200K
73
26127
1424±18
1,151$0.10 / $0.201M
74
35120
xAI · Proprietary
1424±12
2,265$3 / $15256K
75
24128
DeepSeek · MIT
1424±20
775$0.27 / $0.41163.8K
76
24129
Z.ai · MIT
1423±21
710$0.40 / $1.75202.8K
77
39118
xAI · Proprietary
1423±10
3,789N/AN/A
78
41119
Anthropic
Anthropic · Proprietary
1422±9
4,724$15 / $75200K
79
41120
Anthropic
Anthropic · Proprietary
1421±9
4,867$3 / $15200K
80
32129
DeepSeek · MIT
1420±18
993$1.23 / $4.94N/A
81
18150
1420±37
234N/AN/A
82
38128
OpenAI · Proprietary
1420±14
1,837$0.75 / $4.50400K
83
40124
OpenAI · Proprietary
1420±11
3,052$1.75 / $14400K
84
22142
Tencent
Tencent · tencent-hunyuan-community
1420±28
390$0.29 / $1.17262.1K
85
41123
xAI · Proprietary
1419±9
4,200N/AN/A
86
39128
OpenAI · Proprietary
1418±15
1,687$0.20 / $1.25400K
87
41127
DeepSeek · MIT
1418±12
2,483$0.23 / $0.34131.1K
88
23143
xAI · Proprietary
1418±29
399$3 / $15256K
89
41128
1417±13
1,944$0.30 / $2.501M
90
26144
Alibaba · Apache 2.0
1415±28
428$0.26 / $2.60131.1K
91
36142
DeepSeek · MIT
1414±22
663$1.23 / $4.94N/A
92
42128
Mistral · Apache 2.0
1414±11
2,766$0.50 / $1.50N/A
93
35143
Alibaba · Apache 2.0
1412±24
489$0.10 / $0.10262.1K
94
41139
OpenAI · Proprietary
1412±15
1,393$75 / $150128K
95
58128
Mistral · Proprietary
1411±8
5,780$2.70 / $8.1032K
96
24160
Baidu · Proprietary
1410±34
268N/AN/A
97
61128
Google · Proprietary
1410±7
7,837$0.30 / $2.501M
98
50135
OpenAI · Proprietary
1410±11
2,865$1.25 / $10400K
99
22164
Tencent
Tencent · Proprietary
1410±38
236N/AN/A
100
47140
OpenAI · Proprietary
1409±14
1,786$1.25 / $10128K
101
47140
MiniMax · Modified MIT
1408±15
1,595$0.27 / $1.08204.8K
102
20166
1408±41
201$0.27 / $0.95163.8K
103
50140
Alibaba · Proprietary
1408±13
2,055N/AN/A
104
56140
Stepfun
StepFun · Apache 2.0
1407±12
2,317$0.09 / $0.30262.1K
105
41150
Baidu · Proprietary
1406±23
619N/AN/A
106
65135
OpenAI · Proprietary
1406±8
5,726$5 / $15128K
107
63140
xAI · Proprietary
1405±10
3,449$0.20 / $0.502M
108
53143
Alibaba · Apache 2.0
1405±14
1,721$0.14 / $1262.1K
109
50144
xAI · Proprietary
1404±18
1,085$0.20 / $0.502M
110
47151
DeepSeek · MIT
1403±20
869$0.50 / $2.15163.8K
111
36165
1403±33
263N/AN/A
112
34169
DeepSeek · MIT
1399±38
219$0.27 / $0.95163.8K
113
71144
Alibaba · Apache 2.0
1399±12
2,390$0.46 / $1.82131.1K
114
41165
Alibaba · Apache 2.0
1398±30
316$0.08 / $0.28131.1K
115
67151
Z.ai · MIT
1398±15
1,540$0.13 / $0.85131.1K
116
71147
OpenAI · Proprietary
1398±14
1,886$1.25 / $10400K
117
58160
Moonshot · Modified MIT
1397±21
759$0.60 / $2.50262.1K
118
74145
1397±11
2,793$0.10 / $0.30262.1K
119
65160
xAI · Proprietary
1396±18
1,064$1.25 / $2.501M
120
74152
OpenAI · Proprietary
1396±13
1,909$1.10 / $4.40200K
121
72153
Alibaba · Apache 2.0
1395±14
1,604$0.46 / $1.82131.1K
122
66160
MiniMax · MIT
1395±18
1,010$0.29 / $0.95204.8K
123
71156
Alibaba · Apache 2.0
1395±15
1,426$0.05 / $0.19131.1K
124
65162
Alibaba · Apache 2.0
1395±20
829$0.10 / $0.78262.1K
125
39171
1394±39
194$0.10 / $0.40131.1K
126
78158
DeepSeek · MIT
1392±14
1,606$0.70 / $2.50163.8K
127
89152
Anthropic
Anthropic · Proprietary
1391±9
4,985$1 / $5200K
128
85158
Anthropic
Anthropic · Proprietary
1390±12
2,240$15 / $75200K
129
89157
xAI · Proprietary
1390±11
2,677$3 / $15131.1K
130
89160
OpenAI · Proprietary
1388±11
2,986$15 / $60200K
131
88163
OpenAI · Apache 2.0
1388±14
1,793$0.04 / $0.18131.1K
132
92160
OpenAI · Proprietary
1387±11
2,938$1.10 / $4.40200K
133
92164
OpenAI · Proprietary
1384±14
2,003$1.75 / $14128K
134
88167
xAI · Proprietary
1384±18
977$0.25 / $1.27N/A
135
65172
Prime Intellect · MIT
1383±31
333$0.20 / $1.10131.1K
136
74170
Nvidia · NVIDIA Open Model
1382±25
515N/AN/A
137
98166
MiniMax · Modified MIT
1380±12
2,387$0.15 / $0.90204.8K
138
89170
1379±22
633$0.10 / $0.30262.1K
139
100168
OpenAI · Proprietary
1377±15
1,460$0.25 / $2400K
140
103168
Anthropic
1374±13
2,023$3 / $151M
141
108168
DeepSeek · MIT
1374±10
3,191$3 / $4.5032.8K
142
98171
Nvidia · NVIDIA Open Model
1374±18
987$0.06 / $0.24262.1K
143
107168
1374±11
2,877$0.10 / $0.401M
144
114167
OpenAI · Proprietary
1373±8
4,722$1.10 / $4.40200K
145
108168
Anthropic
Anthropic · Proprietary
1373±11
2,769$15 / $75200K
146
112168
OpenAI · Proprietary
1373±10
4,569$15 / $60N/A
147
92175
Ant Group · MIT
1372±27
460N/AN/A
148
106171
Arcee AI · Apache 2.0
1370±16
1,551$0.22 / $0.85262.1K
149
108171
xAI · Proprietary
1370±14
1,528$0.30 / $0.50131.1K
150
117169
Alibaba · Proprietary
1369±10
3,305N/AN/A
151
119170
OpenAI · Proprietary
1368±10
3,227$2 / $81M
152
115172
Moonshot · Modified MIT
1367±14
1,693$0.60 / $2.50131.1K
153
92179
Stepfun
StepFun · Apache 2.0
1367±31
352$0.57 / $1.4265.5K
154
115172
Alibaba · Apache 2.0
1366±15
1,627$0.40 / $1.60262.1K
155
125172
1363±12
2,095$0.10 / $0.401M
156
125173
MiniMax · Apache 2.0
1362±13
1,797$0.40 / $2.201M
157
115178
Amazon · Proprietary
1360±20
825$0.30 / $2.501M
158
93185
Nvidia · Nvidia Open Model
1359±37
209$0.60 / $1.80131.1K
159
127175
Alibaba · Apache 2.0
1359±14
1,720$0.50 / $116.4K
160
119179
Tencent
Tencent · Proprietary
1359±19
845N/AN/A
161
116179
Z.ai · MIT
1358±21
718$0.06 / $0.40202.8K
162
133173
OpenAI · Proprietary
1358±8
7,499$1.10 / $4.40N/A
163
129175
Anthropic
Anthropic · Proprietary
1358±12
2,475$3 / $151M
164
131178
Alibaba · Apache 2.0
1355±14
1,707$0.12 / $0.50131.1K
165
107189
MiniMax · Apache 2.0
1352±33
319$0.26 / $1204.8K
166
135179
Mistral · Proprietary
1352±12
2,228$0.40 / $2131.1K
167
141176
Google · Proprietary
1352±9
4,067$0.10 / $0.401M
168
110189
Z.ai · MIT
1349±34
277$0.60 / $1.8065.5K
169
125187
Ant Group · MIT
1348±27
454N/AN/A
170
150181
OpenAI · Proprietary
1343±11
2,694$0.40 / $1.601M
171
143185
Mistral · Apache 2.0
1341±18
1,042$0.10 / $0.3032K
172
156182
Anthropic
1337±11
2,793$3 / $15200K
173
154186
Arcee AI · Apache 2.0
1336±14
1,857$0.15 / $0.45131K
174
159196
Alibaba · Proprietary
1326±19
732$0.40 / $1.20131.1K
175
167195
Anthropic
Anthropic · Proprietary
1318±10
3,358$3 / $15200K
176
160204
Stepfun
StepFun · Proprietary
1318±24
564N/AN/A
177
162204
OpenAI · Apache 2.0
1318±22
680$0.03 / $0.14131.1K
178
146218
IBM · Apache 2.0
1317±39
229$0.05 / $0.10131.1K
179
160208
OpenAI · Proprietary
1316±26
494$0.05 / $0.40400K
180
156213
Ai2 · Apache 2.0
1315±32
314$0.15 / $0.5065.5K
181
170196
Google · Proprietary
1315±7
7,610$3.50 / $10.502.1M
182
172200
Google · Gemma
1311±9
3,581$0.08 / $0.16131.1K
183
166209
Ai2 · Apache 2.0
1311±23
697$0.20 / $0.6065.5K
184
171200
DeepSeek · DeepSeek
1311±11
2,721$1.14 / $4.56N/A
185
172201
1309±10
2,814$0.07 / $0.301M
186
166216
Google · Gemma
1307±27
389$0.05 / $0.15131.1K
187
174200
Anthropic
Anthropic · Proprietary
1307±6
10,019$3 / $15200K
188
168213
Stepfun
StepFun · Proprietary
1304±20
642N/AN/A
189
174204
Anthropic
Anthropic · Proprietary
1303±7
11,359$3 / $15200K
190
174206
NexusFlow · NexusFlow
1300±9
3,412N/AN/A
191
174208
1300±11
2,839$0.63 / $1.80131.1K
192
174208
01.AI
01 AI · Proprietary
1300±10
3,921N/AN/A
193
175208
Cohere
Cohere · CC-BY-NC-4.0
1299±9
3,993$2.50 / $10256K
194
168227
Ai2 · Apache 2.0
1298±26
473$0.15 / $0.5065.5K
195
174213
Alibaba · Proprietary
1297±14
1,404N/AN/A
196
168234
Tencent
Tencent · Proprietary
1294±31
238N/AN/A
197
177227
DeepSeek · DeepSeek
1288±17
1,031N/AN/A
198
177230
Z.ai · Proprietary
1287±19
721N/AN/A
199
180226
1286±13
1,944$0.40 / $0.708.2K
200
184219
OpenAI · Proprietary
1285±8
6,826$2.50 / $10128K
201
184219
OpenAI · Proprietary
1284±7
15,103$5 / $15128K
202
185221
xAI · Proprietary
1283±7
8,950$2 / $10131.1K
203
185224
Alibaba · Qwen
1283±8
5,415$1.20 / $1.20N/A
204
189226
Meta
Meta · Llama 3.1 Community
1281±7
8,482$4 / $432.8K
205
177236
Tencent
Tencent · Proprietary
1281±24
497N/AN/A
206
190227
Meta
Meta · Llama 3.1 Community
1278±8
5,215$4 / $432.8K
207
190233
Alibaba · Qwen
1275±12
2,249$1.60 / $6.4032.8K
208
190233
Z.ai · Proprietary
1275±10
3,599$0.44 / $1.76204.8K
209
181241
OpenAI · Proprietary
1274±23
582$0.10 / $0.401M
210
181241
Tencent
Tencent · Proprietary
1274±24
499N/AN/A
211
177242
Tencent
Tencent · Proprietary
1273±31
243N/AN/A
212
194232
Anthropic
Anthropic · Proprietary
1272±6
25,769$15 / $75200K
213
193234
Google · Proprietary
1272±9
6,395N/AN/A
214
194233
OpenAI · Proprietary
1272±8
13,217$10 / $30128K
215
190238
1271±17
1,041$1.20 / $1.20131.1K
216
193235
DeepSeek · DeepSeek
1271±10
3,649N/AN/A
217
197234
Google · Proprietary
1269±8
10,492$3.50 / $10.502.1M
218
197234
OpenAI · Proprietary
1269±8
13,306$10 / $30128K
219
195235
Google · Proprietary
1269±9
4,789$0.07 / $0.301M
220
181245
Tencent
Tencent · Proprietary
1268±30
351N/AN/A
221
198235
OpenAI · Proprietary
1268±8
12,374$10 / $30128K
222
198235
Meta
Meta · Llama-3.3
1267±8
5,777$0.10 / $0.32131.1K
223
199235
OpenAI · Proprietary
1267±7
9,322$0.15 / $0.60128K
224
201236
xAI · Proprietary
1265±8
7,261$2 / $10131.1K
225
204238
Mistral · Mistral Research
1261±8
6,664$2 / $6131.1K
226
199241
1261±13
2,131$0.10 / $0.3032K
227
203240
Mistral · MRL
1261±9
3,574$2 / $6128K
228
218241
Meta
Meta · Llama 3.1 Community
1252±7
7,677$0.40 / $0.40131.1K
229
213243
Amazon · Proprietary
1251±10
2,978$0.80 / $3.20300K
230
193256
IBM · Apache 2.0
1251±32
358N/AN/A
231
206249
Google · Gemma
1251±15
1,572$0.06 / $0.1232.8K
232
204251
Alibaba · Apache 2.0
1250±19
725$0.87 / $0.8732K
233
198254
Mistral · Proprietary
1249±26
553$2 / $540K
234
220250
Microsoft · MIT
1246±10
2,764$0.07 / $0.1416.4K
235
223246
Anthropic
Anthropic · Proprietary
1244±7
6,364$0.80 / $4200K
236
205257
Ai2 · Llama 3.1
1242±24
397N/AN/A
237
220252
DeepSeek · DeepSeek License
1241±14
1,858$0.14 / $0.28128K
238
222252
Mistral · Apache 2.0
1240±13
1,683$0.05 / $0.0832.8K
239
205259
Google · Gemma
1239±28
423$0.05 / $0.10131.1K
240
227252
Alibaba · Qianwen LICENSE
1235±9
4,835$0.90 / $0.9032.8K
241
209263
Tencent
Tencent · Proprietary
1235±28
361N/AN/A
242
228255
NexusFlow · CC-BY-NC-4.0
1231±10
2,921N/AN/A
243
229255
OpenAI · Proprietary
1230±10
7,052$30 / $608.2K
244
222263
1230±22
507N/AN/A
245
231255
Google · Proprietary
1229±8
8,392$0.07 / $0.301M
246
230258
Amazon · Proprietary
1227±11
2,511$0.06 / $0.24300K
247
231263
Reka AI · Proprietary
1222±14
1,207N/AN/A
248
231263
AI21 Labs · Jamba Open
1221±15
1,147$2 / $8256K
249
233263
Z.ai · Proprietary
1218±15
1,191N/AN/A
250
237259
Meta
Meta · Llama 3 Community
1218±7
20,941$0.51 / $0.748.2K
251
237262
OpenAI · Proprietary
1217±8
11,181$30 / $608.2K
252
234263
Nvidia · NVIDIA Open Model
1216±12
2,352N/AN/A
253
229271
Alibaba · Apache 2.0
1213±24
480$0.50 / $116.4K
254
238263
Anthropic
Anthropic · Proprietary
1213±8
13,766$3 / $15200K
255
242263
Google · Gemma license
1212±7
10,170$0.65 / $0.658.2K
256
232278
Ai2 · Apache-2.0
1207±28
375$0.05 / $0.20128K
257
244264
Google · Proprietary
1207±8
5,036$0.07 / $0.301M
258
243266
Amazon · Proprietary
1206±11
2,455$0.04 / $0.14128K
259
246269
Mistral · Proprietary
1200±9
7,987$4 / $1232K
260
246271
Cohere
Cohere · CC-BY-NC-4.0
1200±10
3,854N/AN/A
261
246277
Reka AI · Proprietary
1195±14
1,284N/AN/A
262
241282
Ai2 · Llama 3.1
1195±25
363N/AN/A
263
247282
Mistral · MRL
1188±20
683$0.10 / $0.10131.1K
264
256277
Anthropic
Anthropic · Proprietary
1188±7
14,983$0.25 / $1.25200K
265
255279
Cohere
Cohere · CC-BY-NC-4.0
1188±14
1,467$2.50 / $10128K
266
256279
Alibaba · Qianwen LICENSE
1185±11
3,188N/AN/A
267
257279
Mistral · Apache 2.0
1184±9
6,778$0.90 / $0.9065.5K
268
258279
Google · Gemma license
1183±8
7,110$0.03 / $0.098.2K
269
257281
01.AI
01 AI · Apache-2.0
1182±11
2,985N/AN/A
270
258282
Mistral · Proprietary
1180±11
4,406$2.70 / $8.1032K
271
257286
InternLM · Other
1180±15
1,387$0 / $032.8K
272
260281
Meta
Meta · Llama 3.1 Community
1179±8
7,135$0.02 / $0.03131.1K
273
260287
Microsoft · MIT
1173±10
3,238$0.17 / $0.68N/A
274
260289
Princeton · MIT
1173±15
1,285$0.03 / $0.098.2K
275
260292
Cohere
Cohere · CC-BY-NC-4.0
1168±15
1,307N/AN/A
276
260292
Reka AI · Proprietary
1168±14
2,028N/AN/A
277
267291
Cohere
Cohere · CC-BY-NC-4.0
1164±8
9,769$2.50 / $10128K
278
267292
Alibaba · Qianwen LICENSE
1164±10
5,327N/AN/A
279
263295
AI21 Labs · Jamba Open
1160±16
1,094$0.20 / $0.40256K
280
260302
IBM · Apache 2.0
1159±26
391N/AN/A
281
272295
Reka AI · Proprietary
1155±11
3,363N/AN/A
282
272295
Alibaba · Qianwen LICENSE
1155±12
2,649N/AN/A
283
272297
Cohere
Cohere · CC-BY-NC-4.0
1155±14
1,601$0.15 / $0.60128K
284
272298
1152±14
1,568$0.13 / $0.524.1K
285
262307
IBM · Apache 2.0
1151±28
382N/AN/A
286
274295
Meta
Meta · Llama 3 Community
1151±8
14,252$0.14 / $0.148.2K
287
273299
Microsoft · MIT
1151±13
2,092$0.15 / $0.60N/A
288
269306
HuggingFace · Apache 2.0
1148±22
589N/AN/A
289
276298
Mistral · Apache 2.0
1147±8
9,663$0.63 / $0.6332K
290
275302
Databricks · DBRX LICENSE
1145±11
4,001$0.60 / $0.6032.8K
291
274306
IBM · Apache 2.0
1143±19
873N/AN/A
292
279302
OpenAI · Proprietary
1142±8
8,626$0.50 / $1.5016.4K
293
275305
OpenAI · Proprietary
1141±15
2,134$1 / $216.4K
294
283305
Google · Gemma license
1134±8
6,599N/AN/A
295
279309
Google · Proprietary
1132±14
2,274$0.35 / $1.0532.8K
296
279311
Google · Proprietary
1129±19
993$0.35 / $1.0532.8K
297
283311
Meta
Meta · Llama 3.2
1126±16
1,136$0.05 / $0.34131.1K
298
284311
Alibaba · Qianwen LICENSE
1125±13
2,184$0.30 / $0.30N/A
299
286311
Nexusflow · Apache-2.0
1124±14
1,973N/AN/A
300
290311
Cohere
Cohere · CC-BY-NC-4.0
1120±9
6,682$0.15 / $0.60128K
301
287317
IBM · Apache 2.0
1117±19
908N/AN/A
302
287319
Microsoft · Llama 2 Community
1116±19
903N/AN/A
303
290315
01.AI
01 AI · Yi License
1114±13
2,043$0.90 / $0.904.1K
304
294316
Microsoft · MIT
1111±12
2,564$0.13 / $0.52N/A
305
295317
Snowflake · Apache 2.0
1109±11
4,793N/AN/A
306
290322
DeepSeek · DeepSeek License
1108±23
576N/AN/A
307
292321
AllenAI/UW · AI2 ImpACT Low-risk
1107±19
888N/AN/A
308
296319
Google · Gemma license
1107±11
3,039$0.03 / $0.098.2K
309
295319
OpenChat · Apache-2.0
1107±14
1,726N/AN/A
310
287328
HuggingFace · Apache 2.0
1105±33
271N/AN/A
311
296327
NousResearch · Apache-2.0
1098±20
697$0.17 / $0.17N/A
312
301325
Meta
Meta · Llama 2 Community
1091±10
4,740$0.70 / $2.804.1K
313
301327
Microsoft · MIT
1089±13
2,813$0.13 / $0.52N/A
314
301328
Meta
Meta · Llama 3.2
1086±16
1,162$0.03 / $0.20131.1K
315
305328
Mistral · Apache-2.0
1085±12
2,605$0.20 / $0.2032.8K
316
305329
UC Berkeley · CC-BY-NC-4.0
1081±16
1,300N/AN/A
317
302331
Alibaba · Qianwen LICENSE
1080±20
690$0.20 / $0.20N/A
318
301335
Cognitive Computations · Apache-2.0
1077±32
219$0.50 / $0.5016.4K
319
303335
Nvidia · Llama 2 Community
1072±27
440N/AN/A
320
308334
OpenChat · Apache-2.0
1071±18
945$0.20 / $0.20N/A
321
310331
LMSYS · Non-commercial
1071±12
2,663$0 / $02K
322
308335
Alibaba · Qianwen LICENSE
1068±24
534N/AN/A
323
310334
Google · Gemma license
1066±16
1,120$0.05 / $0.088.2K
324
311334
Meta
Meta · Llama 2 Community
1065±13
2,218$0.25 / $0.254.1K
325
309337
Upstage AI · CC-BY-NC-4.0
1064±22
604$0.30 / $0.30N/A
326
310337
NousResearch · Apache-2.0
1060±21
628$0.90 / $0.90N/A
327
313338
Meta
Meta · Llama 2 Community
1056±19
770$0.35 / $1.4016.4K
328
316340
Google · Proprietary
1049±19
901$0.50 / $0.5025.8K
329
317339
Google · Gemma license
1047±16
1,355N/AN/A
330
311341
MosaicML · CC-BY-NC-SA-4.0
1047±34
242N/AN/A
331
319340
Meta
Meta · Llama 2 Community
1042±14
1,656$0.15 / $0.154.1K
332
319340
HuggingFace · MIT
1041±17
1,250$0.15 / $0.1516.4K
333
319341
Together AI · Apache 2.0
1033±20
676$0.20 / $0.20N/A
334
317342
UW · Non-commercial
1033±33
280N/AN/A
335
325340
LMSYS · Llama 2 Community
1030±14
2,146$0.30 / $0.30N/A
336
322342
Mistral · Apache 2.0
1027±19
974$0.07 / $0.284.1K
337
325342
Alibaba · Qianwen LICENSE
1026±18
988$0.10 / $0.10N/A
338
328342
Ai2 · Apache-2.0
1018±19
848$0.20 / $0.20N/A
339
327342
Microsoft · Llama 2 Community
1017±21
669$0.30 / $0.30N/A
340
329342
Google · Gemma license
1009±22
597$0.10 / $0.10N/A
341
333343
LMSYS · Llama 2 Community
994±21
658$0.20 / $0.20N/A
342
335343
Tsinghua · Apache-2.0
989±23
576N/AN/A
343
341350
Nomic AI · Non-commercial
941±37
211N/AN/A
344
343350
UC Berkeley · Non-commercial
932±21
751N/AN/A
345
343350
Tsinghua · Non-commercial
926±25
525N/AN/A
346
343351
RWKV · Apache 2.0
922±24
544N/AN/A
347
343351
MosaicML · CC-BY-NC-SA-4.0
919±25
471N/AN/A
348
343352
Tsinghua · Apache-2.0
915±35
227N/AN/A
349
343352
Stanford · Non-commercial
908±23
652N/AN/A
350
343353
OpenAssistant · Apache 2.0
892±22
687N/AN/A
351
346354
Databricks · MIT
871±29
370N/AN/A
352
348354
LMSYS · Apache 2.0
862±26
462N/AN/A
353
351354
Stability
Stability AI · CC-BY-NC-SA-4.0
839±29
353N/AN/A
354
350354
Meta
Meta · Non-commercial
838±33
252$0.23 / $0.23N/A

Remove Style Control Leaderboard Plots

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)