Text Arena🧮Mathematical

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Jun 5, 2026
519,990 votes
347 models
Rank Spread
1
117
Anthropic
Anthropic · Proprietary
1523±14
1,923$5 / $251M
2
117
Anthropic
Anthropic · Proprietary
1522±13
2,219$5 / $251M
3
126
Xiaomi · MIT
1512±19
1,039$0.43 / $0.871M
4
124
OpenAI · Proprietary
1511±15
1,796$2.50 / $151.1M
5
137
Google · Proprietary
1506±25
606$1.50 / $91M
6
131
OpenAI · Proprietary
1505±18
1,212$5 / $301.1M
7
131
Anthropic
Anthropic · Proprietary
1505±16
1,490$5 / $251M
8
149
Anthropic
Anthropic · Proprietary
1503±33
324$5 / $251M
9
132
Anthropic
Anthropic · Proprietary
1502±17
1,434$5 / $251M
10
135
Moonshot · Modified MIT
1500±19
1,034$0.95 / $4262.1K
11
139
Baidu · Proprietary
1500±20
971N/AN/A
12
137
OpenAI · Proprietary
1498±18
1,236$5 / $301.1M
13
157
Alibaba · Proprietary
1494±32
324$1.04 / $6.24262.1K
14
335
Google · Proprietary
1494±12
2,624$2 / $121M
15
147
Alibaba · Proprietary
1493±18
1,159N/AN/A
16
166
Anthropic
Anthropic · Proprietary
1492±34
283$5 / $251M
17
174
MiniMax · Proprietary
1490±38
249$0.60 / $2.40N/A
18
350
Anthropic
Anthropic · Proprietary
1484±15
1,728$3 / $151M
19
451
Google · Proprietary
1482±13
1,941$2 / $121M
20
452
Moonshot · Modified MIT
1481±13
2,082$0.60 / $3N/A
21
549
Anthropic
Anthropic · Proprietary
1481±11
3,339$5 / $25200K
22
364
Z.ai · MIT
1480±20
874$1.40 / $4.40202.8K
23
372
Meta
Meta · Proprietary
1477±22
768N/AN/A
24
388
Google · Apache 2.0
1476±32
296N/AN/A
25
564
Xiaomi · Proprietary
1475±17
1,382$1 / $31M
26
388
Google · Apache 2.0
1474±30
322$0.14 / $0.40262.1K
27
1104
Alibaba · Proprietary
1473±40
227$1.25 / $3.751M
28
566
Google · Proprietary
1473±16
1,415$0.50 / $31M
29
572
1472±19
1,020$0.43 / $0.871M
30
1059
Anthropic
1471±10
3,851$3 / $15200K
31
771
Anthropic
1471±15
1,554$5 / $25200K
32
869
OpenAI · Proprietary
1471±14
1,863$2.50 / $151.1M
33
3104
Xiaomi · Proprietary
1468±33
308$0.40 / $2262.1K
34
872
1468±14
1,886$2 / $62M
35
1272
Alibaba · Apache 2.0
1467±14
1,977$0.39 / $2.34262.1K
36
883
DeepSeek · MIT
1465±18
1,202$0.43 / $0.871M
37
1284
Alibaba · Proprietary
1464±17
1,284$0.33 / $1.951M
38
1087
Xiaomi · MIT
1463±18
1,158$0.14 / $0.281M
39
1378
OpenAI · Proprietary
1463±13
2,356$1.75 / $14400K
40
1387
Z.ai · MIT
1462±17
1,209$1 / $3.20202.8K
41
1381
OpenAI · Proprietary
1462±14
1,721$1.25 / $10400K
42
1385
1461±14
1,844$2 / $62M
43
1386
OpenAI · Proprietary
1461±15
1,800$1.75 / $14128K
44
1487
OpenAI · Proprietary
1460±15
1,708$0.75 / $4.50400K
45
1488
OpenAI · Proprietary
1459±16
1,547$5 / $301.1M
46
5116
Mistral · Modified MIT
1457±32
324$1.50 / $7.50262.1K
47
1988
Anthropic
Anthropic · Proprietary
1453±10
3,937$3 / $15200K
48
1695
xAI · Proprietary
1453±16
1,355N/AN/A
49
1991
1452±12
2,893$0.50 / $31M
50
1991
Moonshot · Modified MIT
1452±11
2,922$1.15 / $8262.1K
51
13118
Tencent
Tencent · tencent-hunyuan-community
1451±29
388$0.29 / $1.17262.1K
52
2093
Anthropic
1450±12
2,393$15 / $75200K
53
18104
1449±18
1,157$0.10 / $0.201M
54
2191
OpenAI · Proprietary
1449±10
3,452$2 / $8200K
55
2791
Google · Proprietary
1448±8
6,551$1.25 / $101M
56
21102
Bytedance
Bytedance · Proprietary
1448±13
2,291N/AN/A
57
19105
DeepSeek · MIT
1447±18
1,194$0.10 / $0.201M
58
21104
DeepSeek · MIT
1447±14
1,747$0.23 / $0.34131.1K
59
21105
Meituan · Proprietary
1445±16
1,545N/AN/A
60
13131
Moonshot · Modified MIT
1445±31
333$0.40 / $1.90262.1K
61
13131
xAI · Proprietary
1444±32
320$3 / $15256K
62
19122
DeepSeek · MIT
1442±24
564$1.23 / $4.94N/A
63
26108
OpenAI · Proprietary
1442±14
1,747$1.25 / $10400K
64
26108
Z.ai · MIT
1442±14
1,675$0.43 / $1.74202.8K
65
23112
Alibaba · Apache 2.0
1442±16
1,478$0.20 / $1.56262.1K
66
25111
OpenAI · Proprietary
1442±15
1,635$0.20 / $1.25400K
67
32104
Anthropic
Anthropic · Proprietary
1442±10
3,769$15 / $75200K
68
25115
Alibaba · Proprietary
1441±16
1,314$0.78 / $3.90262.1K
69
21118
xAI · Proprietary
1441±19
1,057$1.25 / $2.501M
70
17132
1440±29
392$0.27 / $0.41163.8K
71
31112
DeepSeek · MIT
1439±13
2,232$0.23 / $0.34131.1K
72
31116
Google · Proprietary
1438±13
2,217$0.25 / $1.501M
73
32116
OpenAI · Proprietary
1438±13
2,027$1.25 / $10400K
74
33112
xAI · Proprietary
1437±11
2,974N/AN/A
75
33118
OpenAI · Proprietary
1435±12
2,574$1.75 / $14400K
76
32122
MiniMax · Modified MIT
1435±16
1,501$0.28 / $1.20204.8K
77
34118
xAI · Proprietary
1435±13
2,159$3 / $15256K
78
33121
Baidu · Proprietary
1434±15
1,665N/AN/A
79
45116
Alibaba · Apache 2.0
1434±9
4,874$0.26 / $1.06N/A
80
20138
Alibaba · Apache 2.0
1433±29
377$0.26 / $2.60131.1K
81
37128
Alibaba · Apache 2.0
1431±16
1,519$0.26 / $2.08262.1K
82
49122
Anthropic
Anthropic · Proprietary
1429±10
4,032$1 / $5200K
83
49126
xAI · Proprietary
1427±11
3,309N/AN/A
84
45131
OpenAI · Proprietary
1427±15
1,783$1.75 / $14128K
85
23149
Alibaba · Apache 2.0
1426±32
282$0.08 / $0.28131.1K
86
32144
Alibaba · Apache 2.0
1426±25
491$0.10 / $0.10262.1K
87
50131
Anthropic
Anthropic · Proprietary
1425±13
2,043$15 / $75200K
88
35143
Moonshot · Modified MIT
1424±23
641$0.60 / $2.50262.1K
89
51131
xAI · Proprietary
1424±12
2,701$0.20 / $0.502M
90
25150
Baidu · Proprietary
1423±34
267N/AN/A
91
36145
Meituan · MIT
1422±24
544$0.20 / $0.80131.1K
92
38145
Z.ai · MIT
1422±24
540$0.40 / $1.75202.8K
93
51136
1422±15
1,630$0.30 / $2.501M
94
45145
MiniMax · MIT
1421±22
652$0.29 / $0.95204.8K
95
52137
Stepfun
StepFun · Apache 2.0
1421±14
1,924$0.09 / $0.30262.1K
96
51140
Z.ai · MIT
1419±17
1,257$0.60 / $2.20131.1K
97
65131
Google · Proprietary
1418±7
6,744$0.30 / $2.501M
98
41149
Baidu · Proprietary
1418±27
478N/AN/A
99
52145
Alibaba · Apache 2.0
1417±18
1,044$0.09 / $1.10262.1K
100
51145
DeepSeek · MIT
1416±19
949$0.50 / $2.15163.8K
101
51147
DeepSeek · MIT
1416±20
839$1.23 / $4.94N/A
102
59144
Mistral · Apache 2.0
1414±14
1,952$0.50 / $1.50N/A
103
65140
OpenAI · Proprietary
1414±11
2,644$15 / $60200K
104
59145
DeepSeek · MIT
1414±15
1,504$0.70 / $2.50163.8K
105
59145
OpenAI · Proprietary
1414±16
1,283$75 / $150128K
106
51149
DeepSeek · MIT
1414±23
634$0.27 / $0.41163.8K
107
61145
Alibaba · Proprietary
1413±15
1,787N/AN/A
108
61145
OpenAI · Proprietary
1413±14
1,711$1.10 / $4.40200K
109
57149
xAI · Proprietary
1412±19
938$0.20 / $0.502M
110
50150
Alibaba · Proprietary
1412±26
476$0.78 / $3.90262.1K
111
66145
Anthropic
1412±13
1,903$3 / $151M
112
61147
Alibaba · Apache 2.0
1412±16
1,521$0.14 / $1262.1K
113
51150
Alibaba · Apache 2.0
1412±24
587$0.20 / $0.88262.1K
114
75140
OpenAI · Proprietary
1411±9
4,759$5 / $15128K
115
74145
Anthropic
Anthropic · Proprietary
1410±11
2,639$15 / $75200K
116
74145
OpenAI · Proprietary
1410±11
2,779$1.10 / $4.40200K
117
65149
OpenAI · Proprietary
1409±17
1,283$0.25 / $2400K
118
70148
OpenAI · Proprietary
1409±15
1,562$1.25 / $10128K
119
78145
Mistral · Proprietary
1408±9
4,721$2.70 / $8.1032K
120
34174
1407±40
183$0.10 / $0.40131.1K
121
79149
Alibaba · Apache 2.0
1404±12
2,304$0.46 / $1.82131.1K
122
78149
1403±13
2,207$0.10 / $0.30262.1K
123
80150
MiniMax · Modified MIT
1401±13
2,175$0.15 / $1.15204.8K
124
79150
Alibaba · Apache 2.0
1401±15
1,501$0.46 / $1.82131.1K
125
78153
Alibaba · Apache 2.0
1401±17
1,206$0.05 / $0.19131.1K
126
61172
Nvidia · NVIDIA Open Model
1399±27
417N/AN/A
127
87150
OpenAI · Proprietary
1398±10
3,876$15 / $60N/A
128
80164
1396±18
1,053N/AN/A
129
94154
OpenAI · Proprietary
1392±9
4,380$1.10 / $4.40200K
130
90161
xAI · Proprietary
1392±12
2,448$3 / $15131.1K
131
88168
Z.ai · MIT
1391±15
1,452$0.13 / $0.85131.1K
132
78175
1391±27
426$0.10 / $0.30262.1K
133
90164
Anthropic
Anthropic · Proprietary
1391±12
2,320$3 / $151M
134
93163
Anthropic
1390±11
2,559$3 / $15200K
135
80175
Z.ai · MIT
1389±26
481$0.06 / $0.40202.8K
136
87174
Alibaba · Apache 2.0
1387±22
669$0.10 / $0.78262.1K
137
93173
Moonshot · Modified MIT
1387±15
1,565$0.60 / $2.50131.1K
138
87175
1387±24
594N/AN/A
139
70182
Tencent
Tencent · Proprietary
1386±38
225N/AN/A
140
93173
Arcee AI · Apache 2.0
1385±16
1,546$0.22 / $0.85262.1K
141
96173
OpenAI · Apache 2.0
1384±15
1,544$0.04 / $0.18131.1K
142
74182
Prime Intellect · MIT
1384±36
234$0.20 / $1.10131.1K
143
111173
MiniMax · Apache 2.0
1382±14
1,778$0.40 / $2.201M
144
109174
Alibaba · Apache 2.0
1381±16
1,460$0.40 / $1.60262.1K
145
109174
Arcee AI · Apache 2.0
1381±16
1,557$0.15 / $0.45131K
146
86185
Stepfun
StepFun · Apache 2.0
1378±33
305$0.57 / $1.4265.5K
147
125173
OpenAI · Proprietary
1376±8
6,318$1.10 / $4.40N/A
148
112176
xAI · Proprietary
1376±18
987$0.25 / $1.27N/A
149
127175
OpenAI · Proprietary
1373±11
3,058$2 / $81M
150
87188
MiniMax · Apache 2.0
1373±35
285$0.26 / $1204.8K
151
126176
1371±13
2,074$0.10 / $0.401M
152
127176
1369±13
1,986$0.10 / $0.401M
153
128176
DeepSeek · MIT
1368±11
2,925$3 / $4.5032.8K
154
129176
Alibaba · Proprietary
1368±10
2,942N/AN/A
155
127178
Alibaba · Apache 2.0
1368±14
1,579$0.12 / $0.50131.1K
156
89194
1367±37
207N/AN/A
157
128179
Alibaba · Apache 2.0
1365±14
1,579$0.50 / $116.4K
158
131178
OpenAI · Proprietary
1365±12
2,481$0.40 / $1.601M
159
132178
Anthropic
Anthropic · Proprietary
1363±10
3,187$3 / $15200K
160
133178
Google · Proprietary
1363±9
3,765$0.10 / $0.401M
161
131182
xAI · Proprietary
1362±15
1,462$0.30 / $0.50131.1K
162
125188
Nvidia · NVIDIA Open Model
1362±23
661$0.06 / $0.24262.1K
163
119194
Ant Group · MIT
1360±29
365N/AN/A
164
142179
Anthropic
Anthropic · Proprietary
1358±7
8,624$3 / $15200K
165
132188
Mistral · Apache 2.0
1357±18
993$0.10 / $0.3032K
166
131188
Tencent
Tencent · Proprietary
1357±20
795N/AN/A
167
138185
Mistral · Proprietary
1356±12
2,192$0.40 / $2131.1K
168
125201
Ant Group · MIT
1355±30
357N/AN/A
169
127200
OpenAI · Proprietary
1354±28
439$0.05 / $0.40400K
170
127200
Ai2 · Apache 2.0
1354±28
428$0.20 / $0.6065.5K
171
146185
Google · Proprietary
1353±8
6,506$3.50 / $10.502.1M
172
132199
Amazon · Proprietary
1351±24
582$0.30 / $2.501M
173
127202
Google · Gemma
1350±30
333$0.05 / $0.15131.1K
174
127226
IBM · Apache 2.0
1341±41
207$0.05 / $0.10131.1K
175
157194
Google · Gemma
1341±10
3,194$0.08 / $0.16131.1K
176
131223
Z.ai · MIT
1340±37
234$0.60 / $1.8065.5K
177
160194
Anthropic
Anthropic · Proprietary
1339±8
10,038$3 / $15200K
178
157200
1338±11
2,399$0.07 / $0.301M
179
132228
Ai2 · Apache 2.0
1335±40
202$0.15 / $0.5065.5K
180
151210
Alibaba · Proprietary
1335±22
627$0.40 / $1.20131.1K
181
151219
OpenAI · Apache 2.0
1330±24
583$0.03 / $0.14131.1K
182
167209
Cohere
Cohere · CC-BY-NC-4.0
1326±10
3,635$2.50 / $10256K
183
155226
Tencent
Tencent · Proprietary
1326±26
431N/AN/A
184
157223
Stepfun
StepFun · Proprietary
1325±24
569N/AN/A
185
160223
Stepfun
StepFun · Proprietary
1324±22
543N/AN/A
186
167210
NexusFlow · NexusFlow
1322±10
2,942N/AN/A
187
167213
1320±11
2,593$0.63 / $1.80131.1K
188
171210
Meta
Meta · Llama 3.1 Community
1319±8
7,242$4 / $432.8K
189
172213
Meta
Meta · Llama 3.1 Community
1318±9
4,366$4 / $432.8K
190
171219
DeepSeek · DeepSeek
1318±12
2,326$1.14 / $4.56N/A
191
171213
Google · Proprietary
1318±10
5,787N/AN/A
192
167223
Alibaba · Proprietary
1317±15
1,265N/AN/A
193
171216
01.AI
01 AI · Proprietary
1317±10
3,206N/AN/A
194
171226
1314±14
1,817$0.40 / $0.708.2K
195
176222
OpenAI · Proprietary
1313±8
5,805$2.50 / $10128K
196
163237
Tencent
Tencent · Proprietary
1313±28
364N/AN/A
197
160243
Tencent
Tencent · Proprietary
1312±34
216N/AN/A
198
177223
Google · Proprietary
1311±8
9,328$3.50 / $10.502.1M
199
163242
Ai2 · Apache 2.0
1311±31
340$0.15 / $0.5065.5K
200
177223
OpenAI · Proprietary
1310±7
13,306$5 / $15128K
201
177223
Alibaba · Qwen
1310±9
4,565$1.20 / $1.20N/A
202
177223
Anthropic
Anthropic · Proprietary
1310±7
22,970$15 / $75200K
203
177225
OpenAI · Proprietary
1309±8
11,549$10 / $30128K
204
167239
Tencent
Tencent · Proprietary
1309±26
441N/AN/A
205
163244
IBM · Apache 2.0
1307±36
277N/AN/A
206
175239
Z.ai · Proprietary
1304±21
638N/AN/A
207
167244
Tencent
Tencent · Proprietary
1303±32
224N/AN/A
208
180232
Google · Proprietary
1302±9
4,114$0.07 / $0.301M
209
181229
Meta
Meta · Llama-3.3
1302±8
5,232$0.10 / $0.32131.1K
210
178234
Alibaba · Qwen
1302±13
1,804$1.60 / $6.4032.8K
211
177239
1300±18
874$1.20 / $1.20131.1K
212
181234
Z.ai · Proprietary
1300±11
2,991$0.44 / $1.76204.8K
213
184232
xAI · Proprietary
1300±8
7,604$2 / $10131.1K
214
184232
OpenAI · Proprietary
1299±8
11,796$10 / $30128K
215
184232
OpenAI · Proprietary
1299±8
10,830$10 / $30128K
216
185232
Anthropic
Anthropic · Proprietary
1298±8
5,786$0.80 / $4200K
217
185234
Mistral · Mistral Research
1298±8
5,745$2 / $6131.1K
218
185234
DeepSeek · DeepSeek
1296±10
3,039N/AN/A
219
177249
Mistral · Proprietary
1292±26
582$2 / $540K
220
188243
1291±13
1,939$0.10 / $0.3032K
221
200242
OpenAI · Proprietary
1289±10
6,112$30 / $608.2K
222
187246
DeepSeek · DeepSeek
1287±18
893N/AN/A
223
202242
OpenAI · Proprietary
1286±7
7,968$0.15 / $0.60128K
224
202243
xAI · Proprietary
1286±8
6,253$2 / $10131.1K
225
197246
Google · Gemma
1285±16
1,409$0.06 / $0.1232.8K
226
201243
Amazon · Proprietary
1285±10
2,650$0.80 / $3.20300K
227
202244
Mistral · MRL
1284±10
3,037$2 / $6128K
228
187255
OpenAI · Proprietary
1281±24
534$0.10 / $0.401M
229
207249
Microsoft · MIT
1279±12
2,350$0.07 / $0.1416.4K
230
212248
Meta
Meta · Llama 3.1 Community
1276±8
6,524$0.40 / $0.40131.1K
231
196257
1276±25
422N/AN/A
232
211249
Alibaba · Qianwen LICENSE
1275±10
4,235$0.90 / $0.9032.8K
233
187261
Google · Gemma
1275±30
365$0.05 / $0.10131.1K
234
196260
Ai2 · Llama 3.1
1273±28
349N/AN/A
235
215249
Google · Proprietary
1273±8
7,315$0.07 / $0.301M
236
212255
DeepSeek · DeepSeek License
1270±14
1,669$0.14 / $0.28128K
237
211255
Mistral · Apache 2.0
1270±14
1,447$0.05 / $0.0832.8K
238
211255
Reka AI · Proprietary
1270±16
1,039N/AN/A
239
218251
OpenAI · Proprietary
1269±9
9,862$30 / $608.2K
240
207258
Alibaba · Apache 2.0
1269±21
619$0.87 / $0.8732K
241
215253
NexusFlow · CC-BY-NC-4.0
1269±11
2,559N/AN/A
242
200269
Tencent
Tencent · Proprietary
1267±31
292N/AN/A
243
215258
Z.ai · Proprietary
1264±17
1,067N/AN/A
244
225255
Google · Gemma license
1263±7
8,869$0.65 / $0.658.2K
245
227255
Anthropic
Anthropic · Proprietary
1260±8
12,413$3 / $15200K
246
228255
Meta
Meta · Llama 3 Community
1260±8
18,728$0.51 / $0.748.2K
247
225261
Nvidia · NVIDIA Open Model
1258±13
2,133N/AN/A
248
227260
Amazon · Proprietary
1258±12
2,173$0.06 / $0.24300K
249
222263
AI21 Labs · Jamba Open
1257±17
962$2 / $8256K
250
232261
Google · Proprietary
1253±9
4,171$0.07 / $0.301M
251
234268
Cohere
Cohere · CC-BY-NC-4.0
1247±10
3,239N/AN/A
252
232273
Cohere
Cohere · CC-BY-NC-4.0
1247±15
1,243$2.50 / $10128K
253
233275
Reka AI · Proprietary
1245±15
1,109N/AN/A
254
241273
Mistral · Proprietary
1241±9
7,028$4 / $1232K
255
246275
Anthropic
Anthropic · Proprietary
1238±8
13,491$0.25 / $1.25200K
256
241280
Princeton · MIT
1236±16
1,139$0.03 / $0.098.2K
257
249277
Google · Gemma license
1236±8
6,297$0.03 / $0.098.2K
258
244279
Amazon · Proprietary
1236±12
2,016$0.04 / $0.14128K
259
249280
Alibaba · Qianwen LICENSE
1232±12
2,860N/AN/A
260
250280
Mistral · Apache 2.0
1230±9
6,001$0.90 / $0.9065.5K
261
233292
IBM · Apache 2.0
1229±29
341N/AN/A
262
250282
Mistral · Proprietary
1226±11
3,806$2.70 / $8.1032K
263
234294
IBM · Apache 2.0
1226±31
320N/AN/A
264
251282
Microsoft · MIT
1224±11
2,729$0.17 / $0.68N/A
265
250286
Cohere
Cohere · CC-BY-NC-4.0
1224±14
1,361$0.15 / $0.60128K
266
252282
Alibaba · Qianwen LICENSE
1222±10
4,522N/AN/A
267
252285
01.AI
01 AI · Apache-2.0
1222±12
2,568N/AN/A
268
244294
Alibaba · Apache 2.0
1221±26
422$0.50 / $116.4K
269
242295
Ai2 · Apache-2.0
1220±30
336$0.05 / $0.20128K
270
256284
Cohere
Cohere · CC-BY-NC-4.0
1220±9
8,796$2.50 / $10128K
271
254291
Reka AI · Proprietary
1217±15
1,867N/AN/A
272
250294
Mistral · MRL
1217±22
543$0.10 / $0.10131.1K
273
254292
Cohere
Cohere · CC-BY-NC-4.0
1215±16
1,146N/AN/A
274
252294
IBM · Apache 2.0
1214±21
675N/AN/A
275
257292
Alibaba · Qianwen LICENSE
1214±12
2,405N/AN/A
276
252294
Google · Proprietary
1214±21
844$0.35 / $1.0532.8K
277
256294
InternLM · Other
1213±16
1,086$0 / $032.8K
278
250298
Ai2 · Llama 3.1
1209±29
304N/AN/A
279
258294
Reka AI · Proprietary
1209±12
2,967N/AN/A
280
261296
Microsoft · MIT
1203±13
1,849$0.15 / $0.60N/A
281
261296
OpenAI · Proprietary
1202±16
1,746$1 / $216.4K
282
257299
HuggingFace · Apache 2.0
1201±24
526N/AN/A
283
267294
Meta
Meta · Llama 3 Community
1201±8
12,836$0.14 / $0.148.2K
284
264297
1197±14
1,357$0.13 / $0.524.1K
285
264297
Google · Proprietary
1196±15
1,975$0.35 / $1.0532.8K
286
267296
OpenAI · Proprietary
1196±9
7,493$0.50 / $1.5016.4K
287
267296
Meta
Meta · Llama 3.1 Community
1194±8
5,985$0.02 / $0.03131.1K
288
267296
Mistral · Apache 2.0
1194±9
8,472$0.63 / $0.6332K
289
265299
AI21 Labs · Jamba Open
1194±17
932$0.20 / $0.40256K
290
268299
Databricks · DBRX LICENSE
1190±12
3,599$0.60 / $0.6032.8K
291
267305
IBM · Apache 2.0
1188±21
751N/AN/A
292
271299
Cohere
Cohere · CC-BY-NC-4.0
1188±10
5,906$0.15 / $0.60128K
293
271304
Alibaba · Qianwen LICENSE
1184±14
1,972$0.30 / $0.30N/A
294
280303
Google · Gemma license
1181±8
5,632N/AN/A
295
266319
HuggingFace · Apache 2.0
1177±34
240N/AN/A
296
279309
Meta
Meta · Llama 3.2
1174±17
980$0.05 / $0.34131.1K
297
285310
Nexusflow · Apache-2.0
1169±15
1,816N/AN/A
298
292314
Google · Gemma license
1161±12
2,659$0.03 / $0.098.2K
299
293314
Snowflake · Apache 2.0
1160±12
4,220N/AN/A
300
292315
01.AI
01 AI · Yi License
1159±14
1,719$0.90 / $0.904.1K
301
287321
NousResearch · Apache-2.0
1159±22
568$0.17 / $0.17N/A
302
292321
Microsoft · Llama 2 Community
1156±21
764N/AN/A
303
288323
Alibaba · Qianwen LICENSE
1155±23
578$0.20 / $0.20N/A
304
294320
OpenChat · Apache-2.0
1154±15
1,467N/AN/A
305
295319
Microsoft · MIT
1154±13
2,198$0.13 / $0.52N/A
306
292324
DeepSeek · DeepSeek License
1149±27
484N/AN/A
307
295324
AllenAI/UW · AI2 ImpACT Low-risk
1142±21
751N/AN/A
308
296324
Microsoft · MIT
1142±14
2,569$0.13 / $0.52N/A
309
295324
Meta
Meta · Llama 3.2
1142±18
976$0.03 / $0.20131.1K
310
297324
Meta
Meta · Llama 2 Community
1141±11
4,052$0.70 / $2.804.1K
311
295324
Alibaba · Qianwen LICENSE
1140±26
467N/AN/A
312
299324
Mistral · Apache-2.0
1135±13
2,193$0.20 / $0.2032.8K
313
297324
UC Berkeley · CC-BY-NC-4.0
1134±17
1,103N/AN/A
314
297326
OpenChat · Apache-2.0
1132±20
826$0.20 / $0.20N/A
315
300324
LMSYS · Non-commercial
1129±14
2,352$0 / $02K
316
300329
Google · Proprietary
1123±21
801$0.50 / $0.5025.8K
317
302329
Google · Gemma license
1122±18
982$0.05 / $0.088.2K
318
305327
Meta
Meta · Llama 2 Community
1121±14
1,977$0.25 / $0.254.1K
319
303331
Meta
Meta · Llama 2 Community
1116±21
667$0.35 / $1.4016.4K
320
300334
Nvidia · Llama 2 Community
1114±30
353N/AN/A
321
306331
Google · Gemma license
1114±17
1,224N/AN/A
322
297335
MosaicML · CC-BY-NC-SA-4.0
1113±37
218N/AN/A
323
300335
Cognitive Computations · Apache-2.0
1110±34
201$0.50 / $0.5016.4K
324
305334
Upstage AI · CC-BY-NC-4.0
1109±24
529$0.30 / $0.30N/A
325
315335
HuggingFace · MIT
1094±18
1,113$0.15 / $0.1516.4K
326
317335
Meta
Meta · Llama 2 Community
1090±15
1,401$0.15 / $0.154.1K
327
315336
NousResearch · Apache-2.0
1088±26
464$0.90 / $0.90N/A
328
319335
LMSYS · Llama 2 Community
1084±15
1,884$0.30 / $0.30N/A
329
317336
Together AI · Apache 2.0
1082±23
530$0.20 / $0.20N/A
330
316337
UW · Non-commercial
1076±35
236N/AN/A
331
319337
Google · Gemma license
1075±25
525$0.10 / $0.10N/A
332
321337
Mistral · Apache 2.0
1073±21
814$0.07 / $0.284.1K
333
321337
Microsoft · Llama 2 Community
1070±23
567$0.30 / $0.30N/A
334
321337
Alibaba · Qianwen LICENSE
1067±20
820$0.10 / $0.10N/A
335
323337
Ai2 · Apache-2.0
1059±21
726$0.20 / $0.20N/A
336
328337
LMSYS · Llama 2 Community
1045±24
590$0.20 / $0.20N/A
337
330341
Tsinghua · Apache-2.0
1028±27
466N/AN/A
338
337344
RWKV · Apache 2.0
993±26
488N/AN/A
339
337345
UC Berkeley · Non-commercial
986±23
692N/AN/A
340
337345
MosaicML · CC-BY-NC-SA-4.0
985±28
397N/AN/A
341
337345
Stanford · Non-commercial
980±25
565N/AN/A
342
338346
Tsinghua · Non-commercial
973±28
466N/AN/A
343
338346
OpenAssistant · Apache 2.0
972±25
629N/AN/A
344
339347
Databricks · MIT
935±31
343N/AN/A
345
338347
Meta
Meta · Non-commercial
932±37
228$0.23 / $0.23N/A
346
342347
LMSYS · Apache 2.0
922±29
400N/AN/A
347
344347
Stability
Stability AI · CC-BY-NC-SA-4.0
895±31
294N/AN/A

Default Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Battle Count for Each Combination of Models (without Ties)