Text Arena🧮Math

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

May 28, 2026
600,809 votes
349 models
Rank Spread
1
115
Google · Proprietary
1524±26
526$1.50 / $91M
2
113
OpenAI · Proprietary
1515±15
1,682$2.50 / $151.1M
3
113
Anthropic
Anthropic · Proprietary
1514±14
1,981$5 / $251M
4
118
Anthropic
Anthropic · Proprietary
1506±13
2,230$5 / $251M
5
121
Google · Proprietary
1500±12
2,601$2 / $121M
6
124
Anthropic
Anthropic · Proprietary
1500±18
1,161$5 / $251M
7
126
OpenAI · Proprietary
1499±19
1,003$5 / $301.1M
8
126
OpenAI · Proprietary
1497±19
1,000$5 / $301.1M
9
150
Alibaba · Proprietary
1495±40
220$1.25 / $3.751M
10
127
Anthropic
Anthropic · Proprietary
1494±17
1,170$5 / $251M
11
143
Alibaba · Proprietary
1492±31
327$1.04 / $6.24262.1K
12
138
Xiaomi · MIT
1486±20
866$0.43 / $0.871M
13
143
Z.ai · MIT
1481±20
860$1.40 / $4.40202.8K
14
344
Baidu · Proprietary
1480±20
836N/AN/A
15
345
1479±20
886$0.43 / $0.871M
16
537
Google · Proprietary
1478±11
2,653$2 / $121M
17
445
Moonshot · Modified MIT
1478±19
887$0.95 / $4262.1K
18
540
Google · Proprietary
1476±13
2,004$0.50 / $31M
19
547
Alibaba · Proprietary
1472±16
1,272N/AN/A
20
646
1472±15
1,734$2 / $62M
21
644
Moonshot · Modified MIT
1472±12
2,259$0.60 / $3N/A
22
746
Anthropic
1470±12
2,265$5 / $25200K
23
469
Google · Apache 2.0
1470±28
398$0.14 / $0.40262.1K
24
474
Google · Apache 2.0
1468±28
369N/AN/A
25
945
Anthropic
Anthropic · Proprietary
1467±10
4,078$5 / $25200K
26
760
OpenAI · Proprietary
1466±16
1,409$5 / $301.1M
27
669
Meta
Meta · Proprietary
1463±20
795N/AN/A
28
1062
OpenAI · Proprietary
1460±11
2,889$1.75 / $14400K
29
1066
1459±14
1,704$2 / $62M
30
1067
Anthropic
Anthropic · Proprietary
1458±14
1,719$3 / $151M
31
1067
OpenAI · Proprietary
1457±14
1,985$1.75 / $14128K
32
1077
Alibaba · Proprietary
1456±18
1,112$0.33 / $1.951M
33
1067
OpenAI · Proprietary
1455±12
2,500$1.25 / $10400K
34
1071
OpenAI · Proprietary
1455±14
1,763$2.50 / $151.1M
35
1266
1455±11
3,175$0.50 / $31M
36
1366
Anthropic
1454±9
4,651$3 / $15200K
37
1077
Xiaomi · Proprietary
1453±15
1,538$1 / $31M
38
1079
xAI · Proprietary
1452±15
1,491N/AN/A
39
1378
Bytedance
Bytedance · Proprietary
1450±13
2,260N/AN/A
40
2077
OpenAI · Proprietary
1448±10
3,732$2 / $8200K
41
12100
DeepSeek · MIT
1446±19
992$0.10 / $0.201M
42
1791
Alibaba · Apache 2.0
1446±13
2,045$0.39 / $2.34262.1K
43
2485
xAI · Proprietary
1444±10
3,727N/AN/A
44
2491
Anthropic
1443±11
3,028$15 / $75200K
45
13105
Xiaomi · MIT
1442±19
914$0.14 / $0.281M
46
2685
Google · Proprietary
1442±7
7,541$1.25 / $101M
47
11111
Moonshot · Modified MIT
1442±25
515$0.40 / $1.90262.1K
48
15106
1441±19
948$0.10 / $0.201M
49
23104
Z.ai · MIT
1440±16
1,350$1 / $3.20202.8K
50
2595
Moonshot · Modified MIT
1440±10
3,688$1.15 / $8262.1K
51
23104
Meituan · Proprietary
1440±15
1,513N/AN/A
52
24105
OpenAI · Proprietary
1439±15
1,586$0.75 / $4.50400K
53
24105
Alibaba · Proprietary
1439±15
1,524$0.78 / $3.90262.1K
54
24102
Baidu · Proprietary
1439±13
2,094N/AN/A
55
24106
OpenAI · Proprietary
1437±15
1,474$0.20 / $1.25400K
56
23110
DeepSeek · MIT
1437±18
1,045$0.43 / $0.871M
57
26105
Google · Proprietary
1436±13
2,157$0.25 / $1.501M
58
26107
OpenAI · Proprietary
1434±14
1,887$1.25 / $10400K
59
34104
Anthropic
Anthropic · Proprietary
1433±9
4,723$15 / $75200K
60
29106
OpenAI · Proprietary
1433±11
2,794$1.75 / $14400K
61
35106
xAI · Proprietary
1431±9
4,143N/AN/A
62
35109
DeepSeek · MIT
1430±11
2,954$0.25 / $0.38131.1K
63
24129
Alibaba · Proprietary
1429±24
584$0.78 / $3.90262.1K
64
25125
Z.ai · MIT
1429±21
711$0.40 / $1.75202.8K
65
23132
1429±26
481$0.27 / $0.41163.8K
66
10146
1428±39
207N/AN/A
67
22133
Tencent
Tencent · tencent-hunyuan-community
1428±28
378$0.29 / $1.17262.1K
68
35115
xAI · Proprietary
1428±12
2,264$3 / $15256K
69
32120
Alibaba · Apache 2.0
1428±15
1,561$0.20 / $1.56262.1K
70
34117
OpenAI · Proprietary
1428±14
1,944$1.75 / $14128K
71
41110
Anthropic
Anthropic · Proprietary
1427±9
4,673$3 / $15200K
72
39118
DeepSeek · MIT
1426±12
2,456$0.25 / $0.38131.1K
73
26131
xAI · Proprietary
1425±20
846$1.25 / $2.501M
74
24139
xAI · Proprietary
1424±29
399$3 / $15256K
75
41121
OpenAI · Proprietary
1424±11
2,867$1.25 / $10400K
76
41125
Alibaba · Apache 2.0
1422±14
1,682$0.26 / $2.08262.1K
77
43125
Z.ai · MIT
1421±13
2,107$0.43 / $1.74202.8K
78
45124
xAI · Proprietary
1420±10
3,382$0.20 / $0.502M
79
45125
Anthropic
Anthropic · Proprietary
1420±12
2,239$15 / $75200K
80
46121
Alibaba · Apache 2.0
1420±8
5,844$0.26 / $1.06N/A
81
41132
Alibaba · Apache 2.0
1419±17
1,212$0.09 / $1.10262.1K
82
36135
DeepSeek · MIT
1418±21
775$0.27 / $0.41163.8K
83
36138
Meituan · MIT
1417±22
689$0.20 / $0.80131.1K
84
41138
Moonshot · Modified MIT
1416±21
759$0.60 / $2.50262.1K
85
47131
OpenAI · Proprietary
1416±11
2,939$1.10 / $4.40200K
86
43135
DeepSeek · MIT
1415±18
992$1.23 / $4.94N/A
87
45135
MiniMax · Modified MIT
1415±16
1,378$0.26 / $1.20204.8K
88
41141
DeepSeek · MIT
1414±22
665$1.23 / $4.94N/A
89
46135
Z.ai · MIT
1413±15
1,424$0.60 / $2.20131.1K
90
47135
OpenAI · Proprietary
1413±14
1,787$1.25 / $10128K
91
48135
1413±13
1,945$0.30 / $2.501M
92
46141
xAI · Proprietary
1412±18
1,085$0.20 / $0.502M
93
48135
DeepSeek · MIT
1411±14
1,606$0.70 / $2.50163.8K
94
24161
1410±41
200$0.27 / $0.95163.8K
95
43146
Alibaba · Apache 2.0
1410±23
704$0.20 / $0.88262.1K
96
32155
1409±33
263N/AN/A
97
51141
OpenAI · Proprietary
1409±15
1,393$75 / $150128K
98
60135
OpenAI · Proprietary
1409±11
2,986$15 / $60200K
99
45148
Baidu · Proprietary
1407±23
618N/AN/A
100
67134
Google · Proprietary
1407±7
7,775$0.30 / $2.501M
101
61141
Stepfun
StepFun · Apache 2.0
1406±13
2,146$0.09 / $0.30262.1K
102
59145
OpenAI · Proprietary
1406±15
1,460$0.25 / $2400K
103
60141
OpenAI · Proprietary
1406±13
1,909$1.10 / $4.40200K
104
43155
Alibaba · Apache 2.0
1405±28
428$0.26 / $2.60131.1K
105
70137
OpenAI · Proprietary
1404±8
5,725$5 / $15128K
106
66141
Anthropic
Anthropic · Proprietary
1403±11
2,768$15 / $75200K
107
65146
Anthropic
1403±13
2,023$3 / $151M
108
64146
Alibaba · Proprietary
1403±14
1,865N/AN/A
109
67143
Mistral · Apache 2.0
1402±11
2,737$0.50 / $1.50N/A
110
64146
Alibaba · Apache 2.0
1402±14
1,666$0.14 / $1262.1K
111
36167
Tencent
Tencent · Proprietary
1402±38
236N/AN/A
112
40167
1400±37
234N/AN/A
113
43165
Baidu · Proprietary
1400±34
268N/AN/A
114
75143
Mistral · Proprietary
1399±8
5,729$2.70 / $8.1032K
115
46162
Alibaba · Apache 2.0
1399±30
316$0.08 / $0.28131.1K
116
55158
Alibaba · Apache 2.0
1398±24
490$0.10 / $0.10262.1K
117
70148
MiniMax · Modified MIT
1398±13
2,188$0.15 / $1.15204.8K
118
68150
1398±15
1,584N/AN/A
119
64155
DeepSeek · MIT
1396±20
869$0.50 / $2.15163.8K
120
64155
1396±20
805N/AN/A
121
43170
DeepSeek · MIT
1395±39
218$0.27 / $0.95163.8K
122
78148
Anthropic
Anthropic · Proprietary
1395±9
4,744$1 / $5200K
123
75151
Alibaba · Apache 2.0
1394±12
2,390$0.46 / $1.82131.1K
124
75153
Alibaba · Apache 2.0
1393±14
1,604$0.46 / $1.82131.1K
125
70157
MiniMax · MIT
1393±18
1,010$0.29 / $0.95204.8K
126
76155
Z.ai · MIT
1391±15
1,540$0.13 / $0.85131.1K
127
46173
1390±39
194$0.10 / $0.40131.1K
128
71161
Alibaba · Apache 2.0
1389±20
829$0.10 / $0.78262.1K
129
76159
Arcee AI · Apache 2.0
1389±16
1,358$0.22 / $0.85262.1K
130
75161
xAI · Proprietary
1388±18
977$0.25 / $1.27N/A
131
80158
Moonshot · Modified MIT
1388±14
1,694$0.60 / $2.50131.1K
132
82155
Anthropic
Anthropic · Proprietary
1388±12
2,474$3 / $151M
133
93155
OpenAI · Proprietary
1386±10
4,569$15 / $60N/A
134
94158
Anthropic
1384±11
2,793$3 / $15200K
135
65173
Prime Intellect · MIT
1384±31
332$0.20 / $1.10131.1K
136
90162
OpenAI · Apache 2.0
1383±14
1,795$0.04 / $0.18131.1K
137
102158
OpenAI · Proprietary
1382±8
4,722$1.10 / $4.40200K
138
91165
Alibaba · Apache 2.0
1381±15
1,427$0.09 / $0.30262.1K
139
100162
1380±11
2,746$0.10 / $0.30262.1K
140
63178
Nvidia · Nvidia Open Model
1380±37
209$0.60 / $1.80131.1K
141
100167
Alibaba · Apache 2.0
1377±15
1,627$0.40 / $1.60262.1K
142
108167
xAI · Proprietary
1375±11
2,677$3 / $15131.1K
143
81173
Nvidia · NVIDIA Open Model
1375±25
511N/AN/A
144
90173
1374±22
633$0.10 / $0.30262.1K
145
112167
OpenAI · Proprietary
1373±10
3,227$2 / $81M
146
111169
MiniMax · Apache 2.0
1371±13
1,799$0.40 / $2.201M
147
113168
DeepSeek · MIT
1370±10
3,191$3 / $4.5032.8K
148
111171
xAI · Proprietary
1370±14
1,530$0.30 / $0.50131.1K
149
108176
Z.ai · MIT
1366±21
718$0.06 / $0.40202.8K
150
114173
1365±12
2,094$0.10 / $0.401M
151
121173
1365±11
2,875$0.10 / $0.401M
152
122173
Alibaba · Proprietary
1364±10
3,306N/AN/A
153
114173
Alibaba · Apache 2.0
1364±14
1,720$0.50 / $116.4K
154
94179
Stepfun
StepFun · Apache 2.0
1364±31
353$0.57 / $1.4265.5K
155
127173
Anthropic
Anthropic · Proprietary
1362±10
3,357$3 / $15200K
156
130173
OpenAI · Proprietary
1362±8
7,499$1.10 / $4.40N/A
157
121174
Arcee AI · Apache 2.0
1361±14
1,813$0.15 / $0.45131K
158
102185
Z.ai · MIT
1357±34
276$0.60 / $1.8065.5K
159
103185
MiniMax · Apache 2.0
1357±33
318$0.26 / $1204.8K
160
135174
Google · Proprietary
1356±9
4,067$0.10 / $0.401M
161
113184
Ant Group · MIT
1355±27
461N/AN/A
162
135176
OpenAI · Proprietary
1355±11
2,693$0.40 / $1.601M
163
126179
Nvidia · NVIDIA Open Model
1354±19
987$0.06 / $0.24262.1K
164
133178
Alibaba · Apache 2.0
1353±14
1,708$0.09 / $0.45131.1K
165
142176
Anthropic
Anthropic · Proprietary
1350±7
10,019$3 / $15200K
166
140178
Mistral · Proprietary
1349±12
2,229$0.40 / $2131.1K
167
133184
Tencent
Tencent · Proprietary
1348±20
845N/AN/A
168
127194
OpenAI · Proprietary
1345±27
494$0.05 / $0.40400K
169
154179
Anthropic
Anthropic · Proprietary
1341±7
11,359$3 / $15200K
170
133200
Ant Group · MIT
1339±27
453N/AN/A
171
143191
Mistral · Apache 2.0
1339±18
1,042$0.10 / $0.3032K
172
156182
Google · Proprietary
1338±7
7,610$3.50 / $10.502.1M
173
141198
OpenAI · Apache 2.0
1336±22
680$0.03 / $0.14131.1K
174
144198
Amazon · Proprietary
1335±20
825$0.30 / $2.501M
175
162197
1326±10
2,814$0.07 / $0.301M
176
159209
Alibaba · Proprietary
1324±19
732$0.40 / $1.20131.1K
177
165200
Google · Gemma
1322±9
3,579$0.08 / $0.16131.1K
178
168201
Meta
Meta · Llama 3.1 Community
1319±8
8,482$4 / $432.8K
179
166207
1319±11
2,839$0.63 / $1.80131.1K
180
156225
Google · Gemma
1317±27
389$0.04 / $0.13131.1K
181
170207
Meta
Meta · Llama 3.1 Community
1315±8
5,215$4 / $432.8K
182
144237
IBM · Apache 2.0
1315±40
218$0.05 / $0.10131.1K
183
165222
Stepfun
StepFun · Proprietary
1313±20
642N/AN/A
184
170213
NexusFlow · NexusFlow
1312±9
3,412N/AN/A
185
171208
Anthropic
Anthropic · Proprietary
1312±6
25,769$15 / $75200K
186
159233
Ai2 · Apache 2.0
1311±32
314$0.15 / $0.5065.5K
187
170214
DeepSeek · DeepSeek
1311±11
2,721$1.14 / $4.56N/A
188
171214
Cohere
Cohere · CC-BY-NC-4.0
1309±9
3,991$2.50 / $10256K
189
170219
1309±13
1,944$0.40 / $0.708.2K
190
172214
OpenAI · Proprietary
1308±8
6,826$2.50 / $10128K
191
166228
Ai2 · Apache 2.0
1307±23
696$0.20 / $0.6065.5K
192
173219
01.AI
01 AI · Proprietary
1306±10
3,921N/AN/A
193
177214
OpenAI · Proprietary
1305±7
15,103$5 / $15128K
194
175220
Google · Proprietary
1305±10
6,395N/AN/A
195
171224
Alibaba · Proprietary
1305±14
1,404N/AN/A
196
178219
OpenAI · Proprietary
1303±8
13,306$10 / $30128K
197
165240
Tencent
Tencent · Proprietary
1301±31
238N/AN/A
198
170236
Stepfun
StepFun · Proprietary
1300±24
564N/AN/A
199
180224
OpenAI · Proprietary
1299±8
12,374$10 / $30128K
200
172235
Z.ai · Proprietary
1298±19
721N/AN/A
201
181226
Google · Proprietary
1297±8
10,492$3.50 / $10.502.1M
202
170240
Ai2 · Apache 2.0
1297±26
473$0.15 / $0.5065.5K
203
182226
Alibaba · Qwen
1296±8
5,415$1.20 / $1.20N/A
204
182226
OpenAI · Proprietary
1296±8
13,217$10 / $30128K
205
182226
Meta
Meta · Llama-3.3
1296±8
5,779$0.10 / $0.32131.1K
206
183227
xAI · Proprietary
1294±7
8,950$2 / $10131.1K
207
172240
Tencent
Tencent · Proprietary
1293±24
497N/AN/A
208
178236
DeepSeek · DeepSeek
1293±17
1,031N/AN/A
209
182233
Alibaba · Qwen
1291±12
2,249$1.60 / $6.4032.8K
210
175240
Tencent
Tencent · Proprietary
1290±24
499N/AN/A
211
187233
Google · Proprietary
1288±9
4,789$0.07 / $0.301M
212
187233
Mistral · Mistral Research
1288±8
6,664$2 / $6131.1K
213
187235
DeepSeek · DeepSeek
1288±10
3,649N/AN/A
214
187235
Z.ai · Proprietary
1287±10
3,599$0.44 / $1.76204.8K
215
191235
Anthropic
Anthropic · Proprietary
1285±7
6,365$0.80 / $4200K
216
178247
Mistral · Proprietary
1285±26
553$2 / $540K
217
192239
OpenAI · Proprietary
1283±10
7,052$30 / $608.2K
218
192239
Mistral · MRL
1282±9
3,574$2 / $6131.1K
219
178251
Tencent
Tencent · Proprietary
1280±30
351N/AN/A
220
178251
Tencent
Tencent · Proprietary
1279±31
243N/AN/A
221
178251
IBM · Apache 2.0
1279±32
358N/AN/A
222
189244
1278±17
1,041$1.20 / $1.20131.1K
223
194242
1277±13
2,131$0.10 / $0.3032K
224
201240
OpenAI · Proprietary
1276±7
9,322$0.15 / $0.60128K
225
201240
OpenAI · Proprietary
1275±8
11,181$30 / $608.2K
226
187251
OpenAI · Proprietary
1274±23
582$0.10 / $0.401M
227
201243
Alibaba · Qianwen LICENSE
1273±9
4,835$0.90 / $0.9032.8K
228
201240
xAI · Proprietary
1272±8
7,261$2 / $10131.1K
229
200247
DeepSeek · DeepSeek License
1271±13
1,858$0.14 / $0.28128K
230
191251
1271±22
507N/AN/A
231
195251
Alibaba · Apache 2.0
1270±19
725$0.87 / $0.8732K
232
205247
Amazon · Proprietary
1269±10
2,978$0.80 / $3.20300K
233
209244
Meta
Meta · Llama 3.1 Community
1269±8
7,677$0.40 / $0.40131.1K
234
211249
Microsoft · MIT
1265±10
2,764$0.07 / $0.1416.4K
235
199254
Ai2 · Llama 3.1
1263±25
397N/AN/A
236
212251
Mistral · Apache 2.0
1261±13
1,683$0.05 / $0.0832.8K
237
214251
NexusFlow · CC-BY-NC-4.0
1261±10
2,921N/AN/A
238
212252
Google · Gemma
1260±15
1,573$0.06 / $0.1232.8K
239
221251
Meta
Meta · Llama 3 Community
1257±7
20,941$0.51 / $0.748.2K
240
221251
Google · Proprietary
1257±8
8,392$0.07 / $0.301M
241
201264
Google · Gemma
1254±28
423$0.04 / $0.08131.1K
242
225252
Anthropic
Anthropic · Proprietary
1253±8
13,766$3 / $15200K
243
222254
Nvidia · NVIDIA Open Model
1252±12
2,352N/AN/A
244
205268
Tencent
Tencent · Proprietary
1250±29
361N/AN/A
245
223263
Z.ai · Proprietary
1247±16
1,191N/AN/A
246
225262
Reka AI · Proprietary
1245±14
1,207N/AN/A
247
229255
Google · Gemma license
1245±7
10,170$0.65 / $0.658.2K
248
225263
AI21 Labs · Jamba Open
1245±15
1,147$2 / $8256K
249
228259
Amazon · Proprietary
1244±11
2,511$0.06 / $0.24300K
250
229258
Mistral · Proprietary
1244±9
7,987$4 / $1232K
251
241268
Cohere
Cohere · CC-BY-NC-4.0
1232±10
3,854N/AN/A
252
239275
Reka AI · Proprietary
1232±14
1,284N/AN/A
253
243268
Anthropic
Anthropic · Proprietary
1231±7
14,983$0.25 / $1.25200K
254
241277
Cohere
Cohere · CC-BY-NC-4.0
1230±14
1,467$2.50 / $10128K
255
244270
Google · Proprietary
1229±8
5,036$0.07 / $0.301M
256
244272
Mistral · Apache 2.0
1228±9
6,778$0.90 / $0.9065.5K
257
228289
Ai2 · Apache-2.0
1227±28
375$0.05 / $0.20128K
258
245279
Amazon · Proprietary
1224±11
2,455$0.04 / $0.14128K
259
246281
Alibaba · Qianwen LICENSE
1221±11
3,188N/AN/A
260
247281
Mistral · Proprietary
1220±11
4,406$2.70 / $8.1032K
261
250281
Google · Gemma license
1217±8
7,110$0.03 / $0.098.2K
262
249286
Microsoft · MIT
1215±11
3,238$0.17 / $0.68N/A
263
246290
Mistral · MRL
1213±20
683$0.10 / $0.10131.1K
264
244290
Alibaba · Apache 2.0
1213±24
480$0.50 / $116.4K
265
250286
01.AI
01 AI · Apache-2.0
1213±11
2,985N/AN/A
266
253286
Cohere
Cohere · CC-BY-NC-4.0
1213±8
9,769$2.50 / $10128K
267
250289
Reka AI · Proprietary
1211±14
2,028N/AN/A
268
255289
Alibaba · Qianwen LICENSE
1208±10
5,327N/AN/A
269
253290
InternLM · Other
1207±15
1,387$0 / $032.8K
270
246293
Ai2 · Llama 3.1
1207±26
363N/AN/A
271
254290
Cohere
Cohere · CC-BY-NC-4.0
1205±14
1,601$0.15 / $0.60128K
272
254290
Princeton · MIT
1205±15
1,285$0.03 / $0.098.2K
273
256290
OpenAI · Proprietary
1202±15
2,134$1 / $216.4K
274
258290
Alibaba · Qianwen LICENSE
1200±12
2,649N/AN/A
275
257291
Cohere
Cohere · CC-BY-NC-4.0
1200±15
1,307N/AN/A
276
261290
OpenAI · Proprietary
1199±8
8,626$0.50 / $1.5016.4K
277
258290
Reka AI · Proprietary
1198±11
3,363N/AN/A
278
256295
Google · Proprietary
1198±19
993$0.35 / $1.0532.8K
279
250301
IBM · Apache 2.0
1197±26
391N/AN/A
280
257296
IBM · Apache 2.0
1196±19
873N/AN/A
281
255297
HuggingFace · Apache 2.0
1196±22
589N/AN/A
282
261293
Databricks · DBRX LICENSE
1195±11
4,001$0.60 / $0.6032.8K
283
261293
Google · Proprietary
1195±14
2,274$0.35 / $1.0532.8K
284
261295
1193±14
1,568$0.13 / $0.524.1K
285
261294
Microsoft · MIT
1193±13
2,092$0.15 / $0.60N/A
286
264293
Meta
Meta · Llama 3 Community
1192±8
14,252$0.04 / $0.048.2K
287
264293
Mistral · Apache 2.0
1191±9
9,663$0.63 / $0.6332K
288
255306
IBM · Apache 2.0
1190±28
382N/AN/A
289
267293
Meta
Meta · Llama 3.1 Community
1189±8
7,135$0.02 / $0.05131.1K
290
264303
AI21 Labs · Jamba Open
1186±16
1,094$0.20 / $0.40256K
291
277303
Cohere
Cohere · CC-BY-NC-4.0
1175±9
6,682$0.15 / $0.60128K
292
276308
IBM · Apache 2.0
1168±19
908N/AN/A
293
284307
Alibaba · Qianwen LICENSE
1167±13
2,184$0.30 / $0.30N/A
294
283308
Meta
Meta · Llama 3.2
1165±16
1,136$0.05 / $0.34131.1K
295
289307
Google · Gemma license
1162±8
6,599N/AN/A
296
288308
Snowflake · Apache 2.0
1162±11
4,793N/AN/A
297
289309
Google · Gemma license
1159±11
3,039$0.03 / $0.098.2K
298
288310
Nexusflow · Apache-2.0
1158±14
1,973N/AN/A
299
288310
OpenChat · Apache-2.0
1158±14
1,726N/AN/A
300
287316
Microsoft · Llama 2 Community
1157±19
903N/AN/A
301
286318
DeepSeek · DeepSeek License
1155±23
576N/AN/A
302
277324
HuggingFace · Apache 2.0
1152±33
271N/AN/A
303
288318
NousResearch · Apache-2.0
1151±20
697$0.17 / $0.17N/A
304
291316
01.AI
01 AI · Yi License
1151±13
2,043$0.90 / $0.904.1K
305
291316
Microsoft · MIT
1150±12
2,564$0.13 / $0.52N/A
306
291322
AllenAI/UW · AI2 ImpACT Low-risk
1145±19
888N/AN/A
307
294322
Microsoft · MIT
1139±13
2,813$0.13 / $0.52N/A
308
298322
Meta
Meta · Llama 2 Community
1136±10
4,740$0.70 / $2.804.1K
309
300325
Mistral · Apache-2.0
1127±12
2,605$0.20 / $0.2032.8K
310
300326
UC Berkeley · CC-BY-NC-4.0
1126±16
1,300N/AN/A
311
297328
Alibaba · Qianwen LICENSE
1125±24
534N/AN/A
312
292332
Cognitive Computations · Apache-2.0
1124±32
219$0.50 / $0.5016.4K
313
300326
OpenChat · Apache-2.0
1124±18
945$0.20 / $0.20N/A
314
300326
Meta
Meta · Llama 3.2
1124±16
1,162$0.03 / $0.20131.1K
315
300329
Alibaba · Qianwen LICENSE
1120±20
690$0.20 / $0.20N/A
316
303328
Google · Gemma license
1117±16
1,120$0.05 / $0.088.2K
317
305328
LMSYS · Non-commercial
1115±13
2,663$0 / $02K
318
303332
Google · Proprietary
1114±19
901$0.50 / $0.5025.8K
319
300333
Nvidia · Llama 2 Community
1114±27
440N/AN/A
320
308331
Meta
Meta · Llama 2 Community
1110±13
2,218$0.25 / $0.254.1K
321
305333
Upstage AI · CC-BY-NC-4.0
1109±22
604$0.30 / $0.30N/A
322
305333
Meta
Meta · Llama 2 Community
1108±19
770$0.35 / $1.4016.4K
323
308333
Google · Gemma license
1106±16
1,355N/AN/A
324
305337
MosaicML · CC-BY-NC-SA-4.0
1095±34
242N/AN/A
325
309335
NousResearch · Apache-2.0
1093±21
628$0.90 / $0.90N/A
326
317335
Meta
Meta · Llama 2 Community
1086±14
1,656$0.15 / $0.154.1K
327
313336
Alibaba · Qianwen LICENSE
1085±18
988$0.10 / $0.10N/A
328
313337
Together AI · Apache 2.0
1084±20
676$0.20 / $0.20N/A
329
318335
LMSYS · Llama 2 Community
1082±14
2,146$0.30 / $0.30N/A
330
317336
HuggingFace · MIT
1082±17
1,250$0.15 / $0.1516.4K
331
316337
Mistral · Apache 2.0
1081±19
974$0.07 / $0.284.1K
332
310337
UW · Non-commercial
1080±32
280N/AN/A
333
320337
Google · Gemma license
1069±22
597$0.10 / $0.10N/A
334
324337
Microsoft · Llama 2 Community
1064±21
669$0.30 / $0.30N/A
335
324337
Ai2 · Apache-2.0
1054±19
848$0.20 / $0.20N/A
336
327338
LMSYS · Llama 2 Community
1047±22
658$0.20 / $0.20N/A
337
329338
Tsinghua · Apache-2.0
1041±23
576N/AN/A
338
336346
Nomic AI · Non-commercial
997±37
211N/AN/A
339
338346
Stanford · Non-commercial
990±23
652N/AN/A
340
338346
MosaicML · CC-BY-NC-SA-4.0
984±25
471N/AN/A
341
338346
RWKV · Apache 2.0
982±24
544N/AN/A
342
338346
UC Berkeley · Non-commercial
979±21
751N/AN/A
343
338347
Tsinghua · Non-commercial
976±25
525N/AN/A
344
338348
Tsinghua · Apache-2.0
971±35
227N/AN/A
345
338348
OpenAssistant · Apache 2.0
959±22
687N/AN/A
346
338348
Databricks · MIT
949±29
370N/AN/A
347
344349
LMSYS · Apache 2.0
919±26
462N/AN/A
348
343349
Meta
Meta · Non-commercial
918±33
252$0.23 / $0.23N/A
349
347349
Stability
Stability AI · CC-BY-NC-SA-4.0
890±29
353N/AN/A

Default Leaderboard Plots

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)