Text Arena 🧮 Math

View overall rankings of AI models on text-to-text tasks spanning math, coding, creative writing, and other open-ended domains.

Apr 14, 2026
559,789 votes
329 models
| Rank | Rank Spread | Organization · License | Score | Votes | Price | Context |
|---|---|---|---|---|---|---|
| 1 | 1-6 | Anthropic · Proprietary | 1518±18 | 1,031 | $5 / $25 | 1M |
| 2 | 1-11 | OpenAI · Proprietary | 1517±22 | 669 | $2.50 / $15 | 1.1M |
| 3 | 1-15 | Google · Proprietary | 1504±16 | 1,360 | $2 / $12 | 1M |
| 4 | 1-15 | Anthropic · Proprietary | 1502±17 | 1,173 | $5 / $25 | 1M |
| 5 | 2-25 | Moonshot · Modified MIT | 1483±15 | 1,427 | $0.60 / $3 | N/A |
| 6 | 3-25 | Google · Proprietary | 1479±12 | 2,681 | $2 / $12 | 1M |
| 7 | 1-50 | Meta · Proprietary | 1478±32 | 289 | N/A | N/A |
| 8 | 3-30 | Google · Proprietary | 1477±13 | 2,038 | $0.50 / $3 | 1M |
| 9 | 1-43 | Alibaba · Proprietary | 1476±25 | 544 | N/A | N/A |
| 10 | 2-41 |  | 1474±22 | 691 | $2 / $6 | 2M |
| 11 | 5-33 | Anthropic | 1470±12 | 2,295 | $5 / $25 | 200K |
| 12 | 2-53 | Google · Apache 2.0 | 1469±28 | 372 | N/A | N/A |
| 13 | 3-48 | Anthropic · Proprietary | 1468±21 | 767 | $3 / $15 | 1M |
| 14 | 2-55 | Google · Apache 2.0 | 1468±28 | 400 | $0.14 / $0.40 | 262.1K |
| 15 | 2-57 | Z.ai · MIT | 1468±29 | 385 | $0.95 / $3.15 | 202.8K |
| 16 | 5-34 | Anthropic · Proprietary | 1467±11 | 3,013 | $5 / $25 | 200K |
| 17 | 3-51 |  | 1466±22 | 684 | $2 / $6 | 2M |
| 18 | 5-52 | OpenAI · Proprietary | 1461±18 | 1,047 | $1.75 / $14 | 128K |
| 19 | 5-49 | OpenAI · Proprietary | 1460±13 | 2,000 | $1.75 / $14 | 400K |
| 20 | 5-50 |  | 1459±13 | 2,158 | $0.50 / $3 | 1M |
| 21 | 5-69 | OpenAI · Proprietary | 1456±22 | 666 | $2.50 / $15 | 1.1M |
| 22 | 7-52 | OpenAI · Proprietary | 1454±12 | 2,520 | $1.25 / $10 | 400K |
| 23 | 8-51 | Anthropic | 1454±10 | 3,710 | $3 / $15 | 200K |
| 24 | 5-72 | xAI · Proprietary | 1453±22 | 687 | N/A | N/A |
| 25 | 5-65 | Bytedance · Proprietary | 1452±17 | 1,240 | N/A | N/A |
| 26 | 5-80 | Xiaomi · Proprietary | 1451±22 | 671 | $1 / $3 | 1M |
| 27 | 7-73 | Alibaba · Apache 2.0 | 1448±17 | 1,106 | $0.39 / $2.34 | 262.1K |
| 28 | 8-72 | Baidu · Proprietary | 1447±15 | 1,463 | N/A | N/A |
| 29 | 9-58 | OpenAI · Proprietary | 1447±10 | 3,752 | $2 / $8 | 200K |
| 30 | 7-83 | Z.ai · MIT | 1447±19 | 894 | $1 / $3.20 | 202.8K |
| 31 | 12-61 | Google · Proprietary | 1444±8 | 6,736 | $1.25 / $10 | 1M |
| 32 | 10-72 | Anthropic | 1443±11 | 3,049 | $15 / $75 | 200K |
| 33 | 10-73 | xAI · Proprietary | 1442±11 | 2,871 | N/A | N/A |
| 34 | 7-92 | Moonshot · Modified MIT | 1441±25 | 529 | $0.38 / $1.72 | 262.1K |
| 35 | 10-85 | Alibaba · Proprietary | 1440±15 | 1,534 | $0.78 / $3.90 | 262.1K |
| 36 | 7-102 | Meituan · Proprietary | 1439±26 | 442 | N/A | N/A |
| 37 | 12-89 | OpenAI · Proprietary | 1437±14 | 1,755 | $1.75 / $14 | 400K |
| 38 | 13-85 | Moonshot · Modified MIT | 1436±11 | 2,900 | $1.15 / $8 | 262.1K |
| 39 | 10-98 | Alibaba · Apache 2.0 | 1434±20 | 827 | $0.20 / $1.56 | 262.1K |
| 40 | 14-90 | OpenAI · Proprietary | 1433±14 | 1,902 | $1.25 / $10 | 400K |
| 41 | 8-109 | OpenAI · Proprietary | 1433±26 | 495 | $2.50 / $15 | 1.1M |
| 42 | 11-101 | OpenAI · Proprietary | 1433±19 | 994 | $1.75 / $14 | 128K |
| 43 | 12-97 | Google · Proprietary | 1433±17 | 1,080 | $0.25 / $1.50 | 1M |
| 44 | 5-123 |  | 1432±39 | 208 | N/A | N/A |
| 45 | 20-88 | Anthropic · Proprietary | 1432±9 | 4,766 | $15 / $75 | 200K |
| 46 | 10-109 | Alibaba · Proprietary | 1431±24 | 588 | $0.78 / $3.90 | 262.1K |
| 47 | 12-107 | Z.ai · MIT | 1430±21 | 711 | $0.39 / $1.75 | 202.8K |
| 48 | 10-113 |  | 1429±26 | 486 | $0.27 / $0.41 | 163.8K |
| 49 | 22-92 | DeepSeek · MIT | 1428±12 | 2,698 | $0.26 / $0.38 | 163.8K |
| 50 | 22-97 | xAI · Proprietary | 1428±12 | 2,294 | $3 / $15 | 256K |
| 51 | 24-96 | xAI · Proprietary | 1426±10 | 3,413 | N/A | N/A |
| 52 | 16-109 | Alibaba · Apache 2.0 | 1426±19 | 869 | $0.26 / $2.08 | 262.1K |
| 53 | 10-117 | OpenAI · Proprietary | 1426±27 | 437 | $2.50 / $15 | 1.1M |
| 54 | 24-105 | DeepSeek · MIT | 1424±13 | 2,223 | $0.26 / $0.38 | 163.8K |
| 55 | 11-120 | xAI · Proprietary | 1423±29 | 399 | $3 / $15 | 256K |
| 56 | 27-106 | OpenAI · Proprietary | 1422±11 | 2,905 | $1.25 / $10 | 400K |
| 57 | 26-107 | Z.ai · MIT | 1421±13 | 2,139 | $0.39 / $1.90 | 204.8K |
| 58 | 32-106 | Anthropic · Proprietary | 1421±10 | 3,714 | $3 / $15 | 200K |
| 59 | 32-103 | Alibaba · Apache 2.0 | 1421±8 | 5,082 | $0.26 / $1.06 | N/A |
| 60 | 24-112 | Alibaba · Apache 2.0 | 1420±17 | 1,227 | $0.09 / $1.10 | 262.1K |
| 61 | 30-107 | xAI · Proprietary | 1420±12 | 2,731 | $0.20 / $0.50 | 2M |
| 62 | 25-112 | MiniMax · Modified MIT | 1420±17 | 1,169 | $0.12 / $0.99 | 196.6K |
| 63 | 21-118 | Meituan · MIT | 1419±22 | 692 | $0.20 / $0.80 | 131.1K |
| 64 | 32-109 | Anthropic · Proprietary | 1418±12 | 2,272 | $15 / $75 | 200K |
| 65 | 27-118 | DeepSeek · MIT | 1415±18 | 999 | $1.23 / $4.94 | N/A |
| 66 | 25-120 | Moonshot · Modified MIT | 1415±21 | 767 | $0.60 / $2.50 | 262.1K |
| 67 | 32-115 | Z.ai · MIT | 1415±15 | 1,431 | $0.60 / $2.20 | 131.1K |
| 68 | 36-112 | OpenAI · Proprietary | 1414±11 | 2,961 | $1.10 / $4.40 | 200K |
| 69 | 25-124 | DeepSeek · MIT | 1414±22 | 669 | $1.23 / $4.94 | N/A |
| 70 | 33-115 | OpenAI · Proprietary | 1414±14 | 1,809 | $1.25 / $10 | 128K |
| 71 | 26-121 | DeepSeek · MIT | 1414±21 | 780 | $0.27 / $0.41 | 163.8K |
| 72 | 34-115 |  | 1413±13 | 1,957 | $0.30 / $2.50 | 1M |
| 73 | 26-126 | Alibaba · Apache 2.0 | 1412±23 | 707 | $0.20 / $0.88 | 262.1K |
| 74 | 36-119 | DeepSeek · MIT | 1410±14 | 1,606 | $0.70 / $2.50 | 64K |
| 75 | 18-137 |  | 1410±33 | 266 | N/A | N/A |
| 76 | 33-123 | xAI · Proprietary | 1410±18 | 1,097 | $0.20 / $0.50 | 2M |
| 77 | 12-141 |  | 1410±40 | 201 | $0.21 / $0.79 | 163.8K |
| 78 | 36-120 | OpenAI · Proprietary | 1409±15 | 1,393 | $75 / $150 | 128K |
| 79 | 41-113 | Google · Proprietary | 1409±7 | 6,980 | $0.30 / $2.50 | 1M |
| 80 | 33-126 | Alibaba · Proprietary | 1409±19 | 913 | N/A | N/A |
| 81 | 39-117 | OpenAI · Proprietary | 1408±11 | 2,986 | $15 / $60 | 200K |
| 82 | 25-135 | Alibaba · Apache 2.0 | 1408±28 | 435 | $0.26 / $2.60 | 131.1K |
| 83 | 32-130 | Baidu · Proprietary | 1408±23 | 627 | N/A | N/A |
| 84 | 38-126 | OpenAI · Proprietary | 1406±15 | 1,474 | $0.25 / $2 | 400K |
| 85 | 39-123 | OpenAI · Proprietary | 1406±13 | 1,909 | $1.10 / $4.40 | 200K |
| 86 | 48-119 | OpenAI · Proprietary | 1404±8 | 5,779 | $5 / $15 | 128K |
| 87 | 45-126 | Mistral · Apache 2.0 | 1402±12 | 2,539 | $0.50 / $1.50 | N/A |
| 88 | 21-148 | Tencent · Proprietary | 1402±38 | 236 | N/A | N/A |
| 89 | 45-127 | Anthropic | 1402±13 | 2,049 | $3 / $15 | 1M |
| 90 | 48-126 | Anthropic · Proprietary | 1401±11 | 2,797 | $15 / $75 | 200K |
| 91 | 23-148 |  | 1401±37 | 235 | N/A | N/A |
| 92 | 34-139 | MiniMax · Modified MIT | 1400±25 | 493 | $0.30 / $1.20 | 196.6K |
| 93 | 45-129 |  | 1400±15 | 1,613 | N/A | N/A |
| 94 | 27-146 | Baidu · Proprietary | 1399±34 | 270 | N/A | N/A |
| 95 | 37-139 | Alibaba · Apache 2.0 | 1399±24 | 493 | $0.15 / $1.50 | 131.1K |
| 96 | 32-144 | Alibaba · Apache 2.0 | 1399±30 | 316 | $0.08 / $0.24 | 41K |
| 97 | 41-135 | Alibaba · Apache 2.0 | 1397±19 | 852 | $0.16 / $1.30 | 262.1K |
| 98 | 58-127 | Mistral · Proprietary | 1397±9 | 4,920 | $2.70 / $8.10 | 32K |
| 99 | 41-136 | DeepSeek · MIT | 1397±20 | 876 | $0.50 / $2.15 | 163.8K |
| 100 | 46-135 | StepFun · Apache 2.0 | 1396±16 | 1,250 | $0.10 / $0.30 | 262.1K |
| 101 | 26-150 | DeepSeek · MIT | 1396±39 | 219 | $0.21 / $0.79 | 163.8K |
| 102 | 42-138 |  | 1396±20 | 809 | N/A | N/A |
| 103 | 58-132 | Alibaba · Apache 2.0 | 1394±12 | 2,416 | $0.46 / $1.82 | 131.1K |
| 104 | 47-138 | MiniMax · MIT | 1394±18 | 1,025 | $0.29 / $0.95 | 196.6K |
| 105 | 54-135 | Alibaba · Apache 2.0 | 1394±14 | 1,617 | $0.46 / $1.82 | 131.1K |
| 106 | 49-140 | Microsoft AI · Proprietary | 1392±19 | 896 | N/A | N/A |
| 107 | 63-132 | Anthropic · Proprietary | 1392±10 | 3,717 | $1 / $5 | 200K |
| 108 | 32-153 |  | 1391±39 | 195 | $0.10 / $0.40 | 131.1K |
| 109 | 51-140 | Alibaba · Apache 2.0 | 1390±19 | 838 | $0.10 / $0.78 | 131.1K |
| 110 | 58-138 | Z.ai · MIT | 1390±15 | 1,556 | $0.13 / $0.85 | 131.1K |
| 111 | 54-140 | xAI · Proprietary | 1389±18 | 994 | $0.30 / $0.50 | 131.1K |
| 112 | 63-140 | Moonshot · Modified MIT | 1388±14 | 1,727 | $0.60 / $2.50 | 131.1K |
| 113 | 66-138 | Anthropic · Proprietary | 1387±12 | 2,505 | $3 / $15 | 1M |
| 114 | 72-138 | OpenAI · Proprietary | 1385±10 | 4,569 | $15 / $60 | N/A |
| 115 | 44-153 | Prime Intellect · MIT | 1384±31 | 332 | $0.20 / $1.10 | 131.1K |
| 116 | 67-140 | OpenAI · Apache 2.0 | 1384±14 | 1,808 | $0.04 / $0.19 | 131.1K |
| 117 | 75-140 | Anthropic | 1383±11 | 2,807 | $3 / $15 | 200K |
| 118 | 80-140 | OpenAI · Proprietary | 1382±8 | 4,738 | $1.10 / $4.40 | 200K |
| 119 | 70-145 | Alibaba · Apache 2.0 | 1382±15 | 1,437 | $0.09 / $0.30 | 262.1K |
| 120 | 41-158 | Nvidia · Nvidia Open Model | 1379±37 | 209 | $0.60 / $1.80 | 131.1K |
| 121 | 76-148 | Alibaba · Apache 2.0 | 1378±15 | 1,648 | $0.40 / $1.60 | 262.1K |
| 122 | 61-153 | Nvidia · NVIDIA Open Model | 1377±25 | 513 | N/A | N/A |
| 123 | 66-151 |  | 1377±22 | 648 | $0.09 / $0.29 | 262.1K |
| 124 | 85-148 |  | 1376±13 | 1,923 | $0.09 / $0.29 | 262.1K |
| 125 | 87-148 | xAI · Proprietary | 1376±11 | 2,696 | $3 / $15 | 131.1K |
| 126 | 89-148 | OpenAI · Proprietary | 1374±10 | 3,270 | $2 / $8 | 1M |
| 127 | 88-150 | MiniMax · Apache 2.0 | 1371±13 | 1,812 | $0.40 / $2.20 | 1M |
| 128 | 91-149 | DeepSeek · MIT | 1370±10 | 3,213 | $3 / $4.50 | 32.8K |
| 129 | 89-151 | xAI · Proprietary | 1369±14 | 1,557 | $0.30 / $0.50 | 131.1K |
| 130 | 87-154 | Z.ai · MIT | 1366±21 | 724 | $0.06 / $0.40 | 202.8K |
| 131 | 97-153 |  | 1365±11 | 2,931 | $0.10 / $0.40 | 1M |
| 132 | 96-153 |  | 1365±12 | 2,125 | $0.10 / $0.40 | 1M |
| 133 | 104-152 | Alibaba · Proprietary | 1364±10 | 3,316 | N/A | N/A |
| 134 | 95-153 | Alibaba · Apache 2.0 | 1364±14 | 1,726 | $0.15 / $0.58 | 131.1K |
| 135 | 76-162 | StepFun · Apache 2.0 | 1362±31 | 354 | $0.57 / $1.42 | 65.5K |
| 136 | 112-153 | OpenAI · Proprietary | 1362±8 | 7,499 | $1.10 / $4.40 | N/A |
| 137 | 91-158 | Arcee AI · Apache 2.0 | 1361±19 | 936 | N/A | N/A |
| 138 | 111-153 | Anthropic · Proprietary | 1360±10 | 3,381 | $3 / $15 | 200K |
| 139 | 79-164 | MiniMax · Apache 2.0 | 1359±33 | 322 | $0.26 / $1 | 196.6K |
| 140 | 80-165 | Z.ai · MIT | 1358±33 | 279 | $0.60 / $1.80 | 65.5K |
| 141 | 102-158 | Nvidia · NVIDIA Open Model | 1356±19 | 1,001 | $0.06 / $0.24 | 262.1K |
| 142 | 115-153 | Google · Proprietary | 1356±9 | 4,085 | $0.10 / $0.40 | 1M |
| 143 | 115-157 | OpenAI · Proprietary | 1354±11 | 2,714 | $0.40 / $1.60 | 1M |
| 144 | 91-164 | Ant Group · MIT | 1354±27 | 465 | N/A | N/A |
| 145 | 114-158 | Alibaba · Apache 2.0 | 1353±14 | 1,730 | $0.08 / $0.28 | 41K |
| 146 | 121-158 | Mistral · Proprietary | 1350±12 | 2,257 | $0.40 / $2 | 131.1K |
| 147 | 126-158 | Anthropic · Proprietary | 1348±7 | 10,044 | $3 / $15 | 200K |
| 148 | 113-164 | Tencent · Proprietary | 1348±19 | 853 | N/A | N/A |
| 149 | 112-176 | Ant Group · MIT | 1343±27 | 457 | N/A | N/A |
| 150 | 112-176 | OpenAI · Proprietary | 1342±27 | 497 | $0.05 / $0.40 | 400K |
| 151 | 136-159 | Anthropic · Proprietary | 1340±7 | 11,359 | $3 / $15 | 200K |
| 152 | 124-170 | Mistral · Apache 2.0 | 1338±17 | 1,059 | $0.10 / $0.30 | 32K |
| 153 | 136-162 | Google · Proprietary | 1338±7 | 7,610 | $3.50 / $10.50 | 2.1M |
| 154 | 122-178 | OpenAI · Apache 2.0 | 1336±22 | 685 | $0.03 / $0.14 | 131.1K |
| 155 | 127-178 | Amazon · Proprietary | 1334±20 | 836 | $0.30 / $2.50 | 1M |
| 156 | 144-178 |  | 1325±10 | 2,814 | $0.07 / $0.30 | 1M |
| 157 | 137-188 | Alibaba · Proprietary | 1324±19 | 732 | $0.40 / $1.20 | 131.1K |
| 158 | 145-180 | Google · Gemma | 1322±9 | 3,615 | $0.08 / $0.16 | 131.1K |
| 159 | 147-187 |  | 1319±11 | 2,861 | $0.63 / $1.80 | 131.1K |
| 160 | 150-183 | Meta · Llama 3.1 Community | 1318±7 | 8,482 | $4 / $4 | 32.8K |
| 161 | 137-205 | Google · Gemma | 1317±27 | 389 | $0.04 / $0.13 | 131.1K |
| 162 | 151-187 | Meta · Llama 3.1 Community | 1314±8 | 5,215 | $4 / $4 | 32.8K |
| 163 | 145-200 | StepFun · Proprietary | 1312±20 | 642 | N/A | N/A |
| 164 | 151-192 | NexusFlow · NexusFlow | 1312±9 | 3,412 | N/A | N/A |
| 165 | 138-212 | Ai2 · Apache 2.0 | 1312±31 | 322 | $0.15 / $0.50 | 65.5K |
| 166 | 151-193 | DeepSeek · DeepSeek | 1311±11 | 2,721 | $1.14 / $4.56 | N/A |
| 167 | 152-189 | Anthropic · Proprietary | 1310±6 | 25,769 | $15 / $75 | 200K |
| 168 | 151-197 |  | 1309±13 | 1,966 | $0.40 / $0.70 | 8.2K |
| 169 | 152-194 | Cohere · CC-BY-NC-4.0 | 1309±9 | 4,035 | $2.50 / $10 | 256K |
| 170 | 154-194 | OpenAI · Proprietary | 1308±8 | 6,826 | $2.50 / $10 | 128K |
| 171 | 147-207 | Ai2 · Apache 2.0 | 1306±23 | 706 | $0.20 / $0.60 | 65.5K |
| 172 | 154-198 | 01 AI · Proprietary | 1306±10 | 3,921 | N/A | N/A |
| 173 | 152-201 | Alibaba · Proprietary | 1304±14 | 1,404 | N/A | N/A |
| 174 | 158-194 | OpenAI · Proprietary | 1304±7 | 15,103 | $5 / $15 | 128K |
| 175 | 157-200 | Google · Proprietary | 1303±9 | 6,395 | N/A | N/A |
| 176 | 159-200 | OpenAI · Proprietary | 1302±8 | 13,306 | $10 / $30 | 128K |
| 177 | 145-219 | Tencent · Proprietary | 1301±31 | 238 | N/A | N/A |
| 178 | 151-215 | StepFun · Proprietary | 1300±23 | 571 | N/A | N/A |
| 179 | 161-203 | OpenAI · Proprietary | 1298±8 | 12,374 | $10 / $30 | 128K |
| 180 | 152-215 | Zhipu · Proprietary | 1297±19 | 721 | N/A | N/A |
| 181 | 162-206 | Alibaba · Qwen | 1296±8 | 5,415 | $1.20 / $1.20 | N/A |
| 182 | 163-206 | Google · Proprietary | 1296±8 | 10,492 | $3.50 / $10.50 | 2.1M |
| 183 | 163-206 | OpenAI · Proprietary | 1296±7 | 13,217 | $10 / $30 | 128K |
| 184 | 164-207 | Meta · Llama-3.3 | 1295±8 | 5,793 | $0.10 / $0.32 | 131.1K |
| 185 | 152-220 | Ai2 · Apache 2.0 | 1294±26 | 476 | $0.15 / $0.50 | 65.5K |
| 186 | 165-207 | xAI · Proprietary | 1293±7 | 8,950 | $2 / $10 | 131.1K |
| 187 | 152-220 | Tencent · Proprietary | 1293±24 | 497 | N/A | N/A |
| 188 | 159-216 | DeepSeek · DeepSeek | 1293±17 | 1,031 | N/A | N/A |
| 189 | 163-213 | Alibaba · Qwen | 1291±12 | 2,249 | $1.60 / $6.40 | 32.8K |
| 190 | 157-220 | Tencent · Proprietary | 1290±24 | 499 | N/A | N/A |
| 191 | 169-213 | Google · Proprietary | 1288±8 | 4,789 | $0.07 / $0.30 | 1M |
| 192 | 170-213 | Mistral · Mistral Research | 1287±8 | 6,664 | $2 / $6 | 131.1K |
| 193 | 168-215 | DeepSeek · DeepSeek | 1287±10 | 3,649 | N/A | N/A |
| 194 | 168-215 | Zhipu AI · Proprietary | 1287±10 | 3,599 | $0.44 / $1.76 | 204.8K |
| 195 | 158-224 | Mistral · Proprietary | 1286±25 | 557 | $2 / $5 | 40K |
| 196 | 174-216 | Anthropic · Proprietary | 1283±7 | 6,389 | $0.80 / $4 | 200K |
| 197 | 174-219 | Mistral · MRL | 1282±9 | 3,574 | $2 / $6 | 131.1K |
| 198 | 175-219 | OpenAI · Proprietary | 1281±10 | 7,052 | $30 / $60 | 8.2K |
| 199 | 158-231 | IBM · Apache 2.0 | 1280±31 | 363 | N/A | N/A |
| 200 | 159-231 | Tencent · Proprietary | 1280±30 | 352 | N/A | N/A |
| 201 | 159-231 | Tencent · Proprietary | 1279±31 | 243 | N/A | N/A |
| 202 | 170-223 |  | 1278±17 | 1,041 | $1.20 / $1.20 | 131.1K |
| 203 | 175-220 |  | 1277±13 | 2,152 | $0.10 / $0.30 | 32K |
| 204 | 182-220 | OpenAI · Proprietary | 1275±7 | 9,325 | $0.15 / $0.60 | 128K |
| 205 | 168-230 | OpenAI · Proprietary | 1274±23 | 582 | $0.10 / $0.40 | 1M |
| 206 | 182-220 | OpenAI · Proprietary | 1273±8 | 11,181 | $30 / $60 | 8.2K |
| 207 | 182-223 | Alibaba · Qianwen LICENSE | 1272±9 | 4,835 | $0.90 / $0.90 | 32.8K |
| 208 | 183-220 | xAI · Proprietary | 1272±7 | 7,261 | $2 / $10 | 131.1K |
| 209 | 182-227 | DeepSeek · DeepSeek License | 1270±13 | 1,858 | $0.14 / $0.28 | 128K |
| 210 | 173-231 |  | 1270±22 | 507 | N/A | N/A |
| 211 | 176-230 | Alibaba · Apache 2.0 | 1269±19 | 725 | $0.87 / $0.87 | 32K |
| 212 | 190-223 | Meta · Llama 3.1 Community | 1269±7 | 7,677 | $0.40 / $0.40 | 131.1K |
| 213 | 186-227 | Amazon · Proprietary | 1269±10 | 2,978 | $0.80 / $3.20 | 300K |
| 214 | 192-229 | Microsoft · MIT | 1264±10 | 2,764 | $0.07 / $0.14 | 16.4K |
| 215 | 179-234 | Ai2 · Llama 3.1 | 1263±25 | 397 | N/A | N/A |
| 216 | 192-231 | Mistral · Apache 2.0 | 1261±13 | 1,683 | $0.05 / $0.08 | 32.8K |
| 217 | 195-231 | NexusFlow · CC-BY-NC-4.0 | 1260±10 | 2,921 | N/A | N/A |
| 218 | 192-231 | Google · Gemma | 1260±15 | 1,587 | $0.06 / $0.12 | 32.8K |
| 219 | 202-231 | Meta · Llama 3 Community | 1256±7 | 20,941 | $0.51 / $0.74 | 8.2K |
| 220 | 202-231 | Google · Proprietary | 1256±8 | 8,392 | $0.07 / $0.30 | 1M |
| 221 | 182-243 | Google · Gemma | 1253±28 | 423 | $0.04 / $0.08 | 131.1K |
| 222 | 206-232 | Anthropic · Proprietary | 1252±8 | 13,766 | $3 / $15 | 200K |
| 223 | 202-234 | Nvidia · NVIDIA Open Model | 1251±12 | 2,352 | N/A | N/A |
| 224 | 186-248 | Tencent · Proprietary | 1250±29 | 361 | N/A | N/A |
| 225 | 205-242 | Zhipu AI · Proprietary | 1246±15 | 1,191 | N/A | N/A |
| 226 | 206-242 | Reka AI · Proprietary | 1245±14 | 1,207 | N/A | N/A |
| 227 | 208-239 | Amazon · Proprietary | 1244±11 | 2,511 | $0.06 / $0.24 | 300K |
| 228 | 206-243 | AI21 Labs · Jamba Open | 1244±15 | 1,147 | $2 / $8 | 256K |
| 229 | 211-235 | Google · Gemma license | 1243±7 | 10,170 | $0.65 / $0.65 | 8.2K |
| 230 | 209-239 | Mistral · Proprietary | 1243±9 | 7,987 | $4 / $12 | 32K |
| 231 | 221-248 | Cohere · CC-BY-NC-4.0 | 1231±10 | 3,854 | N/A | N/A |
| 232 | 220-254 | Reka AI · Proprietary | 1230±14 | 1,284 | N/A | N/A |
| 233 | 224-248 | Anthropic · Proprietary | 1229±7 | 14,983 | $0.25 / $1.25 | 200K |
| 234 | 221-256 | Cohere · CC-BY-NC-4.0 | 1229±14 | 1,467 | $2.50 / $10 | 128K |
| 235 | 224-249 | Google · Proprietary | 1228±8 | 5,036 | $0.07 / $0.30 | 1M |
| 236 | 224-251 | Mistral · Apache 2.0 | 1227±9 | 6,778 | $0.90 / $0.90 | 65.5K |
| 237 | 208-267 | Ai2 · Apache-2.0 | 1227±28 | 375 | $0.05 / $0.20 | 128K |
| 238 | 224-259 | Amazon · Proprietary | 1223±11 | 2,455 | $0.04 / $0.14 | 128K |
| 239 | 226-260 | Alibaba · Qianwen LICENSE | 1220±11 | 3,188 | N/A | N/A |
| 240 | 228-260 | Mistral · Proprietary | 1219±11 | 4,406 | $2.70 / $8.10 | 32K |
| 241 | 230-260 | Google · Gemma license | 1216±8 | 7,110 | $0.03 / $0.09 | 8.2K |
| 242 | 230-266 | Microsoft · MIT | 1214±10 | 3,238 | $0.17 / $0.68 | N/A |
| 243 | 223-270 | Alibaba · Apache 2.0 | 1213±24 | 480 | $0.15 / $0.58 | 131.1K |
| 244 | 226-270 | Mistral · MRL | 1213±20 | 683 | $0.10 / $0.10 | 131.1K |
| 245 | 230-266 | 01 AI · Apache-2.0 | 1212±11 | 2,985 | N/A | N/A |
| 246 | 234-266 | Cohere · CC-BY-NC-4.0 | 1211±8 | 9,769 | $2.50 / $10 | 128K |
| 247 | 230-270 | Reka AI · Proprietary | 1210±14 | 2,028 | N/A | N/A |
| 248 | 236-270 | Alibaba · Qianwen LICENSE | 1207±10 | 5,327 | N/A | N/A |
| 249 | 226-273 | Ai2 · Llama 3.1 | 1206±26 | 363 | N/A | N/A |
| 250 | 233-270 | InternLM · Other | 1206±15 | 1,387 | $0 / $0 | 32.8K |
| 251 | 235-270 | Cohere · CC-BY-NC-4.0 | 1204±14 | 1,601 | $0.15 / $0.60 | 128K |
| 252 | 234-270 | Princeton · MIT | 1203±15 | 1,285 | $0.03 / $0.09 | 8.2K |
| 253 | 236-270 | OpenAI · Proprietary | 1200±15 | 2,134 | $1 / $2 | 16.4K |
| 254 | 238-270 | Alibaba · Qianwen LICENSE | 1199±12 | 2,649 | N/A | N/A |
| 255 | 237-271 | Cohere · CC-BY-NC-4.0 | 1198±15 | 1,307 | N/A | N/A |
| 256 | 241-270 | OpenAI · Proprietary | 1197±8 | 8,626 | $0.50 / $1.50 | 16.4K |
| 257 | 241-270 | Reka AI · Proprietary | 1197±11 | 3,363 | N/A | N/A |
| 258 | 230-280 | IBM · Apache 2.0 | 1196±26 | 391 | N/A | N/A |
| 259 | 237-276 | Google · Proprietary | 1196±19 | 993 | $0.35 / $1.05 | 32.8K |
| 260 | 237-276 | IBM · Apache 2.0 | 1195±19 | 873 | N/A | N/A |
| 261 | 235-277 | HuggingFace · Apache 2.0 | 1194±22 | 589 | N/A | N/A |
| 262 | 241-273 | Databricks · DBRX LICENSE | 1194±11 | 4,001 | $0.60 / $0.60 | 32.8K |
| 263 | 241-275 | Google · Proprietary | 1192±14 | 2,274 | $0.35 / $1.05 | 32.8K |
| 264 | 241-275 |  | 1192±14 | 1,568 | $0.13 / $0.52 | 4.1K |
| 265 | 241-274 | Microsoft · MIT | 1192±13 | 2,092 | $0.15 / $0.60 | N/A |
| 266 | 245-272 | Meta · Llama 3 Community | 1191±8 | 14,252 | $0.03 / $0.04 | 8.2K |
| 267 | 245-273 | Meta · Llama 3.1 Community | 1189±8 | 7,135 | $0.02 / $0.05 | 16.4K |
| 268 | 245-273 | Mistral · Apache 2.0 | 1189±8 | 9,663 | $0.63 / $0.63 | 32K |
| 269 | 235-286 | IBM · Apache 2.0 | 1189±28 | 382 | N/A | N/A |
| 270 | 244-281 | AI21 Labs · Jamba Open | 1185±16 | 1,094 | $0.20 / $0.40 | 256K |
| 271 | 258-283 | Cohere · CC-BY-NC-4.0 | 1173±9 | 6,682 | $0.15 / $0.60 | 128K |
| 272 | 256-288 | IBM · Apache 2.0 | 1166±19 | 908 | N/A | N/A |
| 273 | 263-287 | Alibaba · Qianwen LICENSE | 1165±13 | 2,184 | $0.30 / $0.30 | N/A |
| 274 | 262-288 | Meta · Llama 3.2 | 1164±16 | 1,136 | $0.05 / $0.34 | 80K |
| 275 | 270-287 | Google · Gemma license | 1161±8 | 6,599 | N/A | N/A |
| 276 | 268-288 | Snowflake · Apache 2.0 | 1160±11 | 4,793 | N/A | N/A |
| 277 | 270-289 | Google · Gemma license | 1157±11 | 3,039 | $0.03 / $0.09 | 8.2K |
| 278 | 268-290 | Nexusflow · Apache-2.0 | 1157±14 | 1,973 | N/A | N/A |
| 279 | 267-296 | Microsoft · Llama 2 Community | 1156±19 | 903 | N/A | N/A |
| 280 | 268-290 | OpenChat · Apache-2.0 | 1156±14 | 1,726 | N/A | N/A |
| 281 | 265-298 | DeepSeek · DeepSeek License | 1154±23 | 576 | N/A | N/A |
| 282 | 257-304 | HuggingFace · Apache 2.0 | 1150±33 | 271 | N/A | N/A |
| 283 | 271-296 | 01 AI · Yi License | 1150±13 | 2,043 | $0.90 / $0.90 | 4.1K |
| 284 | 269-298 | NousResearch · Apache-2.0 | 1149±20 | 697 | $0.17 / $0.17 | N/A |
| 285 | 271-296 | Microsoft · MIT | 1149±12 | 2,564 | $0.13 / $0.52 | N/A |
| 286 | 271-302 | AllenAI/UW · AI2 ImpACT Low-risk | 1144±19 | 888 | N/A | N/A |
| 287 | 274-302 | Microsoft · MIT | 1137±13 | 2,813 | $0.13 / $0.52 | N/A |
| 288 | 278-302 | Meta · Llama 2 Community | 1135±10 | 4,740 | $0.70 / $2.80 | 4.1K |
| 289 | 280-305 | Mistral · Apache-2.0 | 1126±12 | 2,605 | $0.20 / $0.20 | 32.8K |
| 290 | 280-306 | UC Berkeley · CC-BY-NC-4.0 | 1125±16 | 1,300 | N/A | N/A |
| 291 | 272-312 | Cognitive Computations · Apache-2.0 | 1123±32 | 219 | $0.50 / $0.50 | 16.4K |
| 292 | 277-308 | Alibaba · Qianwen LICENSE | 1123±24 | 534 | N/A | N/A |
| 293 | 280-306 | Meta · Llama 3.2 | 1123±16 | 1,162 | $0.03 / $0.20 | 60K |
| 294 | 280-306 | OpenChat · Apache-2.0 | 1123±18 | 945 | $0.20 / $0.20 | N/A |
| 295 | 280-308 | Alibaba · Qianwen LICENSE | 1119±20 | 690 | $0.20 / $0.20 | N/A |
| 296 | 283-308 | Google · Gemma license | 1115±16 | 1,120 | $0.05 / $0.08 | 8.2K |
| 297 | 285-307 | LMSYS · Non-commercial | 1114±13 | 2,663 | $0 / $0 | 2K |
| 298 | 280-313 | Nvidia · Llama 2 Community | 1113±27 | 440 | N/A | N/A |
| 299 | 283-312 | Google · Proprietary | 1112±19 | 901 | $0.50 / $0.50 | 25.8K |
| 300 | 288-311 | Meta · Llama 2 Community | 1108±13 | 2,218 | $0.25 / $0.25 | 4.1K |
| 301 | 285-313 | Upstage AI · CC-BY-NC-4.0 | 1107±22 | 604 | $0.30 / $0.30 | N/A |
| 302 | 285-313 | Meta · Llama 2 Community | 1107±19 | 770 | $0.35 / $1.40 | 16.4K |
| 303 | 288-313 | Google · Gemma license | 1104±16 | 1,355 | N/A | N/A |
| 304 | 285-317 | MosaicML · CC-BY-NC-SA-4.0 | 1093±34 | 242 | N/A | N/A |
| 305 | 289-315 | NousResearch · Apache-2.0 | 1092±21 | 628 | $0.90 / $0.90 | N/A |
| 306 | 297-315 | Meta · Llama 2 Community | 1084±14 | 1,656 | $0.15 / $0.15 | 4.1K |
| 307 | 294-316 | Alibaba · Qianwen LICENSE | 1083±18 | 988 | $0.10 / $0.10 | N/A |
| 308 | 293-317 | Together AI · Apache 2.0 | 1083±20 | 676 | $0.20 / $0.20 | N/A |
| 309 | 297-316 | HuggingFace · MIT | 1081±17 | 1,250 | $0.15 / $0.15 | 16.4K |
| 310 | 298-315 | LMSYS · Llama 2 Community | 1081±14 | 2,146 | $0.30 / $0.30 | N/A |
| 311 | 297-317 | Mistral · Apache 2.0 | 1080±19 | 974 | $0.07 / $0.28 | 4.1K |
| 312 | 290-317 | UW · Non-commercial | 1078±32 | 280 | N/A | N/A |
| 313 | 300-317 | Google · Gemma license | 1067±22 | 597 | $0.10 / $0.10 | N/A |
| 314 | 304-317 | Microsoft · Llama 2 Community | 1063±21 | 669 | $0.30 / $0.30 | N/A |
| 315 | 304-317 | Ai2 · Apache-2.0 | 1053±19 | 848 | $0.20 / $0.20 | N/A |
| 316 | 307-318 | LMSYS · Llama 2 Community | 1045±21 | 658 | $0.20 / $0.20 | N/A |
| 317 | 309-318 | Tsinghua · Apache-2.0 | 1040±23 | 576 | N/A | N/A |
| 318 | 316-326 | Nomic AI · Non-commercial | 996±37 | 211 | N/A | N/A |
| 319 | 318-326 | Stanford · Non-commercial | 988±23 | 652 | N/A | N/A |
| 320 | 318-326 | MosaicML · CC-BY-NC-SA-4.0 | 982±25 | 471 | N/A | N/A |
| 321 | 318-326 | RWKV · Apache 2.0 | 980±24 | 544 | N/A | N/A |
| 322 | 318-326 | UC Berkeley · Non-commercial | 978±21 | 751 | N/A | N/A |
| 323 | 318-326 | Tsinghua · Non-commercial | 975±25 | 525 | N/A | N/A |
| 324 | 318-328 | Tsinghua · Apache-2.0 | 969±35 | 227 | N/A | N/A |
| 325 | 318-328 | OpenAssistant · Apache 2.0 | 957±22 | 687 | N/A | N/A |
| 326 | 318-328 | Databricks · MIT | 947±29 | 370 | N/A | N/A |
| 327 | 324-329 | LMSYS · Apache 2.0 | 917±26 | 462 | N/A | N/A |
| 328 | 324-329 | Meta · Non-commercial | 915±33 | 252 | $0.23 / $0.23 | N/A |
| 329 | 327-329 | Stability AI · CC-BY-NC-SA-4.0 | 888±29 | 353 | N/A | N/A |
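
Each score above is listed as a value with a ± margin (for example 1518±18 and 1517±22 for the first two entries), and neighbouring entries often differ by less than their margins. The sketch below is one way to check whether two such entries are distinguishable. It assumes the ± margin is a symmetric interval half-width. The helper names `parse_score`, `intervals_overlap`, and `elo_style_win_probability` are hypothetical, and the last function additionally assumes the scores sit on an Elo-like logistic scale (base 10, 400 points per decade), which this page does not state.

```python
import re
from typing import Tuple


def parse_score(cell: str) -> Tuple[float, float]:
    """Split a 'score±margin' cell such as '1518±18' into (score, margin)."""
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)\s*", cell)
    if match is None:
        raise ValueError(f"not a score±margin cell: {cell!r}")
    return float(match.group(1)), float(match.group(2))


def intervals_overlap(cell_a: str, cell_b: str) -> bool:
    """True when [score - margin, score + margin] of both cells intersect."""
    score_a, margin_a = parse_score(cell_a)
    score_b, margin_b = parse_score(cell_b)
    return abs(score_a - score_b) <= margin_a + margin_b


def elo_style_win_probability(score_a: float, score_b: float,
                              scale: float = 400.0) -> float:
    """Expected win rate of A over B, ASSUMING an Elo-like logistic scale."""
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / scale))


if __name__ == "__main__":
    # Ranks 1 and 2 in the table: the intervals overlap almost entirely.
    print(intervals_overlap("1518±18", "1517±22"))
    # Rank 1 versus rank 31 (1444±8): the intervals are disjoint.
    print(intervals_overlap("1518±18", "1444±8"))
    print(round(elo_style_win_probability(1518, 1517), 3))
```

Overlapping intervals are consistent with the wide rank spreads in the table: an entry whose interval intersects many others cannot be placed at a single rank with confidence.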

Default Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Battle Count for Each Combination of Models (without Ties)
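
The plot titles above mention bootstrapped confidence intervals and win fractions over non-tied battles. As a rough illustration of the idea, and not the leaderboard's actual procedure, the sketch below computes a percentile bootstrap interval for one model's win fraction against another. The function name `bootstrap_win_rate_interval`, its parameters, and the example outcomes are all hypothetical. It resamples raw win/loss outcomes directly rather than refitting a rating model on each resample.

```python
import random
from typing import Sequence, Tuple


def bootstrap_win_rate_interval(
    outcomes: Sequence[int],
    n_resamples: int = 1000,
    confidence: float = 0.95,
    seed: int = 0,
) -> Tuple[float, float, float]:
    """Return (win_rate, lower, upper) for non-tied battle outcomes.

    `outcomes` holds 1 where model A won and 0 where model B won.
    The bounds are percentiles of the resampled win rates.
    """
    if not outcomes:
        raise ValueError("need at least one battle outcome")
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    n = len(outcomes)
    estimates = []
    for _ in range(n_resamples):
        # Draw n battles with replacement and recompute the win rate.
        sample = [outcomes[rng.randrange(n)] for _ in range(n)]
        estimates.append(sum(sample) / n)
    estimates.sort()
    tail = (1.0 - confidence) / 2.0
    lower = estimates[int(tail * (n_resamples - 1))]
    upper = estimates[int((1.0 - tail) * (n_resamples - 1))]
    return sum(outcomes) / n, lower, upper


if __name__ == "__main__":
    # Made-up example: A wins 60 of 100 non-tied battles against B.
    battles = [1] * 60 + [0] * 40
    rate, low, high = bootstrap_win_rate_interval(battles)
    print(f"win rate {rate:.2f}, 95% interval [{low:.2f}, {high:.2f}]")
```

Fewer battles give a wider interval, which matches the pattern in the table where entries with few votes carry the largest ± margins.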