Text Arena⚖️Legal & Government

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

May 17, 2026
381,765 votes
332 models
Rank Spread
1
121
Anthropic
Anthropic · Proprietary
1510±14
2,046$5 / $251M
2
132
Meta
Meta · Proprietary
1509±21
798N/AN/A
3
121
Anthropic
Anthropic · Proprietary
1509±13
2,106$5 / $251M
4
133
Anthropic
Anthropic · Proprietary
1507±20
941$5 / $251M
5
130
Google · Proprietary
1502±11
2,907$2 / $121M
6
131
Google · Proprietary
1501±12
2,531$2 / $121M
7
158
OpenAI · Proprietary
1491±23
755$5 / $301.1M
8
154
Anthropic
Anthropic · Proprietary
1489±18
1,044$5 / $251M
9
161
Baidu · Proprietary
1489±23
678N/AN/A
10
147
Google · Proprietary
1487±13
2,314$0.50 / $31M
11
161
DeepSeek · MIT
1486±22
740$0.43 / $0.871M
12
161
OpenAI · Proprietary
1485±22
722$5 / $301.1M
13
150
Anthropic
1485±12
2,671$5 / $25200K
14
155
1485±15
1,619$2 / $62M
15
158
Anthropic
Anthropic · Proprietary
1484±16
1,534$3 / $151M
16
176
Alibaba · Proprietary
1484±35
291N/AN/A
17
162
1483±21
774$0.43 / $0.871M
18
161
OpenAI · Proprietary
1482±15
1,596$2.50 / $151.1M
19
353
xAI · Proprietary
1481±10
4,234N/AN/A
20
354
1481±10
3,424$0.50 / $31M
21
361
OpenAI · Proprietary
1480±14
2,032$1.75 / $14128K
22
170
Google · Proprietary
1480±27
462$1.50 / $91M
23
455
Anthropic
Anthropic · Proprietary
1480±10
4,219$5 / $25200K
24
361
xAI · Proprietary
1478±15
1,657N/AN/A
25
168
Xiaomi · MIT
1476±22
729$1 / $31M
26
761
xAI · Proprietary
1475±9
4,542N/AN/A
27
566
1474±15
1,585$2 / $62M
28
763
Bytedance
Bytedance · Proprietary
1474±14
2,118N/AN/A
29
666
OpenAI · Proprietary
1473±15
1,634$2.50 / $151.1M
30
369
Z.ai · MIT
1472±20
870$1.40 / $4.40202.8K
31
374
1472±21
742$0.11 / $0.221M
32
378
OpenAI · Proprietary
1469±23
701$75 / $150128K
33
762
Google · Proprietary
1468±7
8,326$1.25 / $101M
34
768
OpenAI · Proprietary
1468±14
2,004$1.75 / $14128K
35
392
Google · Apache 2.0
1465±27
426$0.14 / $0.40262.1K
36
775
Z.ai · MIT
1464±15
1,565$1 / $3.20202.8K
37
973
Anthropic
1461±11
3,199$15 / $75200K
38
789
Moonshot · Modified MIT
1461±21
731$0.95 / $4262.1K
39
1270
Anthropic
Anthropic · Proprietary
1461±9
4,862$3 / $15200K
40
1108
1461±35
263$0.27 / $0.95163.8K
41
780
Alibaba · Proprietary
1460±16
1,309N/AN/A
42
1674
Anthropic
1459±9
4,815$3 / $15200K
43
1674
OpenAI · Proprietary
1459±8
5,275$5 / $15128K
44
980
OpenAI · Proprietary
1458±14
1,954$1.25 / $10128K
45
1675
Anthropic
Anthropic · Proprietary
1458±9
5,276$15 / $75200K
46
886
Xiaomi · Proprietary
1458±16
1,298$1 / $31M
47
1478
OpenAI · Proprietary
1458±11
2,951$1.25 / $10400K
48
793
Baidu · Proprietary
1458±21
763N/AN/A
49
7107
Google · Apache 2.0
1457±29
384N/AN/A
50
3109
Alibaba · Proprietary
1457±34
289$1.04 / $6.24262.1K
51
989
OpenAI · Proprietary
1457±16
1,416$0.75 / $4.50400K
52
797
OpenAI · Proprietary
1457±21
842$5 / $301.1M
53
893
Z.ai · MIT
1455±19
958$0.40 / $1.75202.8K
54
1489
OpenAI · Proprietary
1455±14
1,871$1.25 / $10400K
55
7102
DeepSeek · MIT
1453±22
706$0.11 / $0.221M
56
8102
Moonshot · Modified MIT
1453±21
781$0.60 / $2.50262.1K
57
2383
OpenAI · Proprietary
1453±10
3,815$2 / $8200K
58
7117
Baidu · Proprietary
1450±30
323N/AN/A
59
7116
Tencent
Tencent · tencent-hunyuan-community
1449±29
391$0.29 / $1.17262.1K
60
2693
OpenAI · Proprietary
1449±11
3,057$1.25 / $10400K
61
2693
OpenAI · Proprietary
1448±11
2,936$1.75 / $14400K
62
2693
OpenAI · Proprietary
1448±11
3,061$1.75 / $14400K
63
14108
xAI · Proprietary
1447±23
670$1.25 / $2.501M
64
7132
1446±36
247N/AN/A
65
10117
Moonshot · Modified MIT
1445±25
518$0.40 / $1.90262.1K
66
28103
Baidu · Proprietary
1443±13
2,227N/AN/A
67
30107
Moonshot · Modified MIT
1440±12
2,247$0.60 / $3N/A
68
34107
Anthropic
Anthropic · Proprietary
1440±11
2,920$15 / $75200K
69
31108
Alibaba · Apache 2.0
1439±14
1,866$0.39 / $2.34262.1K
70
39107
OpenAI · Proprietary
1439±11
3,276$2 / $81M
71
33108
Google · Proprietary
1438±13
2,070$0.25 / $1.501M
72
25125
Alibaba · Apache 2.0
1438±22
753$0.20 / $0.88262.1K
73
28121
Alibaba · Proprietary
1437±20
844$0.33 / $1.951M
74
42107
Moonshot · Modified MIT
1436±10
3,987$1.15 / $8262.1K
75
40109
Z.ai · MIT
1436±12
2,397$0.43 / $1.74202.8K
76
40113
Anthropic
Anthropic · Proprietary
1434±12
2,468$15 / $75200K
77
46108
Alibaba · Apache 2.0
1434±8
6,135$0.26 / $1.06N/A
78
42121
Alibaba · Proprietary
1431±14
1,735$0.78 / $3.90262.1K
79
46117
DeepSeek · MIT
1431±11
2,838$0.25 / $0.38131.1K
80
49119
DeepSeek · MIT
1429±11
3,236$0.25 / $0.38131.1K
81
46125
1428±13
2,161$0.30 / $2.501M
82
45127
Alibaba · Apache 2.0
1428±14
1,731$0.26 / $2.08262.1K
83
56115
Google · Proprietary
1428±7
8,208$0.30 / $2.501M
84
44135
xAI · Proprietary
1426±17
1,189$0.20 / $0.502M
85
55125
xAI · Proprietary
1426±10
3,749$0.20 / $0.502M
86
37145
1425±25
523$0.27 / $0.41163.8K
87
33147
xAI · Proprietary
1424±28
436$3 / $15256K
88
49133
Moonshot · Modified MIT
1424±14
1,762$0.60 / $2.50131.1K
89
44142
DeepSeek · MIT
1422±21
781$1.23 / $4.94N/A
90
56132
xAI · Proprietary
1422±12
2,684$3 / $15256K
91
55135
xAI · Proprietary
1422±14
1,885$3 / $15131.1K
92
56132
Mistral · Apache 2.0
1421±11
2,996$0.50 / $1.50N/A
93
49140
DeepSeek · MIT
1421±18
1,012$1.23 / $4.94N/A
94
50140
Meituan · Proprietary
1421±17
1,226N/AN/A
95
64132
Mistral · Proprietary
1418±8
6,051$2.70 / $8.1032K
96
45150
Alibaba · Proprietary
1418±24
570$0.78 / $3.90262.1K
97
33163
DeepSeek · MIT
1417±35
271$0.27 / $0.95163.8K
98
56145
MiniMax · Modified MIT
1416±17
1,209$0.28 / $1.20204.8K
99
55148
DeepSeek · MIT
1415±21
784$0.27 / $0.41163.8K
100
59145
Z.ai · MIT
1414±16
1,485$0.60 / $2.20131.1K
101
59146
DeepSeek · MIT
1414±16
1,390$0.50 / $2.15163.8K
102
59146
Alibaba · Apache 2.0
1413±16
1,427$0.09 / $1.10262.1K
103
69145
Anthropic
Anthropic · Proprietary
1412±12
2,684$3 / $151M
104
72145
DeepSeek · MIT
1410±11
2,940$3 / $4.5032.8K
105
56157
Xiaomi · MIT
1410±24
675$0.40 / $21M
106
71146
Mistral · Proprietary
1409±13
2,139$0.40 / $2131.1K
107
78145
Anthropic
Anthropic · Proprietary
1409±9
4,963$1 / $5200K
108
72148
Anthropic
1408±13
2,280$3 / $151M
109
45178
Tencent
Tencent · Proprietary
1407±34
273N/AN/A
110
71154
OpenAI · Proprietary
1407±16
1,454$15 / $60200K
111
71155
OpenAI · Proprietary
1407±16
1,384$0.20 / $1.25400K
112
73154
Alibaba · Apache 2.0
1406±14
1,698$0.20 / $1.56262.1K
113
74154
Alibaba · Apache 2.0
1406±14
1,804$0.14 / $1262.1K
114
77154
MiniMax · Modified MIT
1405±13
2,165$0.15 / $1.15204.8K
115
80153
Alibaba · Apache 2.0
1405±12
2,573$0.46 / $1.82131.1K
116
59170
Alibaba · Apache 2.0
1404±25
513$0.26 / $2.60131.1K
117
71161
1403±20
856$0.10 / $0.30262.1K
118
44186
Z.ai · MIT
1402±41
203$0.30 / $0.90131.1K
119
55182
1401±36
264N/AN/A
120
80158
Alibaba · Apache 2.0
1401±15
1,592$0.40 / $1.60262.1K
121
84155
Anthropic
1400±12
2,515$3 / $15200K
122
84155
OpenAI · Proprietary
1400±12
2,779$1.10 / $4.40200K
123
78162
MiniMax · MIT
1400±18
1,186$0.29 / $0.95204.8K
124
80162
Microsoft AI · Proprietary
1400±17
1,191N/AN/A
125
84155
OpenAI · Proprietary
1399±12
2,478$0.40 / $1.601M
126
80165
DeepSeek · MIT
1399±18
1,097$0.70 / $2.50163.8K
127
88155
1399±11
3,152$0.10 / $0.401M
128
89160
Anthropic
Anthropic · Proprietary
1397±12
2,502$3 / $15200K
129
58186
1396±35
256N/AN/A
130
84169
Arcee AI · Apache 2.0
1396±15
1,711$0.15 / $0.45131K
131
89169
OpenAI · Proprietary
1394±15
1,710$0.25 / $2400K
132
83180
Meituan · MIT
1393±21
779$0.20 / $0.80131.1K
133
91170
Stepfun
StepFun · Apache 2.0
1392±13
2,035$0.10 / $0.30262.1K
134
91173
OpenAI · Proprietary
1392±14
1,945$15 / $60N/A
135
93169
1391±11
2,820$0.10 / $0.30262.1K
136
83184
Alibaba · Apache 2.0
1390±24
540$0.15 / $1.50262.1K
137
93178
Alibaba · Proprietary
1388±15
1,683N/AN/A
138
94178
Alibaba · Proprietary
1387±14
1,868N/AN/A
139
94180
DeepSeek · DeepSeek
1385±16
1,312$1.14 / $4.56N/A
140
108176
Anthropic
Anthropic · Proprietary
1384±9
5,534$3 / $15200K
141
91186
Tencent
Tencent · Proprietary
1383±21
741N/AN/A
142
100182
Alibaba · Apache 2.0
1382±15
1,492$0.09 / $0.30262.1K
143
107180
Google · Gemma
1382±12
2,719$0.08 / $0.16131.1K
144
84206
Z.ai · Proprietary
1381±30
367N/AN/A
145
106181
1381±13
2,096$0.10 / $0.401M
146
107180
Google · Proprietary
1381±12
2,590$0.10 / $0.401M
147
94186
Z.ai · MIT
1381±20
826$0.06 / $0.40202.8K
148
77219
Google · Gemma
1381±39
217$0.04 / $0.13131.1K
149
104184
1380±15
1,435$0.07 / $0.301M
150
91207
Alibaba · Proprietary
1379±28
392$0.40 / $1.20131.1K
151
103186
xAI · Proprietary
1379±17
1,171$0.25 / $1.27N/A
152
112184
Google · Proprietary
1377±14
2,962N/AN/A
153
106186
Arcee AI · Apache 2.0
1377±17
1,227$0.22 / $0.85262.1K
154
117182
Cohere
Cohere · CC-BY-NC-4.0
1376±10
3,479$2.50 / $10256K
155
121184
OpenAI · Proprietary
1373±10
6,743$5 / $15128K
156
117187
Z.ai · MIT
1373±14
1,935$0.13 / $0.85131.1K
157
120191
Meta
Meta · Llama 3.1 Community
1370±13
2,330$4 / $432.8K
158
123187
Meta
Meta · Llama 3.1 Community
1370±12
3,383$4 / $432.8K
159
91227
Z.ai · MIT
1370±34
277$0.60 / $1.8065.5K
160
128194
Google · Proprietary
1368±11
3,342$3.50 / $10.502.1M
161
119207
Mistral · Apache 2.0
1368±17
1,171$0.10 / $0.3032K
162
107221
Nvidia · NVIDIA Open Model
1367±26
511N/AN/A
163
125203
1366±14
1,839N/AN/A
164
125206
01.AI
01 AI · Proprietary
1366±15
1,697N/AN/A
165
130202
MiniMax · Apache 2.0
1365±12
2,409$0.40 / $2.201M
166
131197
xAI · Proprietary
1365±11
3,813$2 / $10131.1K
167
130207
Alibaba · Apache 2.0
1364±14
1,737$0.46 / $1.82131.1K
168
125223
Alibaba · Apache 2.0
1361±20
869$0.10 / $0.78262.1K
169
130220
OpenAI · Proprietary
1360±18
1,023$1.10 / $4.40200K
170
135221
xAI · Proprietary
1357±15
1,507$0.30 / $0.50131.1K
171
119229
MiniMax · Apache 2.0
1356±28
429$0.26 / $1204.8K
172
131227
Ai2 · Apache 2.0
1356±20
907$0.20 / $0.6065.5K
173
143217
Anthropic
Anthropic · Proprietary
1355±11
4,818$3 / $15200K
174
124235
OpenAI · Proprietary
1353±29
382$0.10 / $0.401M
175
147219
Anthropic
Anthropic · Proprietary
1352±10
4,272$0.80 / $4200K
176
143227
Z.ai · Proprietary
1350±15
1,723$0.44 / $1.76204.8K
177
153223
Anthropic
Anthropic · Proprietary
1350±9
11,095$15 / $75200K
178
147226
OpenAI · Proprietary
1350±12
2,674$2.50 / $10128K
179
118248
1349±37
233$0.10 / $0.40131.1K
180
131237
Ant Group · MIT
1349±27
437N/AN/A
181
155226
OpenAI · Proprietary
1348±11
3,866$0.15 / $0.60128K
182
125246
Stepfun
StepFun · Proprietary
1348±33
278N/AN/A
183
156228
OpenAI · Apache 2.0
1344±14
1,926$0.04 / $0.18131.1K
184
157228
OpenAI · Proprietary
1343±12
3,138$1.10 / $4.40N/A
185
158228
Meta
Meta · Llama-3.3
1343±11
3,175$0.10 / $0.32131.1K
186
156228
1343±14
1,995$0.40 / $0.708.2K
187
124252
1342±39
208N/AN/A
188
158228
1342±12
2,630$0.63 / $1.80131.1K
189
162228
OpenAI · Proprietary
1341±10
3,633$1.10 / $4.40200K
190
132250
Tencent
Tencent · Proprietary
1340±34
288N/AN/A
191
139248
Stepfun
StepFun · Apache 2.0
1340±28
416$0.57 / $1.4265.5K
192
160230
Mistral · Mistral Research
1340±12
2,735$2 / $6131.1K
193
158235
Alibaba · Apache 2.0
1339±15
1,639$0.50 / $116.4K
194
165230
OpenAI · Proprietary
1339±11
5,381$10 / $30128K
195
132253
Google · Gemma
1338±36
245$0.04 / $0.08131.1K
196
135253
Tencent
Tencent · Proprietary
1336±36
252N/AN/A
197
165237
Google · Proprietary
1333±12
4,795$3.50 / $10.502.1M
198
160246
NexusFlow · CC-BY-NC-4.0
1332±19
1,008N/AN/A
199
140258
1332±36
230N/AN/A
200
165244
DeepSeek · DeepSeek
1332±15
1,518N/AN/A
201
165241
Alibaba · Qwen
1331±13
2,386$1.20 / $1.20N/A
202
160249
1331±21
814N/AN/A
203
158250
Stepfun
StepFun · Proprietary
1331±23
658N/AN/A
204
156252
OpenAI · Proprietary
1330±27
473$0.05 / $0.40400K
205
155253
Reka AI · Proprietary
1330±29
371N/AN/A
206
165249
Amazon · Proprietary
1330±20
891$0.30 / $2.501M
207
171242
Meta
Meta · Llama 3.1 Community
1329±12
3,177$0.40 / $0.40131.1K
208
171242
xAI · Proprietary
1329±12
3,012$2 / $10131.1K
209
155253
Prime Intellect · MIT
1329±30
398$0.20 / $1.10131.1K
210
173243
OpenAI · Proprietary
1328±12
5,037$10 / $30128K
211
169246
Google · Proprietary
1328±13
2,160$0.07 / $0.301M
212
159252
Princeton · MIT
1328±25
543$0.03 / $0.098.2K
213
166248
NexusFlow · NexusFlow
1327±16
1,443N/AN/A
214
165249
Alibaba · Qwen
1327±18
1,085$1.60 / $6.4032.8K
215
157255
DeepSeek · DeepSeek
1327±30
375N/AN/A
216
157253
AI21 Labs · Jamba Open
1327±27
448$2 / $8256K
217
168248
Mistral · MRL
1327±15
1,593$2 / $6131.1K
218
158253
1327±27
489$1.20 / $1.20131.1K
219
166249
Amazon · Proprietary
1326±16
1,296$0.80 / $3.20300K
220
155259
Alibaba · Apache 2.0
1326±33
285$0.08 / $0.28131.1K
221
165253
Alibaba · Proprietary
1325±24
588N/AN/A
222
173249
Google · Gemma
1324±16
1,408$0.06 / $0.1232.8K
223
165253
OpenAI · Apache 2.0
1324±23
695$0.03 / $0.14131.1K
224
175249
Alibaba · Apache 2.0
1323±14
1,770$0.09 / $0.45131.1K
225
165256
Cohere
Cohere · CC-BY-NC-4.0
1321±23
592$2.50 / $10128K
226
178253
Nvidia · NVIDIA Open Model
1317±18
1,177N/AN/A
227
187251
OpenAI · Proprietary
1316±11
5,572$10 / $30128K
228
187253
Cohere
Cohere · CC-BY-NC-4.0
1314±12
4,403$2.50 / $10128K
229
173262
Z.ai · Proprietary
1314±25
556N/AN/A
230
189253
Google · Gemma license
1312±10
4,458$0.65 / $0.658.2K
231
165268
Alibaba · Apache 2.0
1310±34
273$0.87 / $0.8732K
232
191258
1308±13
2,160$0.10 / $0.3032K
233
187260
Nvidia · NVIDIA Open Model
1307±17
1,202$0.06 / $0.24262.1K
234
191258
Google · Proprietary
1307±12
3,784$0.07 / $0.301M
235
184266
Mistral · Proprietary
1306±22
853$2 / $540K
236
192259
Anthropic
Anthropic · Proprietary
1306±12
6,062$3 / $15200K
237
187267
Cohere
Cohere · CC-BY-NC-4.0
1303±24
569$0.15 / $0.60128K
238
199259
Meta
Meta · Llama 3 Community
1302±11
8,328$0.51 / $0.748.2K
239
196265
Cohere
Cohere · CC-BY-NC-4.0
1300±15
1,742N/AN/A
240
195266
Microsoft · MIT
1300±17
1,341$0.07 / $0.1416.4K
241
185270
Ai2 · Apache 2.0
1299±29
430$0.15 / $0.5065.5K
242
203266
OpenAI · Proprietary
1298±14
2,897$30 / $608.2K
243
191268
Ai2 · Apache 2.0
1297±24
635$0.15 / $0.5065.5K
244
209267
Google · Proprietary
1295±13
2,128$0.07 / $0.301M
245
189273
Tencent
Tencent · Proprietary
1292±31
397N/AN/A
246
191273
Ant Group · MIT
1291±29
445N/AN/A
247
187277
IBM · Apache 2.0
1290±35
323N/AN/A
248
199273
AI21 Labs · Jamba Open
1287±25
531$0.20 / $0.40256K
249
226268
Anthropic
Anthropic · Proprietary
1287±11
6,799$0.25 / $1.25200K
250
194277
Reka AI · Proprietary
1287±30
414N/AN/A
251
227268
Google · Gemma license
1286±12
3,243$0.03 / $0.098.2K
252
215271
Amazon · Proprietary
1284±18
1,079$0.06 / $0.24300K
253
212273
Reka AI · Proprietary
1283±21
876N/AN/A
254
228271
Cohere
Cohere · CC-BY-NC-4.0
1283±14
3,073$0.15 / $0.60128K
255
226277
DeepSeek · DeepSeek License
1278±20
888$0.14 / $0.28128K
256
228277
Mistral · Apache 2.0
1276±21
869$0.05 / $0.0832.8K
257
236273
OpenAI · Proprietary
1276±12
4,695$30 / $608.2K
258
235273
Alibaba · Qianwen LICENSE
1276±14
2,262$0.90 / $0.9032.8K
259
196285
Ai2 · Apache-2.0
1274±41
202$0.05 / $0.20128K
260
237277
Mistral · Proprietary
1271±14
3,306$4 / $1232K
261
211284
Mistral · MRL
1271±34
324$0.10 / $0.10131.1K
262
236277
Alibaba · Qianwen LICENSE
1271±17
1,506N/AN/A
263
235281
Google · Proprietary
1268±21
970$0.35 / $1.0532.8K
264
236281
Amazon · Proprietary
1267±19
1,082$0.04 / $0.14128K
265
234282
Cohere
Cohere · CC-BY-NC-4.0
1266±25
551N/AN/A
266
240281
Reka AI · Proprietary
1264±18
1,350N/AN/A
267
242281
Alibaba · Qianwen LICENSE
1262±15
2,091N/AN/A
268
246282
Mistral · Apache 2.0
1258±14
2,798$0.90 / $0.9065.5K
269
246283
Mistral · Proprietary
1255±16
1,811$2.70 / $8.1032K
270
249282
OpenAI · Proprietary
1253±13
3,573$0.50 / $1.5016.4K
271
247284
01.AI
01 AI · Apache-2.0
1253±17
1,510N/AN/A
272
231305
IBM · Apache 2.0
1252±42
215N/AN/A
273
255284
Meta
Meta · Llama 3 Community
1247±12
5,578$0.04 / $0.048.2K
274
255285
Meta
Meta · Llama 3.1 Community
1247±12
2,882$0.02 / $0.05131.1K
275
255307
OpenChat · Apache-2.0
1232±28
467$0.20 / $0.20N/A
276
261303
Alibaba · Qianwen LICENSE
1230±18
1,227N/AN/A
277
268301
Mistral · Apache 2.0
1227±13
3,953$0.63 / $0.6332K
278
249312
HuggingFace · Apache 2.0
1226±38
240N/AN/A
279
255311
Google · Proprietary
1226±34
309$0.35 / $1.0532.8K
280
265307
01.AI
01 AI · Yi License
1222±22
857$0.90 / $0.904.1K
281
272305
Google · Gemma license
1222±13
2,655N/AN/A
282
261311
Microsoft · Llama 2 Community
1221±29
434N/AN/A
283
261311
AllenAI/UW · AI2 ImpACT Low-risk
1220±31
377N/AN/A
284
274308
Microsoft · MIT
1214±17
1,474$0.17 / $0.68N/A
285
261318
NousResearch · Apache-2.0
1214±36
275$0.17 / $0.17N/A
286
274309
Google · Gemma license
1212±18
1,416$0.03 / $0.098.2K
287
274309
Databricks · DBRX LICENSE
1211±17
1,748$0.60 / $0.6032.8K
288
274311
Nexusflow · Apache-2.0
1209±21
878N/AN/A
289
274312
OpenAI · Proprietary
1208±23
944$1 / $216.4K
290
274313
OpenChat · Apache-2.0
1208±25
637N/AN/A
291
274312
Alibaba · Qianwen LICENSE
1207±20
1,037$0.30 / $0.30N/A
292
274317
UC Berkeley · CC-BY-NC-4.0
1205±27
504N/AN/A
293
269320
DeepSeek · DeepSeek License
1204±35
300N/AN/A
294
274313
Snowflake · Apache 2.0
1200±18
1,535N/AN/A
295
274320
Microsoft · Llama 2 Community
1196±32
365$0.30 / $0.30N/A
296
274318
Microsoft · MIT
1196±18
1,187$0.15 / $0.60N/A
297
274321
IBM · Apache 2.0
1195±34
428N/AN/A
298
274320
InternLM · Other
1194±24
685$0 / $032.8K
299
276320
LMSYS · Non-commercial
1192±19
1,246$0 / $02K
300
278318
Meta
Meta · Llama 2 Community
1192±16
2,073$0.70 / $2.804.1K
301
274320
Meta
Meta · Llama 3.2
1190±28
506$0.05 / $0.34131.1K
302
275320
HuggingFace · MIT
1188±26
621$0.15 / $0.1516.4K
303
278320
Mistral · Apache-2.0
1186±21
976$0.20 / $0.2032.8K
304
274324
Upstage AI · CC-BY-NC-4.0
1184±40
219$0.30 / $0.30N/A
305
275323
IBM · Apache 2.0
1182±32
435N/AN/A
306
274327
Alibaba · Qianwen LICENSE
1180±41
230$0.20 / $0.20N/A
307
276327
Alibaba · Qianwen LICENSE
1175±38
274N/AN/A
308
281322
LMSYS · Llama 2 Community
1173±22
899$0.30 / $0.30N/A
309
283328
Meta
Meta · Llama 2 Community
1160±32
383$0.35 / $1.4016.4K
310
283328
Google · Proprietary
1160±32
428$0.50 / $0.5025.8K
311
287328
Google · Gemma license
1159±29
462$0.05 / $0.088.2K
312
293327
Meta
Meta · Llama 2 Community
1157±21
1,004$0.25 / $0.254.1K
313
296327
Microsoft · MIT
1156±19
1,126$0.13 / $0.52N/A
314
280329
Alibaba · Apache 2.0
1155±42
214$0.50 / $116.4K
315
292328
1155±25
706$0.13 / $0.524.1K
316
292329
Mistral · Apache 2.0
1152±30
487$0.07 / $0.284.1K
317
292329
Meta
Meta · Llama 3.2
1151±30
489$0.03 / $0.20131.1K
318
290329
LMSYS · Llama 2 Community
1149±35
305$0.20 / $0.20N/A
319
296329
Alibaba · Qianwen LICENSE
1142±32
390$0.10 / $0.10N/A
320
292330
Google · Gemma license
1141±38
273$0.10 / $0.10N/A
321
303329
Meta
Meta · Llama 2 Community
1138±23
760$0.15 / $0.154.1K
322
306330
Google · Gemma license
1121±28
578N/AN/A
323
307330
Microsoft · MIT
1119±23
1,033$0.13 / $0.52N/A
324
305330
Together AI · Apache 2.0
1115±36
309$0.20 / $0.20N/A
325
304331
RWKV · Apache 2.0
1113±48
190N/AN/A
326
307331
Ai2 · Apache-2.0
1106±35
301$0.20 / $0.20N/A
327
307331
Stanford · Non-commercial
1096±43
230N/AN/A
328
311331
Tsinghua · Apache-2.0
1094±41
268N/AN/A
329
315332
UC Berkeley · Non-commercial
1083±40
248N/AN/A
330
321332
OpenAssistant · Apache 2.0
1068±41
266N/AN/A
331
325332
LMSYS · Apache 2.0
1033±46
184N/AN/A
332
329332
Tsinghua · Non-commercial
996±48
195N/AN/A

Default Leaderboard Plots

Battle Count for Each Combination of Models (without Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles