Text Arena · 🧠 Hard Prompts (English)

View overall rankings of AI models on text-to-text tasks spanning math, coding, creative writing, and other open-ended domains.

Apr 10, 2026
1,196,586 votes
338 models
| Rank | Rank Spread | Organization | License | Score | Votes | Price (input / output) | Context |
|---|---|---|---|---|---|---|---|
| 1 | 1-2 | Anthropic | Proprietary | 1546±10 | 4,086 | $5 / $25 | 1M |
| 2 | 1-3 | Anthropic | Proprietary | 1537±9 | 4,626 | $5 / $25 | 1M |
| 3 | 2-22 | Meta | Proprietary | 1515±20 | 862 | N/A | N/A |
| 4 | 3-13 | Google | Proprietary | 1513±8 | 5,459 | $2 / $12 | 1M |
| 5 | 3-20 | OpenAI | Proprietary | 1509±12 | 2,514 | $2.50 / $15 | 1.1M |
| 6 | 3-19 | Anthropic | | 1506±7 | 9,428 | $5 / $25 | 200K |
| 7 | 3-22 | Google | Proprietary | 1502±7 | 10,758 | $2 / $12 | 1M |
| 8 | 3-26 | Anthropic | Proprietary | 1501±11 | 2,891 | $3 / $15 | 1M |
| 9 | 3-26 | OpenAI | Proprietary | 1500±9 | 4,264 | $1.75 / $14 | 128K |
| 10 | 3-33 | Z.ai | MIT | 1499±15 | 1,484 | $0.95 / $3.15 | 202.8K |
| 11 | 4-25 | Anthropic | Proprietary | 1498±6 | 12,104 | $5 / $25 | 200K |
| 12 | 3-33 | xAI | Proprietary | 1497±12 | 2,446 | N/A | N/A |
| 13 | 4-26 | Anthropic | | 1496±5 | 15,680 | $3 / $15 | 200K |
| 14 | 3-33 | | | 1496±11 | 2,656 | $2 / $6 | 2M |
| 15 | 4-36 | | | 1493±11 | 2,707 | $2 / $6 | 2M |
| 16 | 3-37 | Xiaomi | Proprietary | 1493±13 | 2,068 | $1 / $3 | 1M |
| 17 | 6-37 | Google | Proprietary | 1489±7 | 7,894 | $0.50 / $3 | 1M |
| 18 | 8-36 | Anthropic | | 1489±6 | 12,606 | $15 / $75 | 200K |
| 19 | 4-38 | OpenAI | Proprietary | 1489±11 | 2,706 | $2.50 / $15 | 1.1M |
| 20 | 8-37 | Anthropic | Proprietary | 1487±6 | 15,460 | $3 / $15 | 200K |
| 21 | 4-42 | Alibaba | Proprietary | 1487±13 | 2,122 | N/A | N/A |
| 22 | 4-43 | Google | Apache 2.0 | 1486±15 | 1,522 | $0.14 / $0.40 | 262.1K |
| 23 | 8-39 | Bytedance | Proprietary | 1485±9 | 4,971 | N/A | N/A |
| 24 | 5-46 | Meituan | Proprietary | 1484±14 | 1,599 | N/A | N/A |
| 25 | 6-45 | OpenAI | Proprietary | 1483±13 | 1,940 | $2.50 / $15 | 1.1M |
| 26 | 12-39 | xAI | Proprietary | 1483±6 | 12,012 | N/A | N/A |
| 27 | 12-38 | Anthropic | Proprietary | 1483±5 | 19,454 | $15 / $75 | 200K |
| 28 | 9-43 | Z.ai | MIT | 1482±10 | 3,679 | $1 / $3.20 | 202.8K |
| 29 | 12-42 | xAI | Proprietary | 1479±6 | 13,171 | N/A | N/A |
| 30 | 12-42 | OpenAI | Proprietary | 1479±7 | 10,376 | $1.25 / $10 | 400K |
| 31 | 12-45 | Moonshot | Modified MIT | 1478±9 | 4,397 | $0.60 / $3 | N/A |
| 32 | 12-48 | OpenAI | Proprietary | 1478±10 | 3,754 | $1.75 / $14 | 128K |
| 33 | 15-44 | | | 1477±7 | 8,841 | $0.50 / $3 | 1M |
| 34 | 15-54 | Z.ai | MIT | 1473±11 | 3,102 | $0.39 / $1.75 | 202.8K |
| 35 | 17-53 | Alibaba | Apache 2.0 | 1473±9 | 4,194 | $0.39 / $2.34 | 262.1K |
| 36 | 15-66 | Google | Apache 2.0 | 1469±15 | 1,505 | N/A | N/A |
| 37 | 22-54 | Baidu | Proprietary | 1469±8 | 5,746 | N/A | N/A |
| 38 | 24-54 | OpenAI | Proprietary | 1468±7 | 7,881 | $1.75 / $14 | 400K |
| 39 | 12-77 | | | 1467±19 | 891 | N/A | N/A |
| 40 | 20-68 | Moonshot | Modified MIT | 1466±13 | 2,159 | $0.38 / $1.72 | 262.1K |
| 41 | 24-69 | | | 1464±12 | 2,395 | $0.27 / $0.41 | 163.8K |
| 42 | 29-63 | OpenAI | Proprietary | 1463±7 | 7,256 | $1.75 / $14 | 400K |
| 43 | 32-64 | Anthropic | Proprietary | 1462±7 | 8,448 | $15 / $75 | 200K |
| 44 | 33-65 | DeepSeek | MIT | 1462±7 | 8,780 | $0.26 / $0.38 | 163.8K |
| 45 | 34-64 | Moonshot | Modified MIT | 1462±6 | 11,793 | $1.15 / $8 | 262.1K |
| 46 | 27-75 | Baidu | Proprietary | 1461±12 | 2,532 | N/A | N/A |
| 47 | 35-68 | Google | Proprietary | 1458±4 | 26,518 | $1.25 / $10 | 1M |
| 48 | 30-77 | Alibaba | Proprietary | 1458±12 | 2,547 | $0.78 / $3.90 | 262.1K |
| 49 | 34-71 | OpenAI | Proprietary | 1458±6 | 11,343 | $1.25 / $10 | 400K |
| 50 | 34-74 | Alibaba | Proprietary | 1457±7 | 6,872 | $0.78 / $3.90 | 262.1K |
| 51 | 24-92 | | | 1456±19 | 944 | N/A | N/A |
| 52 | 38-73 | OpenAI | Proprietary | 1455±5 | 20,061 | $5 / $15 | 128K |
| 53 | 34-82 | DeepSeek | MIT | 1454±11 | 3,226 | $0.27 / $0.41 | 163.8K |
| 54 | 38-77 | DeepSeek | MIT | 1454±7 | 10,344 | $0.26 / $0.38 | 163.8K |
| 55 | 38-80 | OpenAI | Proprietary | 1453±7 | 7,750 | $1.25 / $10 | 400K |
| 56 | 34-88 | Alibaba | Apache 2.0 | 1453±12 | 2,981 | $0.20 / $0.88 | 262.1K |
| 57 | 38-77 | Anthropic | Proprietary | 1452±5 | 15,765 | $1 / $5 | 200K |
| 58 | 39-77 | Alibaba | Apache 2.0 | 1451±5 | 20,480 | $0.26 / $1.06 | N/A |
| 59 | 41-85 | Z.ai | MIT | 1449±7 | 9,655 | $0.39 / $1.90 | 204.8K |
| 60 | 33-106 | | | 1448±20 | 848 | $0.21 / $0.79 | 163.8K |
| 61 | 42-91 | OpenAI | Proprietary | 1447±7 | 7,804 | $1.25 / $10 | 128K |
| 62 | 43-91 | Anthropic | | 1447±7 | 8,169 | $3 / $15 | 1M |
| 63 | 47-91 | OpenAI | Proprietary | 1446±6 | 14,046 | $2 / $8 | 200K |
| 64 | 46-91 | xAI | Proprietary | 1445±6 | 11,061 | $0.20 / $0.50 | 2M |
| 65 | 38-105 | xAI | Proprietary | 1445±15 | 1,615 | $3 / $15 | 256K |
| 66 | 43-94 | Google | Proprietary | 1445±9 | 4,235 | $0.25 / $1.50 | 1M |
| 67 | 38-99 | DeepSeek | MIT | 1445±12 | 2,475 | $1.23 / $4.94 | N/A |
| 68 | 38-102 | OpenAI | Proprietary | 1444±12 | 2,260 | $75 / $150 | 128K |
| 69 | 38-106 | MiniMax | Proprietary | 1444±14 | 1,770 | $0.30 / $1.20 | 196.6K |
| 70 | 45-96 | MiniMax | Modified MIT | 1444±9 | 4,403 | $0.12 / $0.99 | 196.6K |
| 71 | 38-115 | | | 1442±19 | 903 | N/A | N/A |
| 72 | 50-96 | Anthropic | Proprietary | 1441±7 | 9,977 | $15 / $75 | 200K |
| 73 | 47-105 | Alibaba | Apache 2.0 | 1441±10 | 3,274 | $0.26 / $2.08 | 262.1K |
| 74 | 46-106 | Meituan | MIT | 1440±11 | 2,869 | $0.20 / $0.80 | 131.1K |
| 75 | 55-102 | Mistral | Apache 2.0 | 1439±7 | 10,066 | $0.50 / $1.50 | N/A |
| 76 | 48-106 | Moonshot | Modified MIT | 1439±11 | 2,852 | $0.60 / $2.50 | 262.1K |
| 77 | 50-106 | DeepSeek | MIT | 1438±10 | 3,518 | $0.50 / $2.15 | 163.8K |
| 78 | 57-99 | Mistral | Proprietary | 1438±5 | 20,012 | $2.70 / $8.10 | 32K |
| 79 | 57-105 | OpenAI | Proprietary | 1437±6 | 11,989 | $2 / $8 | 1M |
| 80 | 55-109 | DeepSeek | MIT | 1436±10 | 3,382 | $1.23 / $4.94 | N/A |
| 81 | 55-109 | Alibaba | Apache 2.0 | 1435±10 | 3,113 | $0.20 / $1.56 | 262.1K |
| 82 | 58-108 | xAI | Proprietary | 1434±8 | 6,372 | $3 / $15 | 131.1K |
| 83 | 58-107 | | | 1434±7 | 7,649 | $0.09 / $0.29 | 262.1K |
| 84 | 59-109 | Moonshot | Modified MIT | 1433±8 | 6,412 | $0.60 / $2.50 | 131.1K |
| 85 | 56-116 | DeepSeek | MIT | 1432±11 | 2,656 | $0.70 / $2.50 | 64K |
| 86 | 59-111 | Z.ai | MIT | 1432±8 | 5,737 | $0.60 / $2.20 | 131.1K |
| 87 | 57-122 | OpenAI | Proprietary | 1429±14 | 1,807 | $2.50 / $15 | 1.1M |
| 88 | 64-116 | Alibaba | Apache 2.0 | 1429±8 | 6,101 | $0.09 / $1.10 | 262.1K |
| 89 | 63-118 | Alibaba | Proprietary | 1429±10 | 3,221 | N/A | N/A |
| 90 | 59-123 | Alibaba | Apache 2.0 | 1428±13 | 2,021 | $0.15 / $1.50 | 131.1K |
| 91 | 67-116 | Anthropic | | 1427±7 | 7,808 | $3 / $15 | 200K |
| 92 | 56-130 | DeepSeek | MIT | 1427±18 | 1,054 | $0.21 / $0.79 | 163.8K |
| 93 | 65-117 | Alibaba | Apache 2.0 | 1427±8 | 6,089 | $0.40 / $1.60 | 262.1K |
| 94 | 64-119 | MiniMax | MIT | 1426±9 | 4,214 | $0.29 / $0.95 | 196.6K |
| 95 | 58-126 | Baidu | Proprietary | 1426±15 | 1,376 | N/A | N/A |
| 96 | 49-137 | Tencent | Proprietary | 1426±23 | 583 | N/A | N/A |
| 97 | 71-117 | xAI | Proprietary | 1425±6 | 10,450 | $3 / $15 | 256K |
| 98 | 64-124 | | | 1425±11 | 2,781 | $0.09 / $0.29 | 262.1K |
| 99 | 69-122 | xAI | Proprietary | 1424±8 | 5,249 | $0.20 / $0.50 | 2M |
| 100 | 71-119 | Anthropic | Proprietary | 1424±7 | 9,321 | $3 / $15 | 1M |
| 101 | 69-123 | StepFun | Apache 2.0 | 1424±9 | 4,552 | $0.10 / $0.30 | 262.1K |
| 102 | 74-120 | | | 1423±7 | 8,845 | $0.30 / $2.50 | 1M |
| 103 | 67-125 | Alibaba | Apache 2.0 | 1423±10 | 3,204 | $0.16 / $1.30 | 262.1K |
| 104 | 71-124 | OpenAI | Proprietary | 1422±9 | 4,238 | $15 / $60 | 200K |
| 105 | 69-126 | Arcee AI | Apache 2.0 | 1422±10 | 3,471 | N/A | N/A |
| 106 | 67-131 | Alibaba | Apache 2.0 | 1421±13 | 2,119 | $0.26 / $2.60 | 131.1K |
| 107 | 79-124 | Alibaba | Apache 2.0 | 1420±7 | 8,696 | $0.46 / $1.82 | 131.1K |
| 108 | 85-124 | Google | Proprietary | 1419±4 | 26,068 | $0.30 / $2.50 | 1M |
| 109 | 85-126 | DeepSeek | MIT | 1417±6 | 10,301 | $3 / $4.50 | 32.8K |
| 110 | 85-128 | OpenAI | Proprietary | 1417±6 | 10,547 | $1.10 / $4.40 | 200K |
| 111 | 81-132 | Microsoft AI | Proprietary | 1416±9 | 3,950 | N/A | N/A |
| 112 | 84-131 | Alibaba | Apache 2.0 | 1416±8 | 5,659 | $0.09 / $0.30 | 262.1K |
| 113 | 84-139 | OpenAI | Proprietary | 1413±11 | 2,944 | $1.10 / $4.40 | 200K |
| 114 | 89-135 | OpenAI | Proprietary | 1413±7 | 8,790 | $0.40 / $1.60 | 1M |
| 115 | 86-136 | OpenAI | Proprietary | 1412±9 | 4,917 | $15 / $60 | N/A |
| 116 | 91-136 | Mistral | Proprietary | 1411±7 | 7,358 | $0.40 / $2 | 131.1K |
| 117 | 94-141 | OpenAI | Proprietary | 1409±8 | 6,523 | $0.25 / $2 | 400K |
| 118 | 104-141 | | | 1405±8 | 6,563 | N/A | N/A |
| 119 | 97-143 | Z.ai | MIT | 1405±11 | 3,086 | $0.06 / $0.40 | 202.8K |
| 120 | 80-155 | Z.ai | MIT | 1404±22 | 717 | $0.30 / $0.90 | 131.1K |
| 121 | 107-141 | Anthropic | Proprietary | 1404±7 | 8,880 | $3 / $15 | 200K |
| 122 | 109-141 | Anthropic | Proprietary | 1403±5 | 15,843 | $3 / $15 | 200K |
| 123 | 108-142 | Z.ai | MIT | 1403±7 | 7,715 | $0.13 / $0.85 | 131.1K |
| 124 | 107-143 | Alibaba | Apache 2.0 | 1403±8 | 5,774 | $0.46 / $1.82 | 131.1K |
| 125 | 95-153 | StepFun | Apache 2.0 | 1401±15 | 1,568 | $0.57 / $1.42 | 65.5K |
| 126 | 99-155 | Prime Intellect | MIT | 1399±15 | 1,436 | $0.20 / $1.10 | 131.1K |
| 127 | 92-158 | Tencent | Proprietary | 1399±19 | 937 | N/A | N/A |
| 128 | 116-145 | | | 1397±6 | 12,598 | $0.10 / $0.40 | 1M |
| 129 | 112-153 | Mistral | Apache 2.0 | 1397±9 | 4,101 | $0.10 / $0.30 | 32K |
| 130 | 85-168 | Nvidia | Nvidia Open Model | 1396±27 | 448 | $0.60 / $1.80 | 131.1K |
| 131 | 112-153 | Alibaba | Apache 2.0 | 1396±10 | 3,526 | $0.10 / $0.78 | 131.1K |
| 132 | 103-158 | Z.ai | MIT | 1396±16 | 1,241 | $0.60 / $1.80 | 65.5K |
| 133 | 112-153 | | | 1396±11 | 3,004 | N/A | N/A |
| 134 | 117-153 | MiniMax | Apache 2.0 | 1395±7 | 8,220 | $0.40 / $2.20 | 1M |
| 135 | 111-157 | Nvidia | NVIDIA Open Model | 1394±13 | 1,999 | N/A | N/A |
| 136 | 113-159 | Tencent | Proprietary | 1392±13 | 2,224 | N/A | N/A |
| 137 | 115-159 | Ant Group | MIT | 1389±13 | 1,850 | N/A | N/A |
| 138 | 117-160 | MiniMax | Apache 2.0 | 1388±13 | 1,923 | $0.26 / $1 | 196.6K |
| 139 | 122-158 | Alibaba | Proprietary | 1387±8 | 6,032 | N/A | N/A |
| 140 | 95-180 | | | 1387±29 | 344 | N/A | N/A |
| 141 | 125-160 | | | 1383±7 | 7,587 | $0.10 / $0.40 | 1M |
| 142 | 125-159 | OpenAI | Proprietary | 1383±6 | 11,318 | $1.10 / $4.40 | 200K |
| 143 | 124-161 | xAI | Proprietary | 1383±9 | 4,079 | $0.30 / $0.50 | 131.1K |
| 144 | 108-185 | Tencent | Proprietary | 1382±28 | 367 | N/A | N/A |
| 145 | 125-161 | xAI | Proprietary | 1382±8 | 5,331 | $0.30 / $0.50 | 131.1K |
| 146 | 116-176 | Alibaba | Apache 2.0 | 1382±21 | 729 | $0.08 / $0.24 | 41K |
| 147 | 124-169 | Ant Group | MIT | 1380±13 | 1,884 | N/A | N/A |
| 148 | 125-168 | Amazon | Proprietary | 1379±11 | 3,206 | $0.30 / $2.50 | 1M |
| 149 | 132-167 | Cohere | CC-BY-NC-4.0 | 1376±6 | 12,799 | $2.50 / $10 | 256K |
| 150 | 132-168 | OpenAI | Proprietary | 1375±7 | 8,369 | $1.10 / $4.40 | N/A |
| 151 | 130-172 | Ai2 | Apache 2.0 | 1374±11 | 3,045 | $0.20 / $0.60 | 65.5K |
| 152 | 133-172 | Alibaba | Apache 2.0 | 1372±8 | 5,092 | $0.15 / $0.58 | 131.1K |
| 153 | 125-187 | Inception AI | Proprietary | 1371±20 | 862 | $0.25 / $0.75 | 128K |
| 154 | 125-188 | | | 1370±21 | 718 | $0.10 / $0.40 | 131.1K |
| 155 | 125-188 | | | 1370±20 | 786 | N/A | N/A |
| 156 | 139-172 | Google | Gemma | 1369±6 | 9,974 | $0.08 / $0.16 | 131.1K |
| 157 | 121-193 | Tencent | Proprietary | 1369±28 | 343 | N/A | N/A |
| 158 | 130-188 | Alibaba | Proprietary | 1367±17 | 964 | $0.40 / $1.20 | 131.1K |
| 159 | 141-177 | OpenAI | Apache 2.0 | 1367±7 | 7,690 | $0.04 / $0.19 | 131.1K |
| 160 | 143-177 | Google | Proprietary | 1366±7 | 8,572 | $0.10 / $0.40 | 1M |
| 161 | 136-187 | OpenAI | Proprietary | 1366±13 | 1,984 | $0.05 / $0.40 | 400K |
| 162 | 143-179 | Anthropic | Proprietary | 1365±7 | 14,269 | $3 / $15 | 200K |
| 163 | 144-186 | Alibaba | Apache 2.0 | 1362±8 | 5,847 | $0.08 / $0.28 | 41K |
| 164 | 143-187 | DeepSeek | DeepSeek | 1362±10 | 3,637 | $1.14 / $4.56 | N/A |
| 165 | 143-187 | 01 AI | Proprietary | 1360±10 | 3,939 | N/A | N/A |
| 166 | 147-187 | Meta | Llama 3.1 Community | 1360±7 | 6,791 | $4 / $4 | 32.8K |
| 167 | 143-193 | Ai2 | Apache 2.0 | 1357±15 | 1,513 | $0.15 / $0.50 | 65.5K |
| 168 | 151-188 | Anthropic | Proprietary | 1355±6 | 13,467 | $0.80 / $4 | 200K |
| 169 | 152-188 | Google | Proprietary | 1353±7 | 9,056 | $3.50 / $10.50 | 2.1M |
| 170 | 151-190 | | | 1353±9 | 4,113 | $0.07 / $0.30 | 1M |
| 171 | 148-193 | Mistral | Proprietary | 1353±11 | 2,794 | $2 / $5 | 40K |
| 172 | 151-192 | Nvidia | NVIDIA Open Model | 1352±10 | 3,903 | $0.06 / $0.24 | 262.1K |
| 173 | 148-200 | StepFun | Proprietary | 1352±13 | 1,902 | N/A | N/A |
| 174 | 154-190 | Meta | Llama 3.1 Community | 1352±7 | 9,964 | $4 / $4 | 32.8K |
| 175 | 156-190 | | | 1351±7 | 8,749 | $0.63 / $1.80 | 131.1K |
| 176 | 156-193 | | | 1350±8 | 6,690 | $0.40 / $0.70 | 8.2K |
| 177 | 157-193 | OpenAI | Proprietary | 1348±6 | 19,579 | $5 / $15 | 128K |
| 178 | 125-224 | Ai2 | Apache 2.0 | 1348±41 | 217 | $0.20 / $0.20 | 36.9K |
| 179 | 148-210 | StepFun | Proprietary | 1347±19 | 786 | N/A | N/A |
| 180 | 156-200 | NexusFlow | NexusFlow | 1346±9 | 4,039 | N/A | N/A |
| 181 | 154-207 | Ai2 | Apache 2.0 | 1346±13 | 2,066 | $0.15 / $0.50 | 65.5K |
| 182 | 143-219 | Inception AI | Proprietary | 1344±26 | 528 | $0.25 / $0.75 | 128K |
| 183 | 156-208 | Alibaba | Proprietary | 1344±13 | 1,713 | N/A | N/A |
| 184 | 151-219 | Tencent | Proprietary | 1340±23 | 601 | N/A | N/A |
| 185 | 156-215 | OpenAI | Proprietary | 1339±17 | 1,062 | $0.10 / $0.40 | 1M |
| 186 | 163-212 | OpenAI | Apache 2.0 | 1338±12 | 2,454 | $0.03 / $0.14 | 131.1K |
| 187 | 168-209 | OpenAI | Proprietary | 1338±8 | 8,010 | $2.50 / $10 | 128K |
| 188 | 158-217 | DeepSeek | DeepSeek | 1337±16 | 1,132 | N/A | N/A |
| 189 | 155-219 | Google | Gemma | 1337±21 | 700 | $0.04 / $0.13 | 131.1K |
| 190 | 171-208 | Meta | Llama-3.3 | 1337±6 | 10,119 | $0.10 / $0.32 | 131.1K |
| 191 | 172-210 | | | 1335±7 | 7,673 | $0.10 / $0.30 | 32K |
| 192 | 177-212 | Mistral | Mistral Research | 1333±8 | 7,818 | $2 / $6 | 131.1K |
| 193 | 179-213 | xAI | Proprietary | 1330±7 | 10,486 | $2 / $10 | 131.1K |
| 194 | 179-213 | Google | Proprietary | 1330±7 | 13,552 | $3.50 / $10.50 | 2.1M |
| 195 | 177-218 | DeepSeek | DeepSeek | 1330±9 | 4,055 | N/A | N/A |
| 196 | 177-219 | Alibaba | Qwen | 1329±11 | 2,531 | $1.60 / $6.40 | 32.8K |
| 197 | 179-215 | OpenAI | Proprietary | 1329±7 | 17,922 | $10 / $30 | 128K |
| 198 | 179-219 | Google | Proprietary | 1327±9 | 8,405 | N/A | N/A |
| 199 | 179-218 | Alibaba | Qwen | 1327±8 | 6,208 | $1.20 / $1.20 | N/A |
| 200 | 179-219 | Zhipu AI | Proprietary | 1327±10 | 4,016 | $0.44 / $1.76 | 204.8K |
| 201 | 171-221 | | | 1327±17 | 1,136 | $1.20 / $1.20 | 131.1K |
| 202 | 177-221 | IBM | Apache 2.0 | 1326±15 | 1,609 | N/A | N/A |
| 203 | 182-218 | Anthropic | Proprietary | 1325±6 | 34,293 | $15 / $75 | 200K |
| 204 | 179-220 | NexusFlow | CC-BY-NC-4.0 | 1324±10 | 3,605 | N/A | N/A |
| 205 | 177-225 | Zhipu | Proprietary | 1324±18 | 955 | N/A | N/A |
| 206 | 183-219 | OpenAI | Proprietary | 1323±7 | 18,316 | $10 / $30 | 128K |
| 207 | 180-220 | Mistral | MRL | 1323±9 | 4,452 | $2 / $6 | 131.1K |
| 208 | 168-228 | Tencent | Proprietary | 1323±24 | 587 | N/A | N/A |
| 209 | 177-225 | Tencent | Proprietary | 1322±17 | 1,207 | N/A | N/A |
| 210 | 185-220 | OpenAI | Proprietary | 1321±6 | 11,380 | $0.15 / $0.60 | 128K |
| 211 | 177-228 | Alibaba | Apache 2.0 | 1319±20 | 759 | $0.87 / $0.87 | 32K |
| 212 | 187-221 | OpenAI | Proprietary | 1318±7 | 16,792 | $10 / $30 | 128K |
| 213 | 185-222 | Google | Gemma | 1318±9 | 4,566 | $0.02 / $0.04 | 32.8K |
| 214 | 189-223 | xAI | Proprietary | 1315±7 | 8,664 | $2 / $10 | 131.1K |
| 215 | 189-226 | Amazon | Proprietary | 1314±9 | 4,069 | $0.80 / $3.20 | 300K |
| 216 | 191-223 | Meta | Llama 3.1 Community | 1314±7 | 9,133 | $0.40 / $0.40 | 131.1K |
| 217 | 192-226 | Google | Proprietary | 1313±8 | 5,484 | $0.07 / $0.30 | 1M |
| 218 | 191-226 | OpenAI | Proprietary | 1312±9 | 9,838 | $30 / $60 | 8.2K |
| 219 | 202-227 | Meta | Llama 3 Community | 1309±7 | 29,608 | $0.51 / $0.74 | 8.2K |
| 220 | 211-236 | OpenAI | Proprietary | 1299±8 | 16,244 | $30 / $60 | 8.2K |
| 221 | 208-239 | DeepSeek | DeepSeek License | 1297±12 | 2,699 | $0.14 / $0.28 | 128K |
| 222 | 195-248 | Ai2 | Apache-2.0 | 1295±24 | 538 | $0.05 / $0.20 | 128K |
| 223 | 212-241 | Mistral | Apache 2.0 | 1294±12 | 2,355 | $0.05 / $0.08 | 32.8K |
| 224 | 209-242 | AI21 Labs | Jamba Open | 1294±14 | 1,528 | $2 / $8 | 256K |
| 225 | 218-239 | Google | Proprietary | 1292±7 | 10,974 | $0.07 / $0.30 | 1M |
| 226 | 205-250 | Ai2 | Llama 3.1 | 1290±23 | 499 | N/A | N/A |
| 227 | 220-242 | Microsoft | MIT | 1289±9 | 3,804 | $0.07 / $0.14 | 16.4K |
| 228 | 220-242 | Google | Gemma license | 1287±6 | 12,606 | $0.65 / $0.65 | 8.2K |
| 229 | 220-243 | Anthropic | Proprietary | 1284±7 | 19,193 | $3 / $15 | 200K |
| 230 | 220-248 | Zhipu AI | Proprietary | 1284±14 | 1,775 | N/A | N/A |
| 231 | 217-252 | Google | Gemma | 1283±21 | 764 | $0.04 / $0.08 | 131.1K |
| 232 | 220-245 | Alibaba | Qianwen LICENSE | 1283±9 | 6,483 | $0.90 / $0.90 | 32.8K |
| 233 | 220-250 | Reka AI | Proprietary | 1281±14 | 1,341 | N/A | N/A |
| 234 | 221-250 | Nvidia | NVIDIA Open Model | 1279±11 | 3,257 | N/A | N/A |
| 235 | 220-251 | Princeton | MIT | 1278±14 | 1,661 | $0.03 / $0.09 | 8.2K |
| 236 | 221-250 | Amazon | Proprietary | 1277±10 | 3,136 | $0.06 / $0.24 | 300K |
| 237 | 214-264 | Tencent | Proprietary | 1277±28 | 370 | N/A | N/A |
| 238 | 220-257 | | | 1276±22 | 578 | N/A | N/A |
| 239 | 223-250 | Cohere | CC-BY-NC-4.0 | 1275±9 | 4,272 | N/A | N/A |
| 240 | 227-252 | Mistral | Proprietary | 1271±8 | 11,296 | $4 / $12 | 32K |
| 241 | 223-257 | Reka AI | Proprietary | 1269±14 | 1,389 | N/A | N/A |
| 242 | 224-257 | Cohere | CC-BY-NC-4.0 | 1269±13 | 1,735 | $2.50 / $10 | 128K |
| 243 | 228-254 | Anthropic | Proprietary | 1268±7 | 20,673 | $0.25 / $1.25 | 200K |
| 244 | 221-269 | Mistral | MRL | 1266±20 | 703 | $0.10 / $0.10 | 131.1K |
| 245 | 229-257 | Google | Proprietary | 1266±8 | 5,581 | $0.07 / $0.30 | 1M |
| 246 | 229-259 | Alibaba | Qianwen LICENSE | 1263±10 | 4,572 | N/A | N/A |
| 247 | 228-266 | Cohere | CC-BY-NC-4.0 | 1261±13 | 1,822 | $0.15 / $0.60 | 128K |
| 248 | 231-257 | Google | Gemma license | 1261±7 | 9,094 | $0.03 / $0.09 | 8.2K |
| 249 | 237-266 | Cohere | CC-BY-NC-4.0 | 1256±8 | 13,980 | $2.50 / $10 | 128K |
| 250 | 236-270 | Amazon | Proprietary | 1255±10 | 3,050 | $0.04 / $0.14 | 128K |
| 251 | 231-272 | InternLM | Other | 1255±15 | 1,456 | $0 / $0 | 32.8K |
| 252 | 239-270 | Mistral | Apache 2.0 | 1253±8 | 9,466 | $0.90 / $0.90 | 65.5K |
| 253 | 240-272 | Alibaba | Qianwen LICENSE | 1250±9 | 7,597 | N/A | N/A |
| 254 | 229-275 | IBM | Apache 2.0 | 1248±24 | 517 | N/A | N/A |
| 255 | 239-274 | AI21 Labs | Jamba Open | 1247±14 | 1,545 | $0.20 / $0.40 | 256K |
| 256 | 246-273 | Meta | Llama 3 Community | 1243±7 | 19,891 | $0.03 / $0.04 | 8.2K |
| 257 | 245-275 | Reka AI | Proprietary | 1241±13 | 2,928 | N/A | N/A |
| 258 | 246-274 | Mistral | Proprietary | 1241±10 | 6,398 | $2.70 / $8.10 | 32K |
| 259 | 246-274 | 01 AI | Apache-2.0 | 1240±10 | 3,931 | N/A | N/A |
| 260 | 249-274 | Meta | Llama 3.1 Community | 1240±8 | 8,195 | $0.02 / $0.05 | 16.4K |
| 261 | 240-279 | Google | Proprietary | 1239±19 | 1,189 | $0.35 / $1.05 | 32.8K |
| 262 | 246-275 | OpenAI | Proprietary | 1238±14 | 3,150 | $1 / $2 | 16.4K |
| 263 | 247-275 | Reka AI | Proprietary | 1238±11 | 4,796 | N/A | N/A |
| 264 | 250-274 | OpenAI | Proprietary | 1238±8 | 12,416 | $0.50 / $1.50 | 16.4K |
| 265 | 240-285 | IBM | Apache 2.0 | 1236±24 | 511 | N/A | N/A |
| 266 | 249-275 | Databricks | DBRX LICENSE | 1235±11 | 5,516 | $0.60 / $0.60 | 32.8K |
| 267 | 249-278 | Google | Proprietary | 1234±13 | 3,298 | $0.35 / $1.05 | 32.8K |
| 268 | 252-277 | Alibaba | Qianwen LICENSE | 1234±11 | 3,830 | N/A | N/A |
| 269 | 245-283 | IBM | Apache 2.0 | 1233±20 | 900 | N/A | N/A |
| 270 | 252-282 | Cohere | CC-BY-NC-4.0 | 1230±14 | 1,620 | N/A | N/A |
| 271 | 247-290 | HuggingFace | Apache 2.0 | 1228±21 | 820 | N/A | N/A |
| 272 | 254-281 | Microsoft | MIT | 1227±10 | 4,087 | $0.17 / $0.68 | N/A |
| 273 | 255-279 | Mistral | Apache 2.0 | 1227±8 | 13,360 | $0.63 / $0.63 | 32K |
| 274 | 246-293 | Ai2 | Llama 3.1 | 1225±26 | 452 | N/A | N/A |
| 275 | 260-284 | Cohere | CC-BY-NC-4.0 | 1221±9 | 9,264 | $0.15 / $0.60 | 128K |
| 276 | 265-293 | Microsoft | MIT | 1211±12 | 3,070 | $0.15 / $0.60 | N/A |
| 277 | 265-296 | Meta | Llama 3.2 | 1207±15 | 1,377 | $0.05 / $0.34 | 80K |
| 278 | 267-295 | Alibaba | Qianwen LICENSE | 1207±13 | 3,024 | $0.30 / $0.30 | N/A |
| 279 | 270-295 | Google | Gemma license | 1206±10 | 4,180 | $0.03 / $0.09 | 8.2K |
| 280 | 269-296 | Nexusflow | Apache-2.0 | 1206±13 | 2,864 | N/A | N/A |
| 281 | 266-299 | AllenAI/UW | AI2 ImpACT Low-risk | 1204±18 | 1,200 | N/A | N/A |
| 282 | 274-296 | Snowflake | Apache 2.0 | 1200±11 | 6,488 | N/A | N/A |
| 283 | 272-299 | | | 1200±13 | 2,028 | $0.13 / $0.52 | 4.1K |
| 284 | 274-301 | 01 AI | Yi License | 1196±12 | 2,844 | $0.90 / $0.90 | 4.1K |
| 285 | 274-304 | OpenChat | Apache-2.0 | 1195±13 | 2,468 | N/A | N/A |
| 286 | 275-299 | Google | Gemma license | 1194±8 | 7,586 | N/A | N/A |
| 287 | 273-305 | NousResearch | Apache-2.0 | 1193±19 | 917 | $0.17 / $0.17 | N/A |
| 288 | 274-307 | IBM | Apache 2.0 | 1191±19 | 974 | N/A | N/A |
| 289 | 274-309 | DeepSeek | DeepSeek License | 1188±20 | 946 | N/A | N/A |
| 290 | 271-314 | MosaicML | CC-BY-NC-SA-4.0 | 1185±30 | 359 | N/A | N/A |
| 291 | 275-307 | Microsoft | Llama 2 Community | 1185±16 | 1,475 | N/A | N/A |
| 292 | 277-307 | UC Berkeley | CC-BY-NC-4.0 | 1184±14 | 1,894 | N/A | N/A |
| 293 | 269-319 | Meta | Llama 2 Community | 1181±37 | 207 | $0.70 / $2.80 | 16.4K |
| 294 | 279-312 | OpenChat | Apache-2.0 | 1177±17 | 1,389 | $0.20 / $0.20 | N/A |
| 295 | 275-315 | Alibaba | Apache 2.0 | 1177±25 | 519 | $0.15 / $0.58 | 131.1K |
| 296 | 282-310 | Microsoft | MIT | 1176±12 | 3,447 | $0.13 / $0.52 | N/A |
| 297 | 286-310 | Meta | Llama 2 Community | 1174±9 | 6,989 | $0.70 / $2.80 | 4.1K |
| 298 | 285-312 | Mistral | Apache-2.0 | 1173±11 | 3,709 | $0.20 / $0.20 | 32.8K |
| 299 | 285-312 | LMSYS | Non-commercial | 1173±11 | 3,964 | $0 / $0 | 2K |
| 300 | 286-315 | Google | Gemma license | 1167±15 | 1,588 | $0.05 / $0.08 | 8.2K |
| 301 | 282-319 | Upstage AI | CC-BY-NC-4.0 | 1167±22 | 765 | $0.30 / $0.30 | N/A |
| 302 | 286-315 | Google | Proprietary | 1166±17 | 1,366 | $0.50 / $0.50 | 25.8K |
| 303 | 277-320 | Cognitive Computations | Apache-2.0 | 1165±31 | 286 | $0.50 / $0.50 | 16.4K |
| 304 | 282-323 | HuggingFace | Apache 2.0 | 1157±32 | 334 | N/A | N/A |
| 305 | 288-320 | Alibaba | Qianwen LICENSE | 1156±18 | 1,019 | $0.20 / $0.20 | N/A |
| 306 | 292-319 | Meta | Llama 2 Community | 1155±12 | 3,347 | $0.25 / $0.25 | 4.1K |
| 307 | 287-321 | Nvidia | Llama 2 Community | 1154±23 | 645 | N/A | N/A |
| 308 | 291-320 | Meta | Llama 3.2 | 1153±15 | 1,437 | $0.03 / $0.20 | 60K |
| 309 | 288-321 | Alibaba | Qianwen LICENSE | 1151±21 | 830 | N/A | N/A |
| 310 | 294-320 | Microsoft | MIT | 1151±13 | 4,149 | $0.13 / $0.52 | N/A |
| 311 | 294-320 | Google | Gemma license | 1148±14 | 1,941 | N/A | N/A |
| 312 | 297-322 | Meta | Llama 2 Community | 1144±16 | 1,278 | $0.35 / $1.40 | 16.4K |
| 313 | 297-323 | NousResearch | Apache-2.0 | 1141±19 | 929 | $0.90 / $0.90 | N/A |
| 314 | 298-322 | LMSYS | Llama 2 Community | 1140±12 | 3,220 | $0.30 / $0.30 | N/A |
| 315 | 291-327 | HuggingFace | MIT | 1135±33 | 278 | N/A | N/A |
| 316 | 301-323 | HuggingFace | MIT | 1134±16 | 1,847 | $0.15 / $0.15 | 16.4K |
| 317 | 301-324 | Mistral | Apache 2.0 | 1130±17 | 1,616 | $0.07 / $0.28 | 4.1K |
| 318 | 301-325 | Microsoft | Llama 2 Community | 1127±18 | 1,066 | $0.30 / $0.30 | N/A |
| 319 | 304-326 | Together AI | Apache 2.0 | 1121±18 | 1,029 | $0.20 / $0.20 | N/A |
| 320 | 301-327 | UW | Non-commercial | 1117±28 | 434 | N/A | N/A |
| 321 | 313-327 | Meta | Llama 2 Community | 1114±12 | 2,560 | $0.15 / $0.15 | 4.1K |
| 322 | 309-327 | Google | Gemma license | 1113±20 | 832 | $0.10 / $0.10 | N/A |
| 323 | 311-327 | LMSYS | Llama 2 Community | 1111±19 | 1,066 | $0.20 / $0.20 | N/A |
| 324 | 317-327 | Alibaba | Qianwen LICENSE | 1097±16 | 1,419 | $0.10 / $0.10 | N/A |
| 325 | 318-329 | Ai2 | Apache-2.0 | 1084±19 | 1,076 | $0.20 / $0.20 | N/A |
| 326 | 319-330 | Tsinghua | Apache-2.0 | 1081±21 | 835 | N/A | N/A |
| 327 | 316-331 | Nomic AI | Non-commercial | 1080±34 | 287 | N/A | N/A |
| 328 | 325-334 | Tsinghua | Apache-2.0 | 1046±29 | 408 | N/A | N/A |
| 329 | 326-333 | UC Berkeley | Non-commercial | 1042±20 | 1,100 | N/A | N/A |
| 330 | 325-334 | MosaicML | CC-BY-NC-SA-4.0 | 1042±24 | 626 | N/A | N/A |
| 331 | 327-334 | OpenAssistant | Apache 2.0 | 1031±21 | 1,002 | N/A | N/A |
| 332 | 328-335 | RWKV | Apache 2.0 | 1019±23 | 771 | N/A | N/A |
| 333 | 328-335 | Stanford | Non-commercial | 1018±22 | 899 | N/A | N/A |
| 334 | 329-336 | Tsinghua | Non-commercial | 996±24 | 732 | N/A | N/A |
| 335 | 332-338 | Databricks | MIT | 970±27 | 523 | N/A | N/A |
| 336 | 334-338 | Stability AI | CC-BY-NC-SA-4.0 | 955±27 | 497 | N/A | N/A |
| 337 | 335-338 | LMSYS | Apache 2.0 | 944±23 | 690 | N/A | N/A |
| 338 | 335-338 | Meta | Non-commercial | 915±34 | 371 | $0.23 / $0.23 | N/A |

Default Leaderboard Plots

Battle Count for Each Combination of Models (without Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)
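
The last two plot titles describe quantities that can be derived from raw battle records. The following is a minimal Python sketch, not the leaderboard's actual code. It assumes a hypothetical record format `(model_a, model_b, winner)` with `winner` being `"a"`, `"b"`, or `"tie"`. It also reads "uniform sampling" as weighting every opponent equally, which is an interpretation of the title rather than something the page states.

```python
from collections import defaultdict


def win_fractions(battles):
    """Fraction of non-tied battles each model wins against each opponent.

    `battles` is an iterable of (model_a, model_b, winner) tuples; this
    record layout is hypothetical. Ties are skipped entirely.
    Returns {(model, opponent): fraction of non-tied battles won by model}.
    """
    wins = defaultdict(int)
    totals = defaultdict(int)
    for model_a, model_b, winner in battles:
        if winner not in ("a", "b"):
            continue  # drop ties and any unrecognized outcome
        totals[(model_a, model_b)] += 1
        totals[(model_b, model_a)] += 1
        if winner == "a":
            wins[(model_a, model_b)] += 1
        else:
            wins[(model_b, model_a)] += 1
    return {pair: wins[pair] / count for pair, count in totals.items()}


def average_win_rate(fractions):
    """Unweighted mean of a model's win fractions over the opponents it faced."""
    per_model = defaultdict(list)
    for (model, _opponent), fraction in fractions.items():
        per_model[model].append(fraction)
    return {model: sum(vals) / len(vals) for model, vals in per_model.items()}


# Toy data with made-up model names, for illustration only.
battles = [
    ("x", "y", "a"),
    ("x", "y", "a"),
    ("y", "x", "a"),
    ("x", "z", "tie"),
    ("x", "z", "b"),
]
fractions = win_fractions(battles)
averages = average_win_rate(fractions)
```

Averaging per-opponent fractions, rather than pooling all battles, keeps a model from looking stronger simply because it was paired more often with weak opponents.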