Text Arena🌶️Hard Prompts

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Jun 4, 2026
2,764,408 votes
363 models
Rank Spread
1
13
Anthropic
Anthropic · Proprietary
1534±5
23,384$5 / $251M
2
13
Anthropic
Anthropic · Proprietary
1528±5
25,560$5 / $251M
3
14
Anthropic
Anthropic · Proprietary
1523±6
16,079$5 / $251M
4
36
Anthropic
Anthropic · Proprietary
1517±6
16,696$5 / $251M
5
413
Google · Proprietary
1509±5
30,796$2 / $121M
6
415
Meta
Meta · Proprietary
1507±7
8,161N/AN/A
7
517
Google · Proprietary
1504±5
22,460$2 / $121M
8
520
Anthropic
Anthropic · Proprietary
1503±5
19,705$3 / $151M
9
520
OpenAI · Proprietary
1502±5
20,466$2.50 / $151.1M
10
525
Z.ai · MIT
1500±7
9,648$1.40 / $4.40202.8K
11
524
OpenAI · Proprietary
1500±6
13,095$5 / $301.1M
12
522
Anthropic
1499±5
19,776$5 / $25200K
13
624
Anthropic
Anthropic · Proprietary
1498±4
40,786$5 / $25200K
14
627
OpenAI · Proprietary
1497±5
20,854$1.75 / $14128K
15
536
Alibaba · Proprietary
1495±12
2,556$1.25 / $3.751M
16
730
Xiaomi · MIT
1493±6
12,688$0.43 / $0.871M
17
830
OpenAI · Proprietary
1493±6
13,760$5 / $301.1M
18
830
Google · Proprietary
1493±6
16,568$0.50 / $31M
19
830
Alibaba · Proprietary
1492±6
13,164N/AN/A
20
734
Google · Proprietary
1492±8
6,395$1.50 / $91M
21
1036
Baidu · Proprietary
1489±7
11,682N/AN/A
22
1035
OpenAI · Proprietary
1489±6
17,461$5 / $301.1M
23
1034
1489±5
21,571$2 / $62M
24
1135
OpenAI · Proprietary
1488±5
22,109$2.50 / $151.1M
25
1340
Moonshot · Modified MIT
1487±6
12,246$0.95 / $4262.1K
26
1535
Anthropic
1487±4
46,184$3 / $15200K
27
1440
xAI · Proprietary
1486±6
15,740N/AN/A
28
1540
1486±5
20,907$2 / $62M
29
1540
Anthropic
Anthropic · Proprietary
1483±4
45,833$3 / $15200K
30
1449
Alibaba · Proprietary
1482±11
3,184$1.04 / $6.24262.1K
31
1946
Bytedance
Bytedance · Proprietary
1480±5
26,116N/AN/A
32
1946
Anthropic
1480±5
24,734$15 / $75200K
33
1946
1479±6
13,016$0.43 / $0.871M
34
1946
Z.ai · MIT
1479±6
13,557$1 / $3.20202.8K
35
2646
xAI · Proprietary
1478±4
36,672N/AN/A
36
2646
Anthropic
Anthropic · Proprietary
1477±4
38,917$15 / $75200K
37
2646
1477±4
34,762$0.50 / $31M
38
2350
DeepSeek · MIT
1477±6
13,690$0.43 / $0.871M
39
2650
Xiaomi · Proprietary
1476±6
15,037$1 / $31M
40
3053
OpenAI · Proprietary
1474±5
21,746$1.25 / $10400K
41
3053
xAI · Proprietary
1474±4
37,541N/AN/A
42
3054
OpenAI · Proprietary
1473±5
19,907$0.75 / $4.50400K
43
2159
Google · Apache 2.0
1473±10
3,354$0.14 / $0.40262.1K
44
3054
Moonshot · Modified MIT
1473±5
24,608$0.60 / $3N/A
45
3054
OpenAI · Proprietary
1472±5
19,812$1.75 / $14128K
46
3073
MiniMax · Proprietary
1466±12
2,887$0.60 / $2.40N/A
47
3768
Alibaba · Proprietary
1466±6
14,605$0.33 / $1.951M
48
4066
Alibaba · Apache 2.0
1465±5
22,510$0.39 / $2.34262.1K
49
4066
Baidu · Proprietary
1465±5
20,454N/AN/A
50
3773
Baidu · Proprietary
1463±8
5,205N/AN/A
51
3872
Z.ai · MIT
1463±8
6,608$0.40 / $1.75202.8K
52
4272
Xiaomi · MIT
1462±6
13,260$0.14 / $0.281M
53
3779
Google · Apache 2.0
1461±10
3,266N/AN/A
54
4076
Moonshot · Modified MIT
1461±9
4,537$0.40 / $1.90262.1K
55
4572
OpenAI · Proprietary
1460±4
30,999$1.75 / $14400K
56
4572
OpenAI · Proprietary
1460±5
27,809$1.75 / $14400K
57
4575
Alibaba · Proprietary
1459±6
13,565$0.78 / $3.90262.1K
58
4576
Meituan · Proprietary
1459±6
17,239N/AN/A
59
4673
Google · Proprietary
1458±3
63,839$1.25 / $101M
60
4577
xAI · Proprietary
1458±6
13,210$1.25 / $2.501M
61
4679
DeepSeek · MIT
1457±6
13,661$0.10 / $0.201M
62
4682
1456±6
13,536$0.10 / $0.201M
63
4976
OpenAI · Proprietary
1456±4
38,330$5 / $15128K
64
4682
Anthropic
Anthropic · Proprietary
1456±6
16,393$15 / $75200K
65
4880
OpenAI · Proprietary
1455±5
23,601$1.25 / $10400K
66
4591
Xiaomi · Proprietary
1453±10
3,967$0.40 / $2262.1K
67
4984
Moonshot · Modified MIT
1453±4
35,062$1.15 / $8262.1K
68
46103
1449±13
1,935N/AN/A
69
5691
OpenAI · Proprietary
1448±6
15,221$1.25 / $10128K
70
6190
Alibaba · Apache 2.0
1448±3
51,577$0.26 / $1.06N/A
71
4995
Alibaba · Proprietary
1448±8
4,871$0.78 / $3.90262.1K
72
4995
DeepSeek · MIT
1448±8
6,468$0.27 / $0.41163.8K
73
46105
1448±13
1,853N/AN/A
74
6192
DeepSeek · MIT
1447±5
25,741$0.23 / $0.34131.1K
75
5398
1446±9
4,701$0.27 / $0.41163.8K
76
6095
OpenAI · Proprietary
1446±6
14,988$1.25 / $10400K
77
48108
1446±14
1,679$0.27 / $0.95163.8K
78
6493
DeepSeek · MIT
1446±5
22,072$0.23 / $0.34131.1K
79
6595
Google · Proprietary
1445±5
25,089$0.25 / $1.501M
80
56105
Mistral · Modified MIT
1444±10
3,906$1.50 / $7.50262.1K
81
67100
Z.ai · MIT
1442±5
19,087$0.43 / $1.74202.8K
82
67100
xAI · Proprietary
1441±4
31,825$0.20 / $0.502M
83
64108
Tencent
Tencent · tencent-hunyuan-community
1441±10
4,055$0.29 / $1.17262.1K
84
63109
OpenAI · Proprietary
1441±10
3,445$75 / $150128K
85
66108
Alibaba · Apache 2.0
1440±9
5,717$0.20 / $0.88262.1K
86
67104
OpenAI · Proprietary
1440±5
25,754$2 / $8200K
87
67105
MiniMax · Modified MIT
1439±6
17,397$0.28 / $1.20204.8K
88
70104
Anthropic
Anthropic · Proprietary
1439±4
47,551$1 / $5200K
89
57120
1438±15
1,548N/AN/A
90
67109
DeepSeek · MIT
1437±9
5,154$1.23 / $4.94N/A
91
67110
Moonshot · Modified MIT
1436±8
5,583$0.60 / $2.50262.1K
92
72109
Anthropic
Anthropic · Proprietary
1435±5
19,167$15 / $75200K
93
71115
DeepSeek · MIT
1434±8
6,957$0.50 / $2.15163.8K
94
68120
xAI · Proprietary
1433±11
3,166$3 / $15256K
95
72118
DeepSeek · MIT
1433±8
6,811$1.23 / $4.94N/A
96
76115
Z.ai · MIT
1433±6
11,207$0.60 / $2.20131.1K
97
77115
Alibaba · Apache 2.0
1432±5
17,096$0.26 / $2.08262.1K
98
79114
Mistral · Apache 2.0
1432±5
23,989$0.50 / $1.50N/A
99
77118
Moonshot · Modified MIT
1431±6
12,224$0.60 / $2.50131.1K
100
80115
OpenAI · Proprietary
1431±5
22,169$2 / $81M
101
79118
Anthropic
1431±6
15,623$3 / $151M
102
85115
Mistral · Proprietary
1429±3
50,847$2.70 / $8.1032K
103
76129
Baidu · Proprietary
1428±11
2,532N/AN/A
104
85121
Alibaba · Apache 2.0
1427±5
16,427$0.20 / $1.56262.1K
105
82128
Meituan · MIT
1427±8
5,585$0.20 / $0.80131.1K
106
85125
xAI · Proprietary
1426±6
10,695$3 / $15131.1K
107
79134
DeepSeek · MIT
1424±13
1,928$0.27 / $0.95163.8K
108
92129
OpenAI · Proprietary
1422±6
19,345$0.20 / $1.25400K
109
76139
Tencent
Tencent · Proprietary
1421±17
1,093N/AN/A
110
93129
Alibaba · Apache 2.0
1421±6
16,581$0.46 / $1.82131.1K
111
92129
Alibaba · Apache 2.0
1421±6
11,901$0.09 / $1.10262.1K
112
98129
1420±5
17,499$0.30 / $2.501M
113
98129
xAI · Proprietary
1420±5
19,657$3 / $15256K
114
101129
Google · Proprietary
1420±3
62,996$0.30 / $2.501M
115
91135
Alibaba · Apache 2.0
1419±9
4,038$0.26 / $2.60131.1K
116
92135
DeepSeek · MIT
1418±9
4,116$0.70 / $2.50163.8K
117
92136
Alibaba · Apache 2.0
1418±9
3,847$0.10 / $0.10262.1K
118
98135
OpenAI · Proprietary
1418±8
6,453$15 / $60200K
119
101133
Anthropic
Anthropic · Proprietary
1418±6
17,674$3 / $151M
120
88139
1417±13
1,858N/AN/A
121
103134
Anthropic
1416±6
13,859$3 / $15200K
122
104135
MiniMax · Modified MIT
1415±5
24,859$0.15 / $1.15204.8K
123
104135
Alibaba · Apache 2.0
1415±5
17,644$0.14 / $1262.1K
124
105135
1414±5
27,116$0.10 / $0.30262.1K
125
104138
Alibaba · Apache 2.0
1414±6
11,653$0.40 / $1.60262.1K
126
105137
Alibaba · Proprietary
1413±5
21,022N/AN/A
127
106137
Stepfun
StepFun · Apache 2.0
1413±5
22,885$0.09 / $0.30262.1K
128
104139
1413±8
6,003$0.10 / $0.30262.1K
129
105139
xAI · Proprietary
1413±6
9,916$0.20 / $0.502M
130
113140
DeepSeek · MIT
1409±5
18,710$3 / $4.5032.8K
131
113142
MiniMax · MIT
1408±7
9,072$0.29 / $0.95204.8K
132
113142
Alibaba · Apache 2.0
1408±6
11,075$0.05 / $0.19131.1K
133
114143
OpenAI · Proprietary
1405±5
19,534$1.10 / $4.40200K
134
123146
Arcee AI · Apache 2.0
1403±5
17,882$0.15 / $0.45131K
135
122146
OpenAI · Proprietary
1403±6
12,809$0.25 / $2400K
136
125146
OpenAI · Proprietary
1402±6
16,251$0.40 / $1.601M
137
115151
OpenAI · Proprietary
1401±9
4,386$1.10 / $4.40200K
138
126148
Mistral · Proprietary
1401±6
13,734$0.40 / $2131.1K
139
113159
Tencent
Tencent · Proprietary
1399±13
1,977N/AN/A
140
131151
Anthropic
Anthropic · Proprietary
1398±4
27,293$3 / $15200K
141
131151
Anthropic
Anthropic · Proprietary
1398±6
15,276$3 / $15200K
142
130156
OpenAI · Proprietary
1397±8
8,496$15 / $60N/A
143
134156
1394±6
13,647N/AN/A
144
134159
Alibaba · Apache 2.0
1392±6
10,495$0.46 / $1.82131.1K
145
133164
Tencent
Tencent · Proprietary
1391±10
3,914N/AN/A
146
137159
Z.ai · MIT
1391±5
14,889$0.13 / $0.85131.1K
147
138159
1390±4
25,069$0.10 / $0.401M
148
138163
Arcee AI · Apache 2.0
1388±6
18,218$0.22 / $0.85262.1K
149
138165
Z.ai · MIT
1387±7
6,519$0.06 / $0.40202.8K
150
141165
Alibaba · Proprietary
1385±6
9,672N/AN/A
151
141166
Alibaba · Apache 2.0
1385±8
6,694$0.10 / $0.78262.1K
152
134176
Z.ai · MIT
1383±15
1,496$0.30 / $0.90131.1K
153
143169
1381±6
14,574$0.10 / $0.401M
154
143169
MiniMax · Apache 2.0
1381±5
15,831$0.40 / $2.201M
155
141174
Nvidia · NVIDIA Open Model
1380±9
4,102N/AN/A
156
141177
Stepfun
StepFun · Apache 2.0
1378±11
3,007$0.57 / $1.4265.5K
157
147175
xAI · Proprietary
1377±7
7,566$0.25 / $1.27N/A
158
143183
Z.ai · MIT
1376±12
2,391$0.60 / $1.8065.5K
159
137190
Nvidia · Nvidia Open Model
1375±21
713$0.60 / $1.80131.1K
160
148177
Mistral · Apache 2.0
1375±7
7,814$0.10 / $0.3032K
161
147184
Prime Intellect · MIT
1373±11
2,740$0.20 / $1.10131.1K
162
149182
1373±8
5,956N/AN/A
163
154181
OpenAI · Proprietary
1370±5
20,116$1.10 / $4.40200K
164
152183
xAI · Proprietary
1370±7
9,627$0.30 / $0.50131.1K
165
151185
MiniMax · Apache 2.0
1369±10
3,696$0.26 / $1204.8K
166
154184
Cohere
Cohere · CC-BY-NC-4.0
1368±5
23,792$2.50 / $10256K
167
141197
Tencent
Tencent · Proprietary
1368±23
531N/AN/A
168
147191
Alibaba · Apache 2.0
1368±16
1,207$0.08 / $0.28131.1K
169
152187
Ant Group · MIT
1366±10
3,450N/AN/A
170
155185
Google · Gemma
1365±5
17,871$0.08 / $0.16131.1K
171
156187
OpenAI · Apache 2.0
1362±6
14,765$0.04 / $0.18131.1K
172
147209
1361±24
510N/AN/A
173
159189
Google · Proprietary
1361±5
14,138$0.10 / $0.401M
174
152197
1361±15
1,437$0.10 / $0.40131.1K
175
159189
OpenAI · Proprietary
1361±6
13,899$1.10 / $4.40N/A
176
160189
Anthropic
Anthropic · Proprietary
1360±6
23,249$3 / $15200K
177
159191
Amazon · Proprietary
1359±8
6,479$0.30 / $2.501M
178
154197
Inception AI · Proprietary
1359±14
1,758$0.25 / $0.75128K
179
154200
1358±15
1,418N/AN/A
180
163191
Alibaba · Apache 2.0
1357±6
8,826$0.50 / $116.4K
181
161197
OpenAI · Proprietary
1354±10
3,816$0.05 / $0.40400K
182
159207
Alibaba · Proprietary
1353±14
1,481$0.40 / $1.20131.1K
183
168197
Google · Proprietary
1351±6
14,853$3.50 / $10.502.1M
184
165202
Ant Group · MIT
1351±10
3,574N/AN/A
185
167200
DeepSeek · DeepSeek
1350±8
5,408$1.14 / $4.56N/A
186
167201
Ai2 · Apache 2.0
1350±8
6,444$0.20 / $0.6065.5K
187
154223
Tencent
Tencent · Proprietary
1349±23
496N/AN/A
188
169202
1348±7
6,186$0.07 / $0.301M
189
173206
Alibaba · Apache 2.0
1345±6
10,693$0.09 / $0.45131.1K
190
176205
Anthropic
Anthropic · Proprietary
1344±5
23,783$0.80 / $4200K
191
176212
01.AI
01 AI · Proprietary
1341±8
6,961N/AN/A
192
156235
Ai2 · Apache 2.0
1341±28
438$0.20 / $0.2036.9K
193
176211
Meta
Meta · Llama 3.1 Community
1341±6
10,652$4 / $432.8K
194
172225
Stepfun
StepFun · Proprietary
1338±15
1,221N/AN/A
195
181214
1338±6
15,997$0.63 / $1.80131.1K
196
181216
OpenAI · Proprietary
1337±5
32,416$5 / $15128K
197
184217
Meta
Meta · Llama 3.1 Community
1335±6
16,228$4 / $432.8K
198
181225
Stepfun
StepFun · Proprietary
1333±10
3,706N/AN/A
199
186221
NexusFlow · NexusFlow
1333±7
6,517N/AN/A
200
176230
OpenAI · Proprietary
1333±14
1,669$0.10 / $0.401M
201
186225
Mistral · Proprietary
1332±8
5,659$2 / $540K
202
176235
Google · Gemma
1332±18
977$0.04 / $0.13131.1K
203
176238
Tencent
Tencent · Proprietary
1330±19
889N/AN/A
204
190225
1330±6
13,000$0.40 / $0.708.2K
205
183234
DeepSeek · DeepSeek
1329±13
1,790N/AN/A
206
193226
Anthropic
Anthropic · Proprietary
1328±5
55,074$15 / $75200K
207
190228
Nvidia · NVIDIA Open Model
1328±7
8,152$0.06 / $0.24262.1K
208
188232
Ai2 · Apache 2.0
1328±11
3,024$0.15 / $0.5065.5K
209
191228
Google · Proprietary
1327±6
22,512$3.50 / $10.502.1M
210
192228
OpenAI · Proprietary
1327±6
12,655$2.50 / $10128K
211
189234
Alibaba · Proprietary
1326±11
2,671N/AN/A
212
194228
xAI · Proprietary
1326±6
17,283$2 / $10131.1K
213
187235
IBM · Apache 2.0
1326±13
2,481$0.05 / $0.10131.1K
214
186238
Z.ai · Proprietary
1326±14
1,468N/AN/A
215
192230
Google · Proprietary
1325±7
14,100N/AN/A
216
193235
OpenAI · Apache 2.0
1323±9
4,799$0.03 / $0.14131.1K
217
195235
DeepSeek · DeepSeek
1322±8
6,869N/AN/A
218
196234
Meta
Meta · Llama-3.3
1321±5
17,020$0.10 / $0.32131.1K
219
189243
Inception AI · Proprietary
1320±19
1,031$0.25 / $0.75128K
220
196235
Mistral · Mistral Research
1319±6
12,682$2 / $6131.1K
221
195238
Z.ai · Proprietary
1319±8
7,067$0.44 / $1.76204.8K
222
197235
1319±6
14,702$0.10 / $0.3032K
223
195241
Alibaba · Qwen
1318±9
4,447$1.60 / $6.4032.8K
224
197238
Alibaba · Qwen
1318±6
10,545$1.20 / $1.20N/A
225
202238
OpenAI · Proprietary
1316±6
28,310$10 / $30128K
226
206241
Mistral · MRL
1313±7
6,954$2 / $6128K
227
206241
Google · Gemma
1313±7
8,560$0.06 / $0.1232.8K
228
202242
NexusFlow · CC-BY-NC-4.0
1312±9
5,448N/AN/A
229
208241
OpenAI · Proprietary
1312±6
26,242$10 / $30128K
230
209241
OpenAI · Proprietary
1311±5
18,259$0.15 / $0.60128K
231
195251
Tencent
Tencent · Proprietary
1311±19
873N/AN/A
232
201245
1310±13
1,953$1.20 / $1.20131.1K
233
219243
OpenAI · Proprietary
1306±6
25,575$10 / $30128K
234
212245
Ai2 · Apache 2.0
1305±9
4,380$0.15 / $0.5065.5K
235
219245
OpenAI · Proprietary
1304±8
13,935$30 / $608.2K
236
223243
xAI · Proprietary
1304±6
14,208$2 / $10131.1K
237
209252
Tencent
Tencent · Proprietary
1303±13
2,172N/AN/A
238
208254
Alibaba · Apache 2.0
1303±15
1,386$0.87 / $0.8732K
239
224245
Google · Proprietary
1303±7
9,262$0.07 / $0.301M
240
224246
Amazon · Proprietary
1302±7
6,300$0.80 / $3.20300K
241
219252
IBM · Apache 2.0
1301±11
2,992N/AN/A
242
229249
Meta
Meta · Llama 3.1 Community
1298±6
14,981$0.40 / $0.40131.1K
243
238259
Google · Proprietary
1288±6
18,228$0.07 / $0.301M
244
233263
DeepSeek · DeepSeek License
1288±10
4,439$0.14 / $0.28128K
245
238259
OpenAI · Proprietary
1288±7
23,040$30 / $608.2K
246
238264
Mistral · Apache 2.0
1285±10
3,569$0.05 / $0.0832.8K
247
230267
Google · Gemma
1284±17
1,092$0.04 / $0.08131.1K
248
237264
AI21 Labs · Jamba Open
1283±12
2,372$2 / $8256K
249
243264
Google · Gemma license
1282±5
20,494$0.65 / $0.658.2K
250
243264
Anthropic
Anthropic · Proprietary
1281±6
30,725$3 / $15200K
251
233270
1280±17
1,002N/AN/A
252
239266
Reka AI · Proprietary
1280±12
2,145N/AN/A
253
242264
Nvidia · NVIDIA Open Model
1280±9
5,577N/AN/A
254
243264
Meta
Meta · Llama 3 Community
1279±6
45,956$0.51 / $0.748.2K
255
243266
Microsoft · MIT
1278±8
5,747$0.07 / $0.1416.4K
256
242267
Z.ai · Proprietary
1277±11
2,970N/AN/A
257
239272
Ai2 · Llama 3.1
1273±19
779N/AN/A
258
245270
Alibaba · Qianwen LICENSE
1272±7
10,780$0.90 / $0.9032.8K
259
243271
Princeton · MIT
1272±11
2,554$0.03 / $0.098.2K
260
245270
Amazon · Proprietary
1271±8
4,913$0.06 / $0.24300K
261
245270
Cohere
Cohere · CC-BY-NC-4.0
1271±8
7,233N/AN/A
262
240274
Tencent
Tencent · Proprietary
1270±21
700N/AN/A
263
246272
Reka AI · Proprietary
1266±11
2,289N/AN/A
264
252271
Anthropic
Anthropic · Proprietary
1264±6
33,698$0.25 / $1.25200K
265
254273
Mistral · Proprietary
1261±7
17,187$4 / $1232K
266
245284
Ai2 · Apache-2.0
1261±20
789$0.05 / $0.20128K
267
252275
Cohere
Cohere · CC-BY-NC-4.0
1260±11
2,822$2.50 / $10128K
268
256274
Google · Proprietary
1259±7
9,654$0.07 / $0.301M
269
260274
Google · Gemma license
1257±6
14,756$0.03 / $0.098.2K
270
256279
Cohere
Cohere · CC-BY-NC-4.0
1255±10
2,976$0.15 / $0.60128K
271
262280
Cohere
Cohere · CC-BY-NC-4.0
1251±6
22,394$2.50 / $10128K
272
256288
Mistral · MRL
1251±15
1,257$0.10 / $0.10131.1K
273
264287
Alibaba · Qianwen LICENSE
1245±9
7,695N/AN/A
274
265288
Amazon · Proprietary
1244±8
4,859$0.04 / $0.14128K
275
269288
Mistral · Apache 2.0
1242±7
14,707$0.90 / $0.9065.5K
276
269291
Alibaba · Qianwen LICENSE
1238±8
10,854N/AN/A
277
269294
AI21 Labs · Jamba Open
1236±12
2,350$0.20 / $0.40256K
278
271293
Mistral · Proprietary
1234±9
8,650$2.70 / $8.1032K
279
270295
Reka AI · Proprietary
1234±11
4,506N/AN/A
280
271295
Reka AI · Proprietary
1233±9
7,446N/AN/A
281
271297
Google · Proprietary
1231±11
4,515$0.35 / $1.0532.8K
282
268299
IBM · Apache 2.0
1230±20
798N/AN/A
283
272297
Cohere
Cohere · CC-BY-NC-4.0
1229±11
2,576N/AN/A
284
269299
Google · Proprietary
1229±18
1,407$0.35 / $1.0532.8K
285
276296
OpenAI · Proprietary
1227±7
18,544$0.50 / $1.5016.4K
286
272299
OpenAI · Proprietary
1227±13
3,846$1 / $216.4K
287
273299
InternLM · Other
1225±11
2,597$0 / $032.8K
288
276299
01.AI
01 AI · Apache-2.0
1225±8
6,679N/AN/A
289
271301
Ai2 · Llama 3.1
1223±20
728N/AN/A
290
277299
Meta
Meta · Llama 3.1 Community
1222±6
13,535$0.02 / $0.03131.1K
291
277299
Alibaba · Qianwen LICENSE
1220±9
6,072N/AN/A
292
279299
Meta
Meta · Llama 3 Community
1218±6
30,221$0.04 / $0.048.2K
293
272303
IBM · Apache 2.0
1218±19
832N/AN/A
294
278300
Databricks · DBRX LICENSE
1216±9
8,865$0.60 / $0.6032.8K
295
276303
HuggingFace · Apache 2.0
1214±17
1,240N/AN/A
296
281300
Cohere
Cohere · CC-BY-NC-4.0
1214±7
15,124$0.15 / $0.60128K
297
282301
Microsoft · MIT
1211±8
6,823$0.17 / $0.68N/A
298
284301
Mistral · Apache 2.0
1210±7
19,541$0.63 / $0.6332K
299
284314
IBM · Apache 2.0
1202±15
1,666N/AN/A
300
292313
Alibaba · Qianwen LICENSE
1199±11
4,910$0.30 / $0.30N/A
301
294317
Nexusflow · Apache-2.0
1194±11
4,525N/AN/A
302
297316
Google · Gemma license
1192±9
6,987$0.03 / $0.098.2K
303
299320
Microsoft · MIT
1188±10
5,229$0.15 / $0.60N/A
304
297323
AllenAI/UW · AI2 ImpACT Low-risk
1186±16
1,419N/AN/A
305
299321
OpenChat · Apache-2.0
1186±11
3,435N/AN/A
306
299320
Snowflake · Apache 2.0
1185±9
9,592N/AN/A
307
299320
Google · Gemma license
1185±6
12,322N/AN/A
308
299323
01.AI
01 AI · Yi License
1183±11
3,838$0.90 / $0.904.1K
309
299328
NousResearch · Apache-2.0
1179±17
1,139$0.17 / $0.17N/A
310
299329
IBM · Apache 2.0
1176±15
1,740N/AN/A
311
299332
OpenChat · Apache-2.0
1174±16
1,663$0.20 / $0.20N/A
312
301328
1174±11
3,155$0.13 / $0.524.1K
313
299332
DeepSeek · DeepSeek License
1174±18
1,120N/AN/A
314
301332
Microsoft · Llama 2 Community
1171±15
1,752N/AN/A
315
300334
Alibaba · Apache 2.0
1168±19
852$0.50 / $116.4K
316
303332
Meta
Meta · Llama 3.2
1167±13
2,216$0.05 / $0.34131.1K
317
303332
UC Berkeley · CC-BY-NC-4.0
1167±13
2,471N/AN/A
318
307332
LMSYS · Non-commercial
1163±10
5,071$0 / $02K
319
309333
Meta
Meta · Llama 2 Community
1159±8
9,650$0.70 / $2.804.1K
320
299344
Meta
Meta · Llama 2 Community
1159±31
306$0.70 / $2.8016.4K
321
302341
MosaicML · CC-BY-NC-SA-4.0
1156±27
434N/AN/A
322
306339
Upstage AI · CC-BY-NC-4.0
1156±20
932$0.30 / $0.30N/A
323
309337
Mistral · Apache-2.0
1155±10
5,150$0.20 / $0.2032.8K
324
303344
Cognitive Computations · Apache-2.0
1154±27
359$0.50 / $0.5016.4K
325
309339
Google · Gemma license
1153±14
2,267$0.05 / $0.088.2K
326
311337
Microsoft · MIT
1152±10
5,872$0.13 / $0.52N/A
327
307345
HuggingFace · Apache 2.0
1150±25
572N/AN/A
328
309339
Alibaba · Qianwen LICENSE
1149±16
1,333$0.20 / $0.20N/A
329
312340
Google · Proprietary
1145±15
1,605$0.50 / $0.5025.8K
330
309345
Nvidia · Llama 2 Community
1145±21
787N/AN/A
331
312346
Alibaba · Qianwen LICENSE
1140±19
1,006N/AN/A
332
320344
Meta
Meta · Llama 2 Community
1137±10
4,475$0.25 / $0.254.1K
333
319345
Google · Gemma license
1137±12
3,183N/AN/A
334
320347
Meta
Meta · Llama 2 Community
1132±15
1,540$0.35 / $1.4016.4K
335
322347
LMSYS · Llama 2 Community
1130±11
3,971$0.30 / $0.30N/A
336
322347
Microsoft · MIT
1129±11
6,167$0.13 / $0.52N/A
337
320347
NousResearch · Apache-2.0
1128±18
1,066$0.90 / $0.90N/A
338
312350
TII · Falcon-180B TII License
1126±35
231N/AN/A
339
318350
HuggingFace · MIT
1121±29
333N/AN/A
340
326349
HuggingFace · MIT
1115±14
2,220$0.15 / $0.1516.4K
341
330349
Meta
Meta · Llama 3.2
1113±13
2,247$0.03 / $0.20131.1K
342
325349
Microsoft · Llama 2 Community
1113±17
1,278$0.30 / $0.30N/A
343
327349
Mistral · Apache 2.0
1113±15
1,917$0.07 / $0.284.1K
344
327349
Google · Gemma license
1111±18
1,191$0.10 / $0.10N/A
345
333350
Together AI · Apache 2.0
1106±17
1,237$0.20 / $0.20N/A
346
327352
UW · Non-commercial
1104±25
517N/AN/A
347
334350
LMSYS · Llama 2 Community
1103±17
1,258$0.20 / $0.20N/A
348
338350
Meta
Meta · Llama 2 Community
1095±11
3,289$0.15 / $0.154.1K
349
338351
Alibaba · Qianwen LICENSE
1093±14
2,075$0.10 / $0.10N/A
350
348355
Ai2 · Apache-2.0
1067±17
1,462$0.20 / $0.20N/A
351
343357
Nomic AI · Non-commercial
1063±30
347N/AN/A
352
349356
Tsinghua · Apache-2.0
1059±20
999N/AN/A
353
350359
UC Berkeley · Non-commercial
1033±18
1,250N/AN/A
354
350359
MosaicML · CC-BY-NC-SA-4.0
1031±22
714N/AN/A
355
350359
Tsinghua · Apache-2.0
1027±27
490N/AN/A
356
351359
OpenAssistant · Apache 2.0
1021±19
1,153N/AN/A
357
352360
RWKV · Apache 2.0
1015±21
876N/AN/A
358
353360
Stanford · Non-commercial
1007±20
1,057N/AN/A
359
353360
Tsinghua · Non-commercial
999±22
848N/AN/A
360
357362
Databricks · MIT
971±24
615N/AN/A
361
360363
Stability
Stability AI · CC-BY-NC-SA-4.0
942±24
575N/AN/A
362
360363
LMSYS · Apache 2.0
937±21
797N/AN/A
363
361363
Meta
Meta · Non-commercial
916±30
430$0.23 / $0.23N/A

Default Leaderboard Plots

Fraction of Model A Wins for All Non-tied A vs. B Battles

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Battle Count for Each Combination of Models (without Ties)

Confidence Intervals on Model Strength (via Bootstrapping)