Text Arena 📝 Instruction Following

View overall rankings of AI models on text-to-text tasks spanning math, coding, creative writing, and other open-ended domains.

Mar 31, 2026
1,819,357 votes
337 models
Rank | Rank Spread | Organization · License | Score | Votes | Price | Context
1 | 1-2 | Anthropic · Proprietary | 1512±10 | 3,525 | $5 / $25 | 1M
2 | 1-5 | Anthropic · Proprietary | 1500±10 | 4,106 | $5 / $25 | 1M
3 | 2-8 | Google · Proprietary | 1490±9 | 4,805 | $2 / $12 | 1M
4 | 2-11 | OpenAI · Proprietary | 1488±14 | 1,988 | $2.50 / $15 | 1.1M
5 | 2-10 | Anthropic | 1486±7 | 9,523 | $5 / $25 | 200K
6 | 3-14 | Anthropic · Proprietary | 1479±11 | 3,021 | $3 / $15 | 1M
7 | 3-13 | Anthropic · Proprietary | 1477±6 | 11,764 | $5 / $25 | 200K
8 | 4-16 | Google · Proprietary | 1474±6 | 11,265 | $2 / $12 | 1M
9 | 3-24 | OpenAI · Proprietary | 1470±13 | 2,030 | $2.50 / $15 | 1.1M
10 | 4-27 | xAI · Proprietary | 1467±14 | 1,743 | N/A | N/A
11 | 7-24 | Anthropic | 1463±5 | 15,542 | $3 / $15 | 200K
12 | 6-27 | OpenAI · Proprietary | 1463±10 | 3,608 | $1.75 / $14 | 128K
13 | 5-32 | Alibaba · Proprietary | 1461±15 | 1,572 | N/A | N/A
14 | 6-31 | | 1460±13 | 1,964 | $2 / $6 | 2M
15 | 9-27 | Anthropic · Proprietary | 1460±6 | 14,927 | $3 / $15 | 200K
16 | 9-27 | Anthropic | 1459±6 | 13,073 | $15 / $75 | 200K
17 | 9-28 | Google · Proprietary | 1459±7 | 8,158 | $0.50 / $3 | 1M
18 | 9-29 | Anthropic · Proprietary | 1455±5 | 20,448 | $15 / $75 | 200K
19 | 8-41 | OpenAI · Proprietary | 1452±16 | 1,245 | $0.75 / $4.50 | 400K
20 | 8-42 | Google · Apache 2.0 | 1451±16 | 1,293 | $0.14 / $0.40 | 262.1K
21 | 11-33 | | 1450±7 | 8,259 | $0.50 / $3 | 1M
22 | 11-33 | OpenAI · Proprietary | 1449±7 | 10,831 | $1.25 / $10 | 400K
23 | 9-37 | Z.ai · MIT | 1448±10 | 3,427 | $1 / $3.20 | 202.8K
24 | 9-44 | Xiaomi · Proprietary | 1446±14 | 1,740 | $1 / $3 | 1M
25 | 9-46 | Google · Apache 2.0 | 1446±16 | 1,204 | N/A | N/A
26 | 18-37 | Google · Proprietary | 1443±4 | 27,607 | $1.25 / $10 | 1M
27 | 11-46 | | 1443±12 | 2,130 | $2 / $6 | 2M
28 | 17-42 | Anthropic · Proprietary | 1442±7 | 9,224 | $15 / $75 | 200K
29 | 16-44 | Moonshot · Modified MIT | 1442±9 | 4,752 | $0.60 / $3 | N/A
30 | 15-45 | OpenAI · Proprietary | 1441±10 | 3,242 | $1.75 / $14 | 128K
31 | 9-65 | Bytedance · Proprietary | 1437±21 | 756 | N/A | N/A
32 | 17-55 | Moonshot · Modified MIT | 1436±12 | 2,249 | $0.38 / $1.72 | 262.1K
33 | 19-48 | OpenAI · Proprietary | 1436±8 | 5,501 | $75 / $150 | 128K
34 | 21-52 | xAI · Proprietary | 1433±6 | 12,449 | N/A | N/A
35 | 23-52 | xAI · Proprietary | 1432±6 | 11,641 | N/A | N/A
36 | 21-58 | Alibaba · Apache 2.0 | 1431±10 | 3,562 | $0.39 / $2.34 | 262.1K
37 | 23-58 | Baidu · Proprietary | 1430±8 | 5,548 | N/A | N/A
38 | 23-58 | OpenAI · Proprietary | 1429±7 | 6,819 | $1.75 / $14 | 400K
39 | 21-62 | Z.ai · MIT | 1429±10 | 3,219 | $0.39 / $1.75 | 202.8K
40 | 24-58 | OpenAI · Proprietary | 1428±7 | 7,637 | $1.75 / $14 | 400K
41 | 26-58 | OpenAI · Proprietary | 1428±6 | 11,919 | $1.25 / $10 | 400K
42 | 28-56 | OpenAI · Proprietary | 1427±4 | 22,952 | $5 / $15 | 128K
43 | 26-61 | Alibaba · Proprietary | 1427±7 | 7,354 | $0.78 / $3.90 | 262.1K
44 | 21-88 | | 1420±19 | 935 | N/A | N/A
45 | 32-70 | DeepSeek · MIT | 1420±7 | 10,092 | $0.26 / $0.38 | 163.8K
46 | 29-79 | Baidu · Proprietary | 1420±11 | 2,646 | N/A | N/A
47 | 34-74 | Moonshot · Modified MIT | 1418±6 | 11,773 | $1.15 / $8 | 262.1K
48 | 31-80 | DeepSeek · MIT | 1418±11 | 2,880 | $1.23 / $4.94 | N/A
49 | 34-75 | DeepSeek · MIT | 1418±7 | 8,607 | $0.26 / $0.38 | 163.8K
50 | 23-95 | | 1418±20 | 874 | $0.21 / $0.79 | 163.8K
51 | 32-83 | DeepSeek · MIT | 1416±10 | 3,342 | $0.27 / $0.41 | 163.8K
52 | 32-85 | | 1416±11 | 2,561 | $0.27 / $0.41 | 163.8K
53 | 36-77 | Z.ai · MIT | 1416±6 | 10,109 | $0.39 / $1.90 | 204.8K
54 | 41-75 | Alibaba · Apache 2.0 | 1416±5 | 21,072 | $0.26 / $1.06 | N/A
55 | 36-78 | OpenAI · Proprietary | 1415±7 | 8,230 | $1.25 / $10 | 128K
56 | 32-85 | Alibaba · Proprietary | 1415±11 | 2,661 | $0.78 / $3.90 | 262.1K
57 | 34-87 | Alibaba · Apache 2.0 | 1414±11 | 2,969 | $0.20 / $0.88 | 262.1K
58 | 41-81 | Anthropic | 1414±7 | 8,635 | $3 / $15 | 1M
59 | 35-85 | Google · Proprietary | 1414±10 | 3,654 | $0.25 / $1.50 | 1M
60 | 43-85 | Anthropic · Proprietary | 1411±6 | 10,796 | $15 / $75 | 200K
61 | 31-99 | Meituan · Proprietary | 1411±18 | 1,005 | N/A | N/A
62 | 43-84 | Anthropic · Proprietary | 1411±5 | 15,389 | $1 / $5 | 200K
63 | 43-88 | OpenAI · Proprietary | 1409±7 | 8,352 | $1.25 / $10 | 400K
64 | 44-89 | Anthropic | 1408±6 | 10,183 | $3 / $15 | 200K
65 | 45-93 | xAI · Proprietary | 1406±7 | 9,701 | $3 / $15 | 131.1K
66 | 45-94 | Z.ai · MIT | 1405±8 | 6,220 | $0.60 / $2.20 | 131.1K
67 | 46-93 | OpenAI · Proprietary | 1405±6 | 10,246 | $15 / $60 | 200K
68 | 44-97 | MiniMax · Modified MIT | 1404±9 | 3,992 | $0.12 / $0.99 | 196.6K
69 | 45-99 | DeepSeek · MIT | 1403±10 | 3,720 | $1.23 / $4.94 | N/A
70 | 51-93 | Google · Proprietary | 1403±4 | 27,071 | $0.30 / $2.50 | 1M
71 | 49-97 | Mistral · Apache 2.0 | 1403±7 | 9,637 | $0.50 / $1.50 | N/A
72 | 45-103 | Alibaba · Apache 2.0 | 1402±11 | 2,678 | $0.26 / $2.08 | 262.1K
73 | 50-98 | | 1402±7 | 9,228 | $0.30 / $2.50 | 1M
74 | 41-110 | | 1401±19 | 929 | N/A | N/A
75 | 53-97 | OpenAI · Proprietary | 1401±6 | 13,385 | $2 / $8 | 1M
76 | 54-98 | OpenAI · Proprietary | 1401±6 | 15,621 | $2 / $8 | 200K
77 | 44-107 | xAI · Proprietary | 1400±14 | 1,745 | $3 / $15 | 256K
78 | 53-99 | xAI · Proprietary | 1400±6 | 10,807 | $0.20 / $0.50 | 2M
79 | 44-108 | Baidu · Proprietary | 1400±15 | 1,328 | N/A | N/A
80 | 59-100 | xAI · Proprietary | 1398±6 | 10,926 | $3 / $15 | 256K
81 | 60-99 | Mistral · Proprietary | 1397±5 | 20,206 | $2.70 / $8.10 | 32K
82 | 42-115 | Tencent · Proprietary | 1397±22 | 627 | N/A | N/A
83 | 44-114 | OpenAI · Proprietary | 1396±18 | 1,031 | $0.20 / $1.25 | 400K
84 | 52-108 | Alibaba · Apache 2.0 | 1396±11 | 2,543 | $0.20 / $1.56 | 262.1K
85 | 59-104 | DeepSeek · MIT | 1395±7 | 6,426 | $0.70 / $2.50 | 64K
86 | 48-114 | MiniMax · Proprietary | 1394±17 | 1,203 | $0.30 / $1.20 | 204.8K
87 | 48-116 | DeepSeek · MIT | 1392±18 | 1,039 | $0.21 / $0.79 | 163.8K
88 | 62-110 | DeepSeek · MIT | 1392±9 | 4,078 | $0.45 / $2.15 | 163.8K
89 | 66-108 | Anthropic · Proprietary | 1392±7 | 10,067 | $3 / $15 | 1M
90 | 63-113 | Meituan · MIT | 1390±11 | 2,974 | $0.20 / $0.80 | 131.1K
91 | 63-113 | Alibaba · Apache 2.0 | 1390±11 | 2,709 | $0.16 / $1.30 | 262.1K
92 | 67-110 | xAI · Proprietary | 1389±8 | 5,371 | $0.20 / $0.50 | 2M
93 | 63-114 | Moonshot · Modified MIT | 1388±11 | 2,903 | $0.60 / $2.50 | 262.1K
94 | 63-115 | Alibaba · Apache 2.0 | 1387±12 | 2,115 | $0.15 / $1.50 | 131.1K
95 | 68-113 | MiniMax · MIT | 1387±9 | 4,494 | $0.27 / $0.95 | 196.6K
96 | 55-127 | | 1386±19 | 927 | N/A | N/A
97 | 71-114 | StepFun · Apache 2.0 | 1386±9 | 4,335 | $0.10 / $0.30 | 262.1K
98 | 68-122 | Alibaba · Apache 2.0 | 1385±12 | 2,153 | $0.26 / $2.60 | 131.1K
99 | 78-114 | | 1384±7 | 7,576 | $0.09 / $0.29 | 262.1K
100 | 77-115 | Alibaba · Apache 2.0 | 1384±8 | 6,275 | $0.40 / $1.60 | 262.1K
101 | 80-115 | Anthropic · Proprietary | 1381±6 | 12,378 | $3 / $15 | 200K
102 | 80-117 | Alibaba · Apache 2.0 | 1381±7 | 9,332 | $0.46 / $1.82 | 131.1K
103 | 78-126 | Alibaba · Proprietary | 1380±11 | 2,703 | N/A | N/A
104 | 80-120 | Alibaba · Apache 2.0 | 1380±8 | 6,380 | $0.09 / $1.10 | 262.1K
105 | 79-126 | | 1380±11 | 2,990 | $0.09 / $0.29 | 262.1K
106 | 81-123 | OpenAI · Proprietary | 1379±7 | 12,782 | $15 / $60 | N/A
107 | 84-123 | DeepSeek · MIT | 1377±6 | 12,491 | $3 / $4.50 | 32.8K
108 | 84-126 | Moonshot · Modified MIT | 1376±8 | 6,823 | $0.60 / $2.50 | 131.1K
109 | 73-134 | Tencent · Proprietary | 1376±17 | 1,111 | N/A | N/A
110 | 87-128 | OpenAI · Proprietary | 1373±8 | 6,977 | $0.25 / $2 | 400K
111 | 87-130 | Microsoft AI · Proprietary | 1372±9 | 4,240 | N/A | N/A
112 | 90-129 | OpenAI · Proprietary | 1371±6 | 10,116 | $0.40 / $1.60 | 1M
113 | 87-134 | Arcee AI · Apache 2.0 | 1370±11 | 2,898 | N/A | N/A
114 | 95-131 | | 1369±7 | 8,150 | $0.10 / $0.40 | 1M
115 | 78-142 | Z.ai · MIT | 1369±22 | 734 | $0.30 / $0.90 | 131.1K
116 | 101-130 | Anthropic · Proprietary | 1369±4 | 31,312 | $6 / $30 | 200K
117 | 99-134 | | 1367±7 | 6,749 | N/A | N/A
118 | 100-134 | Alibaba · Apache 2.0 | 1367±8 | 6,020 | $0.09 / $0.30 | 262.1K
119 | 102-133 | OpenAI · Proprietary | 1366±6 | 11,989 | $1.10 / $4.40 | 200K
120 | 101-135 | OpenAI · Proprietary | 1365±8 | 6,681 | $1.10 / $4.40 | 200K
121 | 102-135 | Mistral · Proprietary | 1365±7 | 8,002 | $0.40 / $2 | 131.1K
122 | 105-134 | | 1365±6 | 12,994 | $0.10 / $0.40 | 1M
123 | 102-136 | xAI · Proprietary | 1363±9 | 4,248 | $0.30 / $0.50 | 131.1K
124 | 105-136 | Z.ai · MIT | 1363±7 | 8,202 | $0.13 / $0.85 | 131.1K
125 | 108-142 | Alibaba · Apache 2.0 | 1358±10 | 3,566 | $0.10 / $0.78 | 131.1K
126 | 111-141 | Alibaba · Apache 2.0 | 1357±8 | 6,235 | $0.46 / $1.82 | 131.1K
127 | 113-139 | Alibaba · Proprietary | 1357±6 | 10,992 | N/A | N/A
128 | 101-158 | Nvidia · NVIDIA Open Model | 1355±19 | 935 | N/A | N/A
129 | 114-146 | xAI · Proprietary | 1354±8 | 5,585 | $0.30 / $0.50 | 131.1K
130 | 110-153 | Tencent · Proprietary | 1353±12 | 2,445 | N/A | N/A
131 | 114-153 | Z.ai · MIT | 1351±10 | 3,184 | $0.06 / $0.40 | 202.8K
132 | 109-162 | Tencent · Proprietary | 1350±18 | 886 | N/A | N/A
133 | 115-153 | | 1349±11 | 3,046 | N/A | N/A
134 | 105-168 | Nvidia · Nvidia Open Model | 1349±22 | 660 | $0.60 / $1.80 | 131.1K
135 | 124-153 | Google · Proprietary | 1347±6 | 13,668 | $0.10 / $0.40 | 1M
136 | 124-154 | MiniMax · Apache 2.0 | 1345±7 | 8,772 | $0.40 / $2.20 | 1M
137 | 120-166 | StepFun · Apache 2.0 | 1344±14 | 1,645 | $0.57 / $1.42 | 65.5K
138 | 125-159 | DeepSeek · DeepSeek | 1343±7 | 8,606 | $1.14 / $4.56 | N/A
139 | 126-156 | Google · Gemma | 1343±6 | 12,594 | $0.08 / $0.16 | 131.1K
140 | 128-156 | OpenAI · Proprietary | 1342±5 | 17,011 | $1.10 / $4.40 | 200K
141 | 122-169 | Z.ai · MIT | 1341±16 | 1,311 | $0.60 / $1.80 | 65.5K
142 | 128-159 | Cohere · CC-BY-NC-4.0 | 1341±5 | 15,586 | $2.50 / $10 | 256K
143 | 124-168 | MiniMax · Apache 2.0 | 1339±13 | 2,007 | $0.26 / $1 | 196.6K
144 | 128-166 | Mistral · Apache 2.0 | 1338±9 | 4,420 | $0.10 / $0.30 | 32K
145 | 129-162 | Google · Proprietary | 1337±5 | 22,789 | $3.50 / $10.50 | 2.1M
146 | 133-166 | Anthropic · Proprietary | 1333±5 | 32,074 | $6 / $30 | 200K
147 | 129-173 | Alibaba · Proprietary | 1331±12 | 2,249 | $0.40 / $1.20 | 131.1K
148 | 129-172 | Amazon · Proprietary | 1331±10 | 3,344 | $0.30 / $2.50 | 1M
149 | 136-168 | OpenAI · Proprietary | 1331±5 | 21,478 | $1.10 / $4.40 | N/A
150 | 125-184 | Alibaba · Apache 2.0 | 1331±19 | 858 | $0.08 / $0.24 | 41K
151 | 137-170 | | 1330±6 | 9,249 | $0.07 / $0.30 | 1M
152 | 129-183 | Prime Intellect · MIT | 1328±16 | 1,395 | $0.20 / $1.10 | 131.1K
153 | 139-173 | OpenAI · Apache 2.0 | 1326±7 | 7,901 | $0.04 / $0.19 | 131.1K
154 | 134-184 | OpenAI · Proprietary | 1325±13 | 2,036 | $0.05 / $0.40 | 400K
155 | 129-190 | | 1325±19 | 803 | N/A | N/A
156 | 141-175 | Alibaba · Apache 2.0 | 1323±7 | 7,198 | $0.15 / $0.58 | 131.1K
157 | 144-175 | OpenAI · Proprietary | 1323±5 | 43,766 | $5 / $15 | 128K
158 | 129-192 | Inception AI · Proprietary | 1323±20 | 838 | $0.25 / $0.75 | 128K
159 | 139-184 | Ai2 · Apache 2.0 | 1322±11 | 3,246 | $0.20 / $0.60 | 65.5K
160 | 129-194 | | 1322±20 | 815 | $0.10 / $0.40 | 131.1K
161 | 136-191 | Google · Gemma | 1320±16 | 1,145 | $0.04 / $0.13 | 131.1K
162 | 139-191 | Ant Group · MIT | 1318±14 | 1,821 | N/A | N/A
163 | 141-191 | Ant Group · MIT | 1318±14 | 1,881 | N/A | N/A
164 | 134-202 | | 1317±21 | 729 | N/A | N/A
165 | 149-185 | OpenAI · Proprietary | 1316±6 | 18,305 | $2.50 / $10 | 128K
166 | 144-192 | StepFun · Proprietary | 1315±13 | 1,950 | N/A | N/A
167 | 141-195 | Tencent · Proprietary | 1315±16 | 1,300 | N/A | N/A
168 | 147-194 | Zhipu · Proprietary | 1314±12 | 2,160 | N/A | N/A
169 | 148-193 | DeepSeek · DeepSeek | 1313±11 | 2,970 | N/A | N/A
170 | 150-188 | Google · Proprietary | 1313±7 | 18,524 | N/A | N/A
171 | 141-203 | Tencent · Proprietary | 1313±18 | 842 | N/A | N/A
172 | 152-189 | | 1312±6 | 10,553 | $0.63 / $1.80 | 131.1K
173 | 154-187 | Anthropic · Proprietary | 1311±5 | 22,062 | $0.80 / $4 | 200K
174 | 154-189 | Meta · Llama 3.1 Community | 1311±5 | 23,585 | $4 / $4 | 32.8K
175 | 152-192 | Alibaba · Apache 2.0 | 1311±8 | 6,172 | $0.08 / $0.28 | 41K
176 | 154-190 | Meta · Llama 3.1 Community | 1311±5 | 16,174 | $4 / $4 | 32.8K
177 | 154-190 | xAI · Proprietary | 1310±5 | 25,659 | $2 / $10 | 131.1K
178 | 154-190 | Anthropic · Proprietary | 1310±4 | 72,001 | $15 / $75 | 200K
179 | 149-200 | StepFun · Proprietary | 1309±13 | 2,112 | N/A | N/A
180 | 154-193 | Google · Proprietary | 1308±6 | 29,835 | $3.50 / $10.50 | 2.1M
181 | 154-194 | 01 AI · Proprietary | 1308±7 | 10,932 | N/A | N/A
182 | 128-224 | Ai2 · Apache 2.0 | 1307±39 | 219 | $0.20 / $0.20 | 36.9K
183 | 159-203 | NexusFlow · NexusFlow | 1301±6 | 10,236 | N/A | N/A
184 | 161-203 | OpenAI · Proprietary | 1300±5 | 36,297 | $10 / $30 | 128K
185 | 159-205 | Alibaba · Qwen | 1300±8 | 6,919 | $1.60 / $6.40 | 32.8K
186 | 155-209 | OpenAI · Proprietary | 1300±12 | 2,015 | $0.10 / $0.40 | 1M
187 | 158-208 | Mistral · Proprietary | 1300±11 | 3,133 | $2 / $5 | 40K
188 | 160-205 | Zhipu AI · Proprietary | 1300±7 | 10,743 | $0.44 / $1.76 | 204.8K
189 | 163-207 | | 1298±7 | 7,525 | $0.40 / $0.70 | 8.2K
190 | 154-214 | Ai2 · Apache 2.0 | 1298±16 | 1,507 | $0.15 / $0.50 | 65.5K
191 | 173-207 | Mistral · Mistral Research | 1297±6 | 18,321 | $2 / $6 | 131.1K
192 | 167-211 | Nvidia · NVIDIA Open Model | 1296±9 | 4,238 | $0.06 / $0.24 | 262.1K
193 | 170-211 | Alibaba · Proprietary | 1295±9 | 4,234 | N/A | N/A
194 | 177-210 | | 1294±7 | 8,099 | $0.10 / $0.30 | 32K
195 | 179-210 | OpenAI · Proprietary | 1293±6 | 34,416 | $10 / $30 | 128K
196 | 179-210 | Mistral · MRL | 1293±6 | 10,971 | $2 / $6 | 131.1K
197 | 179-209 | OpenAI · Proprietary | 1292±5 | 26,707 | $0.15 / $0.60 | 128K
198 | 179-212 | Alibaba · Qwen | 1291±6 | 16,363 | $1.20 / $1.20 | N/A
199 | 181-212 | Meta · Llama-3.3 | 1290±5 | 18,823 | $0.10 / $0.32 | 131.1K
200 | 179-212 | DeepSeek · DeepSeek | 1290±7 | 10,175 | N/A | N/A
201 | 184-214 | Google · Proprietary | 1288±6 | 14,561 | $0.07 / $0.30 | 1M
202 | 184-215 | OpenAI · Proprietary | 1287±6 | 33,252 | $10 / $30 | 128K
203 | 175-224 | Tencent · Proprietary | 1286±17 | 1,219 | N/A | N/A
204 | 191-216 | xAI · Proprietary | 1282±5 | 21,131 | $2 / $10 | 131.1K
205 | 180-225 | Tencent · Proprietary | 1281±15 | 1,329 | N/A | N/A
206 | 189-222 | Google · Gemma | 1280±9 | 5,013 | $0.02 / $0.04 | 32.8K
207 | 186-225 | OpenAI · Apache 2.0 | 1280±12 | 2,566 | $0.03 / $0.11 | 131.1K
208 | 194-221 | OpenAI · Proprietary | 1279±7 | 18,087 | $30 / $60 | 8.2K
209 | 188-224 | | 1279±11 | 2,959 | $1.20 / $1.20 | 131.1K
210 | 186-225 | Ai2 · Apache 2.0 | 1278±13 | 2,171 | $0.15 / $0.50 | 65.5K
211 | 199-225 | NexusFlow · CC-BY-NC-4.0 | 1275±8 | 7,490 | N/A | N/A
212 | 202-224 | Amazon · Proprietary | 1275±6 | 9,525 | $0.80 / $3.20 | 300K
213 | 203-225 | OpenAI · Proprietary | 1271±6 | 29,706 | $30 / $60 | 8.2K
214 | 180-240 | Inception AI · Proprietary | 1270±26 | 581 | $0.25 / $0.75 | 128K
215 | 203-225 | Meta · Llama 3.1 Community | 1270±5 | 21,910 | $0.40 / $0.40 | 131.1K
216 | 196-234 | Ai2 · Llama 3.1 | 1270±16 | 1,170 | N/A | N/A
217 | 204-227 | Google · Gemma license | 1267±5 | 29,545 | $0.65 / $0.65 | 8.2K
218 | 199-236 | Google · Gemma | 1266±16 | 1,233 | $0.04 / $0.08 | 131.1K
219 | 201-235 | IBM · Apache 2.0 | 1266±15 | 1,604 | N/A | N/A
220 | 203-234 | AI21 Labs · Jamba Open | 1264±11 | 3,266 | $2 / $8 | 256K
221 | 205-232 | Anthropic · Proprietary | 1264±6 | 38,802 | $3 / $15 | 200K
222 | 203-235 | Alibaba · Apache 2.0 | 1262±12 | 2,227 | $0.87 / $0.87 | 32K
223 | 209-233 | Google · Proprietary | 1262±6 | 23,685 | $0.07 / $0.30 | 1M
224 | 205-237 | Reka AI · Proprietary | 1259±10 | 3,118 | N/A | N/A
225 | 203-240 | | 1258±15 | 1,484 | N/A | N/A
226 | 215-237 | Nvidia · NVIDIA Open Model | 1255±8 | 7,354 | N/A | N/A
227 | 216-237 | Meta · Llama 3 Community | 1254±5 | 56,558 | $0.51 / $0.74 | 8.2K
228 | 216-239 | Mistral · Apache 2.0 | 1254±8 | 5,485 | $0.05 / $0.08 | 32.8K
229 | 215-242 | Zhipu AI · Proprietary | 1253±10 | 3,766 | N/A | N/A
230 | 216-243 | Cohere · CC-BY-NC-4.0 | 1250±9 | 4,024 | $2.50 / $10 | 128K
231 | 217-245 | DeepSeek · DeepSeek License | 1248±9 | 5,614 | $0.14 / $0.28 | 128K
232 | 216-245 | Princeton · MIT | 1248±10 | 3,741 | $0.03 / $0.09 | 8.2K
233 | 220-245 | Cohere · CC-BY-NC-4.0 | 1246±7 | 11,265 | N/A | N/A
234 | 218-245 | Reka AI · Proprietary | 1245±10 | 3,246 | N/A | N/A
235 | 222-245 | Microsoft · MIT | 1243±7 | 9,162 | $0.07 / $0.14 | 16.4K
236 | 223-245 | Amazon · Proprietary | 1242±7 | 7,809 | $0.06 / $0.24 | 300K
237 | 216-246 | Tencent · Proprietary | 1242±17 | 1,098 | N/A | N/A
238 | 226-245 | Google · Gemma license | 1241±5 | 21,359 | $0.03 / $0.09 | 8.2K
239 | 226-245 | Anthropic · Proprietary | 1240±5 | 43,031 | $0.25 / $1.25 | 200K
240 | 227-245 | Alibaba · Qianwen LICENSE | 1238±7 | 14,194 | $0.90 / $0.90 | 32.8K
241 | 229-245 | Cohere · CC-BY-NC-4.0 | 1237±6 | 28,069 | $2.50 / $10 | 128K
242 | 230-245 | Google · Proprietary | 1236±6 | 14,894 | $0.07 / $0.30 | 1M
243 | 231-246 | Mistral · Proprietary | 1233±7 | 21,532 | $4 / $12 | 32K
244 | 231-247 | Cohere · CC-BY-NC-4.0 | 1232±9 | 4,153 | $0.15 / $0.60 | 128K
245 | 229-257 | Ai2 · Apache-2.0 | 1226±17 | 1,063 | $0.05 / $0.20 | 128K
246 | 244-258 | Alibaba · Qianwen LICENSE | 1214±8 | 9,518 | N/A | N/A
247 | 245-258 | Amazon · Proprietary | 1213±7 | 7,716 | $0.04 / $0.14 | 128K
248 | 242-263 | Google · Proprietary | 1212±17 | 1,897 | $0.35 / $1.05 | 32.8K
249 | 245-259 | Mistral · Apache 2.0 | 1211±7 | 18,515 | $0.90 / $0.90 | 65.5K
250 | 245-261 | OpenAI · Proprietary | 1209±6 | 23,523 | $0.50 / $1.50 | 16.4K
251 | 245-263 | Mistral · MRL | 1209±13 | 1,946 | $0.10 / $0.10 | 131.1K
252 | 245-262 | Alibaba · Qianwen LICENSE | 1207±7 | 13,814 | N/A | N/A
253 | 244-266 | Ai2 · Llama 3.1 | 1207±16 | 1,172 | N/A | N/A
254 | 245-262 | Mistral · Proprietary | 1207±8 | 11,466 | $2.70 / $8.10 | 32K
255 | 245-265 | Google · Proprietary | 1203±11 | 5,876 | $0.35 / $1.05 | 32.8K
256 | 245-266 | Cohere · CC-BY-NC-4.0 | 1202±10 | 4,006 | N/A | N/A
257 | 245-266 | AI21 Labs · Jamba Open | 1202±11 | 3,254 | $0.20 / $0.40 | 256K
258 | 246-268 | Reka AI · Proprietary | 1197±10 | 5,558 | N/A | N/A
259 | 250-268 | Cohere · CC-BY-NC-4.0 | 1195±7 | 19,085 | $0.15 / $0.60 | 128K
260 | 249-274 | OpenAI · Proprietary | 1190±12 | 5,238 | $1 / $2 | 16.4K
261 | 254-270 | Meta · Llama 3.1 Community | 1190±6 | 19,781 | $0.02 / $0.05 | 16.4K
262 | 248-277 | IBM · Apache 2.0 | 1189±17 | 1,258 | N/A | N/A
263 | 249-276 | HuggingFace · Apache 2.0 | 1188±16 | 1,593 | N/A | N/A
264 | 254-270 | Meta · Llama 3 Community | 1188±6 | 37,733 | $0.03 / $0.04 | 8.2K
265 | 252-273 | Reka AI · Proprietary | 1188±8 | 9,018 | N/A | N/A
266 | 255-274 | 01 AI · Apache-2.0 | 1185±8 | 8,996 | N/A | N/A
267 | 258-275 | Databricks · DBRX LICENSE | 1182±9 | 11,274 | $0.60 / $0.60 | 32.8K
268 | 258-277 | Alibaba · Qianwen LICENSE | 1180±9 | 7,653 | N/A | N/A
269 | 262-277 | Mistral · Apache 2.0 | 1176±6 | 24,974 | $0.63 / $0.63 | 32K
270 | 260-278 | InternLM · Other | 1175±10 | 4,092 | $0 / $0 | 32.8K
271 | 262-278 | Microsoft · MIT | 1173±7 | 9,385 | $0.17 / $0.68 | N/A
272 | 260-288 | IBM · Apache 2.0 | 1170±16 | 1,252 | N/A | N/A
273 | 265-281 | Google · Gemma license | 1168±6 | 18,240 | N/A | N/A
274 | 262-289 | AllenAI/UW · AI2 ImpACT Low-risk | 1167±15 | 2,008 | N/A | N/A
275 | 263-287 | IBM · Apache 2.0 | 1167±12 | 2,597 | N/A | N/A
276 | 265-287 | Alibaba · Qianwen LICENSE | 1164±10 | 6,231 | $0.30 / $0.30 | N/A
277 | 267-291 | Microsoft · Llama 2 Community | 1159±13 | 2,680 | N/A | N/A
278 | 270-296 | DeepSeek · DeepSeek License | 1152±17 | 1,525 | N/A | N/A
279 | 273-293 | OpenChat · Apache-2.0 | 1150±11 | 4,414 | N/A | N/A
280 | 273-293 | Google · Gemma license | 1150±8 | 8,852 | $0.03 / $0.09 | 8.2K
281 | 273-293 | Microsoft · MIT | 1150±9 | 6,632 | $0.15 / $0.60 | N/A
282 | 272-296 | OpenChat · Apache-2.0 | 1149±14 | 2,391 | $0.20 / $0.20 | N/A
283 | 273-295 | 01 AI · Yi License | 1148±10 | 5,099 | $0.90 / $0.90 | 4.1K
284 | 272-298 | NousResearch · Apache-2.0 | 1148±15 | 1,577 | $0.17 / $0.17 | N/A
285 | 273-294 | Snowflake · Apache 2.0 | 1148±8 | 11,736 | N/A | N/A
286 | 272-298 | Alibaba · Apache 2.0 | 1147±16 | 1,329 | $0.15 / $0.58 | 131.1K
287 | 274-298 | Meta · Llama 3.2 | 1143±11 | 3,171 | $0.05 / $0.34 | 80K
288 | 276-298 | Nexusflow · Apache-2.0 | 1143±10 | 5,765 | N/A | N/A
289 | 278-302 | LMSYS · Non-commercial | 1135±9 | 6,983 | $0 / $0 | 2K
290 | 278-306 | UC Berkeley · CC-BY-NC-4.0 | 1132±12 | 3,316 | N/A | N/A
291 | 283-303 | Meta · Llama 2 Community | 1131±8 | 12,635 | $0.70 / $2.80 | 4.1K
292 | 277-312 | MosaicML · CC-BY-NC-SA-4.0 | 1129±21 | 718 | N/A | N/A
293 | 273-315 | TII · Falcon-180B TII License | 1129±29 | 389 | N/A | N/A
294 | 281-310 | IBM · Apache 2.0 | 1127±12 | 2,698 | N/A | N/A
295 | 277-315 | Cognitive Computations · Apache-2.0 | 1124±25 | 497 | $0.50 / $0.50 | 16.4K
296 | 282-314 | Nvidia · Llama 2 Community | 1120±18 | 1,076 | N/A | N/A
297 | 289-312 | | 1120±10 | 4,431 | $0.13 / $0.52 | 4.1K
298 | 285-312 | Alibaba · Qianwen LICENSE | 1120±14 | 1,715 | $0.20 / $0.20 | N/A
299 | 289-312 | Mistral · Apache-2.0 | 1119±9 | 6,659 | $0.20 / $0.20 | 32.8K
300 | 285-313 | Microsoft · Llama 2 Community | 1119±14 | 2,003 | $0.30 / $0.30 | N/A
301 | 289-316 | Alibaba · Qianwen LICENSE | 1113±16 | 1,470 | N/A | N/A
302 | 291-314 | LMSYS · Llama 2 Community | 1112±10 | 5,665 | $0.30 / $0.30 | N/A
303 | 289-318 | Upstage AI · CC-BY-NC-4.0 | 1111±19 | 1,188 | $0.30 / $0.30 | N/A
304 | 290-316 | Google · Proprietary | 1110±14 | 2,536 | $0.50 / $0.50 | 25.8K
305 | 292-314 | Microsoft · MIT | 1110±9 | 7,636 | $0.13 / $0.52 | N/A
306 | 292-316 | Meta · Llama 2 Community | 1106±9 | 6,097 | $0.25 / $0.25 | 4.1K
307 | 291-318 | NousResearch · Apache-2.0 | 1104±16 | 1,421 | $0.90 / $0.90 | N/A
308 | 293-318 | Meta · Llama 2 Community | 1099±13 | 2,294 | $0.35 / $1.40 | 16.4K
309 | 293-318 | Google · Gemma license | 1099±13 | 2,835 | $0.05 / $0.08 | 8.2K
310 | 292-321 | HuggingFace · Apache 2.0 | 1099±21 | 859 | N/A | N/A
311 | 297-319 | Microsoft · MIT | 1095±10 | 7,368 | $0.13 / $0.52 | N/A
312 | 292-323 | HuggingFace · MIT | 1095±24 | 534 | N/A | N/A
313 | 298-319 | Google · Gemma license | 1094±11 | 3,877 | N/A | N/A
314 | 291-323 | Meta · Llama 2 Community | 1093±30 | 358 | $0.70 / $2.80 | 16.4K
315 | 301-323 | Together AI · Apache 2.0 | 1085±15 | 1,660 | $0.20 / $0.20 | N/A
316 | 303-323 | HuggingFace · MIT | 1085±13 | 3,094 | $0.15 / $0.15 | 16.4K
317 | 306-323 | Meta · Llama 3.2 | 1083±11 | 3,248 | $0.03 / $0.20 | 60K
318 | 306-323 | Mistral · Apache 2.0 | 1081±14 | 2,768 | $0.07 / $0.28 | 4.1K
319 | 310-323 | LMSYS · Llama 2 Community | 1071±14 | 2,020 | $0.20 / $0.20 | N/A
320 | 313-324 | Meta · Llama 2 Community | 1066±10 | 4,541 | $0.15 / $0.15 | 4.1K
321 | 312-325 | Google · Gemma license | 1064±16 | 1,522 | $0.10 / $0.10 | N/A
322 | 313-324 | Alibaba · Qianwen LICENSE | 1064±13 | 2,636 | $0.10 / $0.10 | N/A
323 | 312-326 | UW · Non-commercial | 1061±21 | 777 | N/A | N/A
324 | 320-329 | Nomic AI · Non-commercial | 1032±25 | 483 | N/A | N/A
325 | 322-329 | Tsinghua · Apache-2.0 | 1031±18 | 1,321 | N/A | N/A
326 | 323-329 | Ai2 · Apache-2.0 | 1026±16 | 1,908 | $0.20 / $0.20 | N/A
327 | 324-329 | UC Berkeley · Non-commercial | 1021±15 | 1,913 | N/A | N/A
328 | 324-330 | Stanford · Non-commercial | 1016±16 | 1,512 | N/A | N/A
329 | 324-332 | MosaicML · CC-BY-NC-SA-4.0 | 1002±19 | 1,115 | N/A | N/A
330 | 329-334 | OpenAssistant · Apache 2.0 | 981±16 | 1,744 | N/A | N/A
331 | 328-334 | Tsinghua · Apache-2.0 | 977±23 | 762 | N/A | N/A
332 | 329-334 | Tsinghua · Non-commercial | 973±18 | 1,277 | N/A | N/A
333 | 330-335 | RWKV · Apache 2.0 | 964±17 | 1,375 | N/A | N/A
334 | 330-336 | LMSYS · Apache 2.0 | 952±19 | 1,134 | N/A | N/A
335 | 333-337 | Databricks · MIT | 931±21 | 899 | N/A | N/A
336 | 334-337 | Meta · Non-commercial | 909±25 | 584 | $0.23 / $0.23 | N/A
337 | 335-337 | Stability AI · CC-BY-NC-SA-4.0 | 905±20 | 814 | N/A | N/A
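
Each entry above pairs a score with a ± margin. As a rough reading aid, the sketch below checks whether two entries' intervals overlap and converts a score gap into an expected head-to-head win rate. It assumes the scores sit on an Elo-style scale (base-10 logistic with a 400-point divisor), which this page does not state. The helper names `parse_score`, `intervals_overlap`, and `expected_win_probability` are hypothetical.

```python
def parse_score(text):
    """Split a string such as '1512±10' into (score, half_width)."""
    score, half_width = text.split("±")
    return float(score), float(half_width)


def intervals_overlap(entry_a, entry_b):
    """True when the two score ± margin intervals share at least one point."""
    (score_a, width_a) = parse_score(entry_a)
    (score_b, width_b) = parse_score(entry_b)
    return abs(score_a - score_b) <= width_a + width_b


def expected_win_probability(score_a, score_b, scale=400.0):
    """Elo-style expected win rate of A over B.

    ASSUMPTION: base-10 logistic with a 400-point scale; the page itself
    does not say which rating scale it uses.
    """
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / scale))


if __name__ == "__main__":
    top, second = "1512±10", "1500±10"  # first two entries listed above
    print(intervals_overlap(top, second))
    print(round(expected_win_probability(1512, 1500), 3))
```

Under that assumption, the 12-point gap between the first two entries corresponds to an expected win rate of roughly 52%, and their intervals overlap. Small gaps near the top of the list therefore say little about which model would win a given comparison.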

Default Leaderboard Plots

Fraction of Model A Wins for All Non-tied A vs. B Battles

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Battle Count for Each Combination of Models (without Ties)

Confidence Intervals on Model Strength (via Bootstrapping)
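
The four plots named above all derive from pairwise battle records. The sketch below shows how such quantities could be computed from a made-up battle log with hypothetical function names. It bootstraps the average win rate as a simple stand-in for model strength, and is not the leaderboard's actual rating method.

```python
import random
from collections import defaultdict

# Hypothetical battle log: (model_a, model_b, winner), winner in {"a", "b", "tie"}.
BATTLES = [
    ("m1", "m2", "a"), ("m1", "m2", "a"), ("m1", "m2", "b"),
    ("m2", "m3", "a"), ("m2", "m3", "tie"), ("m3", "m1", "b"),
    ("m1", "m3", "a"), ("m3", "m2", "a"),
]


def pairwise_stats(battles):
    """Return (wins, counts) over non-tied battles.

    wins[x][y]   = number of times x beat y
    counts[x][y] = number of non-tied battles between x and y
    """
    wins = defaultdict(lambda: defaultdict(int))
    counts = defaultdict(lambda: defaultdict(int))
    for model_a, model_b, winner in battles:
        if winner == "tie":
            continue  # ties are excluded, as in the plot titles
        counts[model_a][model_b] += 1
        counts[model_b][model_a] += 1
        if winner == "a":
            wins[model_a][model_b] += 1
        else:
            wins[model_b][model_a] += 1
    return wins, counts


def average_win_rate(battles, model):
    """Mean of per-opponent win fractions, weighting every opponent equally."""
    wins, counts = pairwise_stats(battles)
    rates = [wins[model][opp] / n for opp, n in counts[model].items() if n > 0]
    return sum(rates) / len(rates) if rates else None


def bootstrap_interval(battles, model, rounds=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for a model's average win rate."""
    rng = random.Random(seed)
    samples = []
    for _ in range(rounds):
        resample = rng.choices(battles, k=len(battles))  # sample with replacement
        rate = average_win_rate(resample, model)
        if rate is not None:  # skip resamples in which the model never appears
            samples.append(rate)
    if not samples:
        raise ValueError("model never appeared in any resample")
    samples.sort()
    low = samples[int((alpha / 2) * len(samples))]
    high = samples[int((1 - alpha / 2) * len(samples)) - 1]
    return low, high


if __name__ == "__main__":
    print(average_win_rate(BATTLES, "m1"))
    print(bootstrap_interval(BATTLES, "m1"))
```

Averaging per-opponent fractions, instead of pooling all battles, is one way to read "uniform sampling": a model that happened to face one opponent many times is not rewarded or penalized for that imbalance.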