Text Arena | Math

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Mar 11, 2026
540,345 votes
315 models
Rank Spread
1
18
Anthropic
Anthropic · Proprietary
1520±24
571$5 / $251M
2
111
Google · Proprietary
1514±23
659$2 / $121M
3
113
Anthropic
Anthropic · Proprietary
1509±22
692$5 / $251M
4
124
OpenAI · Proprietary
1499±36
270$2.50 / $151.1M
5
138
OpenAI · Proprietary
1480±26
485$1.75 / $14128K
6
222
Google · Proprietary
1480±12
2,657$2 / $121M
7
222
Google · Proprietary
1479±13
2,014$0.50 / $31M
8
130
MoonshotAI
Moonshot · Modified MIT
1476±19
870$0.60 / $3N/A
9
142
Anthropic
Anthropic · Proprietary
1472±26
471$3 / $151M
10
426
Anthropic
Anthropic · Proprietary
1472±12
2,413$5 / $25200K
11
428
Anthropic
1471±13
2,053$5 / $25200K
12
246
Bytedance
Bytedance · Proprietary
1468±25
525N/AN/A
13
438
OpenAI · Proprietary
1467±16
1,464$1.75 / $14400K
14
439
1467±15
1,537$0.50 / $31M
15
173
OpenAI · Proprietary
1463±36
248$2.50 / $151.1M
16
360
xAI · Proprietary
1462±27
428N/AN/A
17
442
OpenAI · Proprietary
1458±12
2,304$1.25 / $10400K
18
440
Anthropic
1458±11
3,104$3 / $15200K
19
463
Z.ai · MIT
1457±23
598$0.72 / $2.30202.8K
20
377
Qwen Icon
Alibaba · Apache 2.0
1456±33
310$0.26 / $2.08262.1K
21
457
Baidu
Baidu · Proprietary
1455±18
997N/AN/A
22
749
OpenAI · Proprietary
1452±10
3,847$2 / $8200K
23
472
MoonshotAI
Moonshot · Modified MIT
1451±22
651$0.45 / $2.20262.1K
24
950
Google · Proprietary
1449±8
6,242$1.25 / $101M
25
760
xAI · Proprietary
1448±12
2,401$0.20 / $0.50N/A
26
1061
Anthropic
1445±11
3,005$15 / $75200K
27
1070
MoonshotAI
Moonshot · Modified MIT
1443±12
2,393$1.15 / $8262.1K
28
874
Qwen Icon
Alibaba · Proprietary
1442±15
1,506$1.20 / $6262.1K
29
680
Qwen Icon
Alibaba · Apache 2.0
1442±23
577$0.39 / $2.34262.1K
30
977
OpenAI · Proprietary
1440±17
1,194$1.75 / $14400K
31
1277
OpenAI · Proprietary
1438±14
1,907$1.25 / $10400K
32
1477
xAI · Proprietary
1435±12
2,275$3 / $15256K
33
1777
Anthropic
Anthropic · Proprietary
1435±9
4,730$15 / $75200K
34
6101
Qwen Icon
Alibaba · Apache 2.0
1434±31
309$0.20 / $1.56262.1K
35
1480
DeepSeek · MIT
1433±13
2,184$0.26 / $0.38163.8K
36
1094
Z.ai · MIT
1433±21
709$0.38 / $1.98202.8K
37
1098
Qwen Icon
Alibaba · Proprietary
1433±24
580$1.20 / $6262.1K
38
10101
1430±27
470$0.27 / $0.41163.8K
39
8107
xAI · Proprietary
1428±29
398$3 / $15256K
40
1986
xAI · Proprietary
1428±11
2,910$0.20 / $0.50N/A
41
10107
Qwen Icon
Alibaba · Proprietary
1427±30
366N/AN/A
42
13101
Minimax
MiniMax · Modified MIT
1426±23
614$0.27 / $0.95196.6K
43
2284
Qwen Icon
Alibaba · Apache 2.0
1426±9
4,545$0.26 / $1.06N/A
44
1995
Z.ai · MIT
1424±13
2,085$0.39 / $1.90204.8K
45
19100
Qwen Icon
Alibaba · Apache 2.0
1424±17
1,188$0.09 / $1.10131.1K
46
2392
Anthropic
Anthropic · Proprietary
1424±11
3,144$3 / $15200K
47
16105
Meituan · MIT
1422±22
684$0.20 / $0.80131.1K
48
2395
OpenAI · Proprietary
1422±12
2,679$1.25 / $10400K
49
10115
Google · Proprietary
1422±32
270$0.25 / $1.501M
50
17104
DeepSeek · MIT
1422±21
768$0.27 / $0.41163.8K
51
2099
DeepSeek · MIT
1421±14
1,730$0.26 / $0.38163.8K
52
18107
DeepSeek · MIT
1420±22
669$1.23 / $4.94N/A
53
2498
Anthropic
Anthropic · Proprietary
1420±12
2,332$15 / $75200K
54
24101
xAI · Proprietary
1418±13
2,148$0.20 / $0.502M
55
20107
DeepSeek · MIT
1418±18
1,006$1.23 / $4.94N/A
56
2699
OpenAI · Proprietary
1418±11
3,014$1.10 / $4.40200K
57
24102
Z.ai · MIT
1417±15
1,436$0.60 / $2.20131.1K
58
19109
MoonshotAI
Moonshot · Modified MIT
1417±21
762$0.60 / $2.50262.1K
59
25101
OpenAI · Proprietary
1417±14
1,783$1.25 / $10128K
60
19113
Qwen Icon
Alibaba · Apache 2.0
1416±23
705$0.20 / $0.88262.1K
61
17119
Qwen Icon
Alibaba · Apache 2.0
1414±29
415$0.26 / $2.60131.1K
62
24110
xAI · Proprietary
1414±18
1,057$0.20 / $0.502M
63
28106
1414±13
1,900$0.30 / $2.501M
64
10128
1414±40
201$0.21 / $0.79163.8K
65
33101
Google · Proprietary
1413±8
6,458$0.30 / $2.501M
66
20115
Baidu
Baidu · Proprietary
1413±23
624N/AN/A
67
28107
DeepSeek · MIT
1413±14
1,606$0.70 / $2.5064K
68
27108
OpenAI · Proprietary
1412±15
1,393$75 / $150128K
69
16125
1412±34
264N/AN/A
70
28110
OpenAI · Proprietary
1412±16
1,444$0.25 / $2400K
71
33106
OpenAI · Proprietary
1411±11
2,986$15 / $60200K
72
16126
Baidu
Baidu · Proprietary
1410±35
258N/AN/A
73
33111
OpenAI · Proprietary
1408±13
1,909$1.10 / $4.40200K
74
19125
Qwen Icon
Alibaba · Apache 2.0
1407±30
323$0.16 / $1.30262.1K
75
16133
Tencent
Tencent · Proprietary
1406±38
239N/AN/A
76
37107
OpenAI · Proprietary
1406±8
5,811$5 / $15128K
77
25121
Qwen Icon
Alibaba · Apache 2.0
1405±24
502$0.11 / $0.60262.1K
78
35114
Anthropic
1405±13
2,100$3 / $151M
79
35114
Mistral · Apache 2.0
1405±13
1,971$0.50 / $1.50N/A
80
37114
Anthropic
Anthropic · Proprietary
1403±11
2,884$15 / $75200K
81
38113
Mistral · Proprietary
1403±9
4,360$2.70 / $8.1032K
82
24128
Qwen Icon
Alibaba · Apache 2.0
1402±30
316$0.08 / $0.2441K
83
19135
1401±37
238N/AN/A
84
35122
DeepSeek · MIT
1400±19
917$0.45 / $2.15163.8K
85
37121
1399±16
1,374N/AN/A
86
39120
Qwen Icon
Alibaba · Apache 2.0
1398±14
1,644$0.46 / $1.82131.1K
87
41119
Qwen Icon
Alibaba · Apache 2.0
1398±12
2,509$0.46 / $1.82131.1K
88
36125
Qwen Icon
Alibaba · Apache 2.0
1397±20
826$0.10 / $0.78131.1K
89
43119
Anthropic
Anthropic · Proprietary
1397±11
3,094$1 / $5200K
90
36125
1397±20
794N/AN/A
91
37125
Minimax
MiniMax · MIT
1397±18
1,026$0.27 / $0.95196.6K
92
37125
Microsoft AI · Proprietary
1396±19
892N/AN/A
93
24139
DeepSeek · MIT
1395±39
218$0.21 / $0.79163.8K
94
41124
Z.ai · MIT
1394±15
1,538$0.13 / $0.85131.1K
95
37128
Stepfun
StepFun · Apache 2.0
1394±21
750$0.10 / $0.30256K
96
45124
MoonshotAI
Moonshot · Modified MIT
1393±14
1,766$0.60 / $2.50131.1K
97
24139
Nvidia
1393±39
195$0.10 / $0.40131.1K
98
46123
Minimax
MiniMax · Apache 2.0
1393±13
1,940$0.40 / $2.201M
99
41128
xAI · Proprietary
1391±18
1,010$0.30 / $0.50131.1K
100
54124
Anthropic
Anthropic · Proprietary
1390±11
2,598$3 / $151M
101
64125
OpenAI · Proprietary
1388±9
4,569$15 / $60N/A
102
63125
Anthropic
1387±11
2,866$3 / $15200K
103
57128
OpenAI · Apache 2.0
1386±14
1,797$0.04 / $0.19131.1K
104
55130
Qwen Icon
Alibaba · Apache 2.0
1385±15
1,433$0.09 / $0.30262.1K
105
70125
OpenAI · Proprietary
1385±8
4,813$1.10 / $4.40200K
106
35144
Nvidia
Nvidia · Nvidia Open Model
1382±37
209$0.60 / $1.80131.1K
107
38139
PrimeIntellect
Prime Intellect · MIT
1382±31
336$0.20 / $1.10131.1K
108
67134
Qwen Icon
Alibaba · Apache 2.0
1381±14
1,681$0.40 / $1.60262.1K
109
53137
1380±22
646$0.09 / $0.29262.1K
110
75133
xAI · Proprietary
1378±11
2,770$3 / $15131.1K
111
73137
1376±16
1,414$0.09 / $0.29262.1K
112
78135
OpenAI · Proprietary
1374±10
3,326$2 / $81M
113
79135
DeepSeek · MIT
1374±10
3,264$3 / $4.5032.8K
114
75137
xAI · Proprietary
1373±14
1,607$0.30 / $0.50131.1K
115
52144
Stepfun
StepFun · Apache 2.0
1371±31
349$0.57 / $1.4265.5K
116
82137
1369±11
2,868$0.10 / $0.401M
117
81139
1369±12
2,168$0.10 / $0.401M
118
86137
Qwen Icon
Alibaba · Proprietary
1369±10
3,371N/AN/A
119
75144
Z.ai · MIT
1367±21
735$0.06 / $0.40202.8K
120
83139
Qwen Icon
Alibaba · Apache 2.0
1366±13
1,763$0.15 / $0.4032.8K
121
68149
Arcee AI
Arcee AI · Apache 2.0
1364±30
380N/AN/A
122
101139
OpenAI · Proprietary
1364±8
7,499$1.10 / $4.40N/A
123
75146
AntGroup
Ant Group · MIT
1362±27
453N/AN/A
124
65151
Z.ai · MIT
1362±34
271$0.60 / $1.8065.5K
125
53156
Nvidia
Nvidia · NVIDIA Open Model
1362±39
213N/AN/A
126
101139
Anthropic
Anthropic · Proprietary
1362±10
3,461$3 / $15200K
127
68153
Minimax
MiniMax · Apache 2.0
1360±35
293$0.26 / $1196.6K
128
104140
Google · Proprietary
1358±9
4,174$0.10 / $0.401M
129
102144
OpenAI · Proprietary
1357±11
2,771$0.40 / $1.601M
130
95144
Nvidia
Nvidia · NVIDIA Open Model
1357±19
995$0.06 / $0.24262.1K
131
102144
Qwen Icon
Alibaba · Apache 2.0
1357±13
1,773$0.08 / $0.2841K
132
105144
Mistral · Proprietary
1353±12
2,342$0.40 / $2131.1K
133
102149
Tencent
Tencent · Proprietary
1351±19
875N/AN/A
134
113144
Anthropic
Anthropic · Proprietary
1350±6
10,152$6 / $30200K
135
96156
OpenAI · Proprietary
1348±27
489$0.05 / $0.40400K
136
96158
AntGroup
Ant Group · MIT
1348±27
458N/AN/A
137
108153
Mistral · Apache 2.0
1343±17
1,070$0.10 / $0.3032K
138
120145
Anthropic
Anthropic · Proprietary
1343±7
11,359$6 / $30200K
139
121149
Google · Proprietary
1340±7
7,610$3.50 / $10.502.1M
140
108164
OpenAI · Apache 2.0
1340±22
692$0.03 / $0.14131.1K
141
113164
Amazon · Proprietary
1337±20
830$0.30 / $2.501M
142
129164
1328±10
2,814$0.07 / $0.301M
143
121174
Qwen Icon
Alibaba · Proprietary
1328±19
732$0.40 / $1.20131.1K
144
131164
Google · Gemma
1326±9
3,685$0.03 / $0.11128K
145
131169
1324±11
2,932$0.63 / $1.80131.1K
146
135169
Meta
Meta · Llama 3.1 Community
1320±7
8,482$4 / $432.8K
147
121189
Google · Gemma
1320±27
389$0.04 / $0.13131.1K
148
137173
Meta
Meta · Llama 3.1 Community
1317±8
5,215$4 / $432.8K
149
134179
1316±13
2,016$0.40 / $0.708.2K
150
131186
Stepfun
StepFun · Proprietary
1315±20
642N/AN/A
151
121199
AllenAI
Ai2 · Apache 2.0
1315±32
308$0.15 / $0.5065.5K
152
137178
NexusFlow · NexusFlow
1315±9
3,412N/AN/A
153
137179
DeepSeek · DeepSeek
1314±11
2,721$1.14 / $4.56N/A
154
140174
Anthropic
Anthropic · Proprietary
1313±6
25,769$15 / $75200K
155
139179
Cohere
Cohere · CC-BY-NC-4.0
1312±9
4,080$2.50 / $10256K
156
140180
OpenAI · Proprietary
1310±8
6,826$2.50 / $10128K
157
140183
01.AI
01 AI · Proprietary
1309±10
3,921N/AN/A
158
139188
Qwen Icon
Alibaba · Proprietary
1308±14
1,404N/AN/A
159
144181
OpenAI · Proprietary
1307±7
15,103$5 / $15128K
160
134197
Stepfun
StepFun · Proprietary
1307±23
634N/AN/A
161
144186
Google · Proprietary
1305±9
6,395N/AN/A
162
146185
OpenAI · Proprietary
1305±8
13,306$10 / $30128K
163
135201
AllenAI
Ai2 · Apache 2.0
1304±23
691$0.20 / $0.6065.5K
164
130205
Tencent
Tencent · Proprietary
1304±31
238N/AN/A
165
147188
OpenAI · Proprietary
1301±8
12,374$10 / $30128K
166
140202
Zhipu · Proprietary
1300±19
721N/AN/A
167
149192
Qwen Icon
Alibaba · Qwen
1299±8
5,415$1.20 / $1.20N/A
168
149192
Meta
Meta · Llama-3.3
1298±7
5,853$0.10 / $0.32131.1K
169
149192
OpenAI · Proprietary
1298±7
13,217$10 / $30128K
170
149193
Google · Proprietary
1298±8
10,492$3.50 / $10.502.1M
171
140206
Tencent
Tencent · Proprietary
1296±24
497N/AN/A
172
153193
xAI · Proprietary
1296±7
8,950$2 / $10131.1K
173
146202
DeepSeek · DeepSeek
1295±17
1,031N/AN/A
174
150199
Qwen Icon
Alibaba · Qwen
1294±12
2,249$1.60 / $6.4032.8K
175
140206
AllenAI
Ai2 · Apache 2.0
1293±26
476$0.15 / $0.5065.5K
176
144206
Tencent
Tencent · Proprietary
1292±24
499N/AN/A
177
144206
Mistral · Proprietary
1292±25
582$2 / $540K
178
156199
Google · Proprietary
1290±8
4,789$0.07 / $0.301M
179
156199
Mistral · Mistral Research
1290±8
6,664$2 / $6131.1K
180
155202
DeepSeek · DeepSeek
1290±10
3,649N/AN/A
181
155202
Zhipu AI · Proprietary
1290±10
3,599$0.44 / $1.76204.8K
182
144214
Tencent
Tencent · Proprietary
1287±29
371N/AN/A
183
159202
Anthropic
Anthropic · Proprietary
1287±7
6,457$0.80 / $4200K
184
161205
Mistral · MRL
1284±9
3,574$2 / $6131.1K
185
162205
OpenAI · Proprietary
1283±10
7,052$30 / $608.2K
186
146217
Tencent
Tencent · Proprietary
1282±31
243N/AN/A
187
157210
Nvidia
1281±17
1,041$1.20 / $1.20131.1K
188
162207
1279±13
2,216$0.10 / $0.3032K
189
146217
IBM · Apache 2.0
1279±31
358N/AN/A
190
167206
OpenAI · Proprietary
1278±7
9,344$0.15 / $0.60128K
191
154215
OpenAI · Proprietary
1278±23
582$0.10 / $0.401M
192
168206
OpenAI · Proprietary
1275±8
11,181$30 / $608.2K
193
167207
Qwen Icon
Alibaba · Qianwen LICENSE
1275±9
4,835$0.90 / $0.9032.8K
194
172206
xAI · Proprietary
1274±7
7,261$2 / $10131.1K
195
159217
Nvidia
1272±22
507N/AN/A
196
162216
Qwen Icon
Alibaba · Apache 2.0
1272±19
725$0.87 / $0.8732K
197
167213
DeepSeek · DeepSeek License
1272±13
1,858$0.14 / $0.28128K
198
172211
Amazon · Proprietary
1272±10
2,978$0.80 / $3.20300K
199
178210
Meta
Meta · Llama 3.1 Community
1271±7
7,677$0.40 / $0.40131.1K
200
168216
Google · Gemma
1269±14
1,648$0.02 / $0.0432.8K
201
178215
Azure
Microsoft · MIT
1267±10
2,764$0.06 / $0.1416.4K
202
165221
AllenAI
Ai2 · Llama 3.1
1266±25
397N/AN/A
203
178217
Mistral · Apache 2.0
1264±13
1,683$0.05 / $0.0832.8K
204
181217
NexusFlow · CC-BY-NC-4.0
1262±10
2,921N/AN/A
205
190217
Meta
Meta · Llama 3 Community
1258±7
20,941$0.51 / $0.748.2K
206
190217
Google · Proprietary
1257±8
8,392$0.07 / $0.301M
207
167229
Google · Gemma
1256±28
423$0.04 / $0.08131.1K
208
192218
Anthropic
Anthropic · Proprietary
1255±8
13,766$3 / $15200K
209
188220
Nvidia
Nvidia · NVIDIA Open Model
1254±12
2,352N/AN/A
210
173234
Tencent
Tencent · Proprietary
1252±29
361N/AN/A
211
190228
Zhipu AI · Proprietary
1249±15
1,191N/AN/A
212
193228
Reka AI · Proprietary
1247±14
1,207N/AN/A
213
195225
Amazon · Proprietary
1247±11
2,511$0.06 / $0.24300K
214
193229
AI21 Labs · Jamba Open
1246±15
1,147$2 / $8256K
215
197225
Mistral · Proprietary
1245±9
7,987$4 / $1232K
216
199222
Google · Gemma license
1245±7
10,170$0.65 / $0.658.2K
217
207234
Cohere
Cohere · CC-BY-NC-4.0
1233±9
3,854N/AN/A
218
206240
Reka AI · Proprietary
1233±14
1,284N/AN/A
219
209234
Anthropic
Anthropic · Proprietary
1232±7
14,983$0.25 / $1.25200K
220
207241
Cohere
Cohere · CC-BY-NC-4.0
1232±14
1,467$2.50 / $10128K
221
210235
Google · Proprietary
1230±8
5,036$0.07 / $0.301M
222
194253
AllenAI
Ai2 · Apache-2.0
1230±28
375$0.05 / $0.20128K
223
210236
Mistral · Apache 2.0
1229±9
6,778$0.90 / $0.9065.5K
224
210245
Amazon · Proprietary
1226±11
2,455$0.04 / $0.14128K
225
212246
Qwen Icon
Alibaba · Qianwen LICENSE
1223±11
3,188N/AN/A
226
214246
Mistral · Proprietary
1222±11
4,406$2.70 / $8.1032K
227
216247
Google · Gemma license
1218±7
7,110$0.03 / $0.098.2K
228
208256
Qwen Icon
Alibaba · Apache 2.0
1217±24
480$0.15 / $0.4032.8K
229
216252
Azure
Microsoft · MIT
1217±10
3,238$0.17 / $0.68N/A
230
212256
Mistral · MRL
1215±20
683$0.10 / $0.10131.1K
231
216252
01.AI
01 AI · Apache-2.0
1215±11
2,985N/AN/A
232
220252
Cohere
Cohere · CC-BY-NC-4.0
1214±8
9,769$2.50 / $10128K
233
216256
Reka AI · Proprietary
1212±14
2,028N/AN/A
234
222256
Qwen Icon
Alibaba · Qianwen LICENSE
1209±9
5,327N/AN/A
235
212259
AllenAI
Ai2 · Llama 3.1
1209±26
363N/AN/A
236
219256
InternLM
InternLM · Other
1209±15
1,387$0 / $032.8K
237
221256
Cohere
Cohere · CC-BY-NC-4.0
1206±13
1,601$0.15 / $0.60128K
238
221256
Princeton · MIT
1205±15
1,285$0.03 / $0.098.2K
239
223256
OpenAI · Proprietary
1203±15
2,134$1 / $216.4K
240
224256
Qwen Icon
Alibaba · Qianwen LICENSE
1202±12
2,649N/AN/A
241
223257
Cohere
Cohere · CC-BY-NC-4.0
1201±15
1,307N/AN/A
242
226256
Reka AI · Proprietary
1200±11
3,363N/AN/A
243
227256
OpenAI · Proprietary
1200±8
8,626$0.50 / $1.5016.4K
244
216265
IBM · Apache 2.0
1199±26
391N/AN/A
245
223262
Google · Proprietary
1197±19
993$0.35 / $1.0532.8K
246
223262
IBM · Apache 2.0
1197±19
873N/AN/A
247
221263
HuggingFace
HuggingFace · Apache 2.0
1197±22
589N/AN/A
248
227259
Databricks · DBRX LICENSE
1196±11
4,001$0.60 / $0.6032.8K
249
227261
1195±13
1,568$0.13 / $0.524.1K
250
227260
Azure
Microsoft · MIT
1194±13
2,092$0.15 / $0.60N/A
251
227261
Google · Proprietary
1194±14
2,274$0.35 / $1.0532.8K
252
231258
Meta
Meta · Llama 3 Community
1193±8
14,252$0.03 / $0.048.2K
253
231259
Meta
Meta · Llama 3.1 Community
1192±8
7,135$0.02 / $0.0516.4K
254
231259
Mistral · Apache 2.0
1192±8
9,663$0.63 / $0.6332K
255
221272
IBM · Apache 2.0
1192±28
382N/AN/A
256
230267
AI21 Labs · Jamba Open
1187±16
1,094$0.20 / $0.40256K
257
244269
Cohere
Cohere · CC-BY-NC-4.0
1176±9
6,682$0.15 / $0.60128K
258
242274
IBM · Apache 2.0
1169±19
908N/AN/A
259
249273
Qwen Icon
Alibaba · Qianwen LICENSE
1168±13
2,184$0.30 / $0.30N/A
260
248274
Meta
Meta · Llama 3.2
1166±16
1,136$0.05 / $0.3480K
261
254274
Snowflake
Snowflake · Apache 2.0
1163±11
4,793N/AN/A
262
256273
Google · Gemma license
1162±8
6,599N/AN/A
263
254276
Nexusflow · Apache-2.0
1160±14
1,973N/AN/A
264
253282
Azure
Microsoft · Llama 2 Community
1159±19
903N/AN/A
265
255276
OpenChat
OpenChat · Apache-2.0
1159±13
1,726N/AN/A
266
256276
Google · Gemma license
1158±11
3,039$0.03 / $0.098.2K
267
251284
DeepSeek · DeepSeek License
1157±23
576N/AN/A
268
243290
HuggingFace
HuggingFace · Apache 2.0
1153±33
271N/AN/A
269
257282
01.AI
01 AI · Yi License
1152±13
2,043$0.90 / $0.904.1K
270
255284
NousResearch · Apache-2.0
1152±20
697$0.17 / $0.17N/A
271
257282
Azure
Microsoft · MIT
1152±12
2,564$0.13 / $0.52N/A
272
257288
AllenAI/UW · AI2 ImpACT Low-risk
1147±19
888N/AN/A
273
260288
Azure
Microsoft · MIT
1140±13
2,813$0.13 / $0.52N/A
274
263288
Meta
Meta · Llama 2 Community
1137±10
4,740$0.70 / $2.804.1K
275
266291
Mistral · Apache-2.0
1129±12
2,605$0.20 / $0.2032.8K
276
266292
UC Berkeley · CC-BY-NC-4.0
1128±16
1,300N/AN/A
277
258298
Cognitive Computations · Apache-2.0
1126±32
219$0.50 / $0.5016.4K
278
266292
Meta
Meta · Llama 3.2
1126±16
1,162$0.03 / $0.2060K
279
263294
Qwen Icon
Alibaba · Qianwen LICENSE
1125±24
534N/AN/A
280
266292
OpenChat
OpenChat · Apache-2.0
1125±18
945$0.20 / $0.20N/A
281
266294
Qwen Icon
Alibaba · Qianwen LICENSE
1122±20
690$0.20 / $0.20N/A
282
269297
Google · Gemma license
1117±16
1,120$0.05 / $0.088.2K
283
271293
LMSYS · Non-commercial
1117±12
2,663$0 / $02K
284
266299
Nvidia
Nvidia · Llama 2 Community
1116±27
440N/AN/A
285
269298
Google · Proprietary
1114±19
901$0.50 / $0.5025.8K
286
274297
Meta
Meta · Llama 2 Community
1111±13
2,218$0.25 / $0.254.1K
287
271299
Upstage AI · CC-BY-NC-4.0
1110±22
604$0.30 / $0.30N/A
288
271299
Meta
Meta · Llama 2 Community
1109±19
770$0.35 / $1.4016.4K
289
274299
Google · Gemma license
1106±16
1,355N/AN/A
290
271303
MosaicML · CC-BY-NC-SA-4.0
1096±34
242N/AN/A
291
275301
NousResearch · Apache-2.0
1095±21
628$0.90 / $0.90N/A
292
282301
Meta
Meta · Llama 2 Community
1087±14
1,656$0.15 / $0.154.1K
293
280302
Qwen Icon
Alibaba · Qianwen LICENSE
1086±18
988$0.10 / $0.10N/A
294
279303
Together AI · Apache 2.0
1085±20
676$0.20 / $0.20N/A
295
282302
HuggingFace
HuggingFace · MIT
1084±17
1,250$0.15 / $0.1516.4K
296
284301
LMSYS · Llama 2 Community
1083±14
2,146$0.30 / $0.30N/A
297
282303
Mistral · Apache 2.0
1082±19
974$0.07 / $0.284.1K
298
276303
UW · Non-commercial
1081±32
280N/AN/A
299
286303
Google · Gemma license
1069±22
597$0.10 / $0.10N/A
300
290303
Azure
Microsoft · Llama 2 Community
1065±20
669$0.30 / $0.30N/A
301
290303
AllenAI
Ai2 · Apache-2.0
1056±19
848$0.20 / $0.20N/A
302
293304
LMSYS · Llama 2 Community
1048±21
658$0.20 / $0.20N/A
303
295304
Tsinghua · Apache-2.0
1042±23
576N/AN/A
304
302312
Nomic AI · Non-commercial
998±37
211N/AN/A
305
304312
Stanford · Non-commercial
989±23
652N/AN/A
306
304312
MosaicML · CC-BY-NC-SA-4.0
984±25
471N/AN/A
307
304312
RWKV
RWKV · Apache 2.0
983±24
544N/AN/A
308
304312
UC Berkeley · Non-commercial
981±21
751N/AN/A
309
304312
Tsinghua · Non-commercial
977±25
525N/AN/A
310
304314
Tsinghua · Apache-2.0
972±35
227N/AN/A
311
304314
OpenAssistant · Apache 2.0
959±22
687N/AN/A
312
304315
Databricks · MIT
948±29
370N/AN/A
313
310315
LMSYS · Apache 2.0
920±26
462N/AN/A
314
310315
Meta
Meta · Non-commercial
917±33
252$0.23 / $0.23N/A
315
312315
Stability
Stability AI · CC-BY-NC-SA-4.0
891±29
353N/AN/A

Default Leaderboard Plots

Battle Count for Each Combination of Models (without Ties)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)