• New Chat
  • Leaderboard
  • Search
Terms of UsePrivacy Policy
Start Voting
Overview
Agent
Start Voting
Agent

Min
Max

Min
Max

Min
Max

Min
Max

Text Arena🇯🇵Japanese

View overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.

Jul 1, 2026
98,200 votes
241 models
Rank by
Rank Spread
1
117
gpt-5.5-high
OpenAI · Proprietary
1524±34
390$5 / $301.1M
2
122
gemini-3-pro
Google · Proprietary
1508±32
410$2 / $121M
3
121
gemini-3.1-pro-preview
Google · Proprietary
1505±25
704$2 / $121M
4
137
qwen3.5-max-preview
Alibaba · Proprietary
1502±47
189N/AN/A
5
137
gemini-3-flash
Google · Proprietary
1493±38
276$0.50 / $31M
6
131
Anthropic
claude-opus-4-6-thinking
Anthropic · Proprietary
1492±28
533$5 / $251M
7
134
Anthropic
claude-opus-4-7
Anthropic · Proprietary
1489±31
482$5 / $251M
8
134
Anthropic
claude-opus-4-6
Anthropic · Proprietary
1487±28
572$5 / $251M
9
138
Anthropic
claude-opus-4-7-thinking
Anthropic · Proprietary
1486±32
412$5 / $251M
10
143
gpt-5.5
OpenAI · Proprietary
1483±35
343$5 / $301.1M
11
141
gpt-5.4-high
OpenAI · Proprietary
1481±30
506$2.50 / $151.1M
12
149
deepseek-v4-pro
DeepSeek · MIT
1472±32
412$0.43 / $0.871M
13
158
grok-4.20-beta1
xAI · Proprietary
1470±42
211N/AN/A
14
159
gpt-5.5-instant
OpenAI · Proprietary
1464±39
260$5 / $301.1M
15
157
gpt-5.1-high
OpenAI · Proprietary
1461±31
375$1.25 / $10400K
16
162
Anthropic
claude-opus-4-5-20251101-thinking-32k
Anthropic · Proprietary
1456±35
312$5 / $25200K
17
258
gpt-5.4
OpenAI · Proprietary
1456±29
495$2.50 / $151.1M
18
170
glm-5.1
Z.ai · MIT
1453±42
204$1.40 / $4.40202.8K
19
452
gemini-2.5-pro
Google · Proprietary
1452±16
1,824$1.25 / $101M
20
269
gpt-5.2-chat-latest-20260210
OpenAI · Proprietary
1451±39
245$1.75 / $14128K
21
270
Anthropic
claude-opus-4-8
Anthropic · Proprietary
1446±37
262$5 / $251M
22
276
Anthropic
claude-opus-4-8-thinking
Anthropic · Proprietary
1446±37
279$5 / $251M
23
464
Anthropic
claude-opus-4-5-20251101
Anthropic · Proprietary
1443±26
592$5 / $25200K
24
473
kimi-k2.6
Moonshot · Modified MIT
1442±33
378$0.95 / $4262.1K
25
468
gemini-3-flash (thinking-minimal)
Google · Proprietary
1441±26
649$0.50 / $31M
26
380
gpt-4.5-preview-2025-02-27
OpenAI · Proprietary
1440±38
271$75 / $150128K
27
476
Anthropic
claude-sonnet-4-6
Anthropic · Proprietary
1438±30
471$3 / $151M
28
478
mimo-v2.5-pro
Xiaomi · MIT
1437±32
391$0.43 / $0.871M
29
478
ernie-5.1
Baidu · Proprietary
1437±31
411N/AN/A
30
480
deepseek-v4-pro-thinking
DeepSeek · MIT
1435±32
393$0.43 / $0.871M
31
484
gpt-5.2-high
OpenAI · Proprietary
1432±32
388$1.75 / $14400K
32
776
grok-4.1
xAI · Proprietary
1432±25
611N/AN/A
33
587
gpt-5.4-mini-high
OpenAI · Proprietary
1431±32
403$0.75 / $4.50400K
34
1076
gpt-5-high
OpenAI · Proprietary
1429±23
817$1.25 / $10400K
35
589
gpt-5.1
OpenAI · Proprietary
1428±32
384$1.25 / $10400K
36
1273
o3-2025-04-16
OpenAI · Proprietary
1427±18
1,281$2 / $8200K
37
789
gpt-5-chat
OpenAI · Proprietary
1426±29
466$1.25 / $10128K
38
1089
kimi-k2.5-thinking
Moonshot · Modified MIT
1425±27
561$0.60 / $3N/A
39
790
deepseek-v4-flash-thinking
DeepSeek · MIT
1425±32
410$0.25 / $1.75200K
40
990
qwen3.5-397b-a17b
Alibaba · Apache 2.0
1424±30
462$0.39 / $2.45256K
41
1190
gemini-3.1-flash-lite-preview
Google · Proprietary
1421±28
547$0.25 / $1.501M
42
596
gpt-5.3-chat-latest
OpenAI · Proprietary
1419±41
245$1.75 / $14128K
43
1293
grok-4.20-beta-0309-reasoning
xAI · Proprietary
1416±29
497$2 / $62M
44
1391
Anthropic
claude-opus-4-1-20250805-thinking-16k
Anthropic · Proprietary
1414±25
560$15 / $75200K
45
1490
chatgpt-4o-latest-20250326
OpenAI · Proprietary
1414±19
1,165$5 / $15128K
46
1394
gpt-5.2
OpenAI · Proprietary
1413±27
599$1.75 / $14400K
47
1294
Bytedance
dola-seed-2.0-pro
Bytedance · Proprietary
1413±28
595N/AN/A
48
1294
grok-4.20-multi-agent-beta-0309
xAI · Proprietary
1412±30
445$2 / $62M
49
1295
grok-4.3
xAI · Proprietary
1411±33
390$1.25 / $2.501M
50
1494
Anthropic
claude-sonnet-4-5-20250929
Anthropic · Proprietary
1409±24
681$3 / $15200K
51
11102
minimax-m3
MiniMax · MiniMax Community License
1408±40
254$0.60 / $2.40N/A
52
1494
grok-4.1-thinking
xAI · Proprietary
1408±26
556N/AN/A
53
10108
mimo-v2-pro
Xiaomi · Proprietary
1408±45
162$1 / $31M
54
12102
glm-5
Z.ai · MIT
1407±39
259$1 / $3.20202.8K
55
1594
Anthropic
claude-sonnet-4-5-20250929-thinking-32k
Anthropic · Proprietary
1406±24
703$3 / $15200K
56
1894
Anthropic
claude-opus-4-1-20250805
Anthropic · Proprietary
1405±19
1,043$15 / $75200K
57
1498
qwen3.6-plus
Alibaba · Proprietary
1404±31
406$0.33 / $1.951M
58
14108
gemini-2.5-flash-preview-09-2025
Google · Proprietary
1399±35
302$0.30 / $2.501M
59
13111
longcat-flash-chat-2602-exp
Meituan · Proprietary
1397±40
247N/AN/A
60
19101
glm-4.5
Z.ai · MIT
1394±24
707$0.60 / $2.20131.1K
61
2696
gemini-2.5-flash
Google · Proprietary
1393±15
1,856$0.30 / $2.501M
62
18110
deepseek-v4-flash
DeepSeek · MIT
1392±31
446$0.09 / $0.181M
63
18112
deepseek-v3.2-thinking
DeepSeek · MIT
1391±34
315$0.23 / $0.34131.1K
64
24108
Anthropic
claude-opus-4-20250514-thinking-16k
Anthropic · Proprietary
1387±23
755$15 / $75200K
65
20112
qwen3-max-preview
Alibaba · Proprietary
1386±31
372$0.78 / $3.90262.1K
66
17121
qwen3.7-plus
Alibaba · Proprietary
1385±41
229$0.32 / $1.281M
67
30108
grok-4-0709
xAI · Proprietary
1384±21
860$3 / $15256K
68
24112
deepseek-r1-0528
DeepSeek · MIT
1383±27
549$0.50 / $2.15163.8K
69
35107
qwen3-235b-a22b-instruct-2507
Alibaba · Apache 2.0
1382±18
1,262$0.26 / $1.06N/A
70
21116
ernie-5.0-0110
Baidu · Proprietary
1381±33
355N/AN/A
71
34110
Anthropic
claude-opus-4-20250514
Anthropic · Proprietary
1381±20
974$15 / $75200K
72
34111
grok-3-preview-02-24
xAI · Proprietary
1380±22
777$3 / $15131.1K
73
20122
qwen3-next-80b-a3b-instruct
Alibaba · Apache 2.0
1379±36
268$0.09 / $1.10262.1K
74
24117
gpt-5.4-nano-high
OpenAI · Proprietary
1378±31
441$0.20 / $1.25400K
75
20124
deepseek-v3.1-thinking
DeepSeek · MIT
1378±38
234$1.23 / $4.94N/A
76
26121
deepseek-v3.1
DeepSeek · MIT
1376±32
308$1.23 / $4.94N/A
77
34114
o1-2024-12-17
OpenAI · Proprietary
1375±26
581$15 / $60200K
78
19130
kimi-k2-0905-preview
Moonshot · Modified MIT
1375±43
182$0.60 / $2.50262.1K
79
30121
qwen3-235b-a22b-thinking-2507
Alibaba · Apache 2.0
1375±31
403$0.15 / $1.50262.1K
80
26123
glm-4.6
Z.ai · MIT
1375±34
310$0.43 / $1.74202.8K
81
32123
deepseek-v3.2
DeepSeek · MIT
1373±31
401$0.23 / $0.34131.1K
82
35121
kimi-k2-thinking-turbo
Moonshot · Modified MIT
1372±28
484$1.15 / $8262.1K
83
20135
qwen3-vl-235b-a22b-instruct
Alibaba · Apache 2.0
1369±47
151$0.20 / $0.88262.1K
84
35126
grok-4-1-fast-reasoning
xAI · Proprietary
1368±32
381$0.20 / $0.502M
85
36126
gpt-5-mini-high
OpenAI · Proprietary
1368±31
404$0.25 / $2400K
86
36126
qwen3.5-flash
Alibaba · Proprietary
1368±30
459N/AN/A
87
44120
qwen3-235b-a22b-no-thinking
Alibaba · Apache 2.0
1367±22
793$0.46 / $1.82131.1K
88
45120
kimi-k2-0711-preview
Moonshot · Modified MIT
1366±21
829$0.60 / $2.50131.1K
89
22136
grok-4-fast-reasoning
xAI · Proprietary
1366±46
176$0.20 / $0.502M
90
32134
qwen3.5-122b-a10b
Alibaba · Apache 2.0
1364±40
228$0.26 / $2.08262.1K
91
44132
gemini-2.5-flash-lite-preview-09-2025-no-thinking
Google · Proprietary
1359±29
416$0.10 / $0.401M
92
43134
mistral-large-3
Mistral · Apache 2.0
1358±34
333$0.50 / $1.50N/A
93
52127
gemini-2.5-flash-lite-preview-06-17-thinking
Google · Proprietary
1356±22
727$0.10 / $0.401M
94
55126
mistral-medium-2508
Mistral · Proprietary
1355±20
997$0.40 / $2131.1K
95
39144
qwen3.5-27b
Alibaba · Apache 2.0
1353±43
202$0.20 / $1.56262.1K
96
57132
Anthropic
claude-3-7-sonnet-20250219-thinking-32k
Anthropic · Proprietary
1350±20
934$3 / $15200K
97
56133
Anthropic
claude-haiku-4-5-20251001
Anthropic · Proprietary
1349±22
924$1 / $5200K
98
56134
o1-preview
OpenAI · Proprietary
1348±24
674$15 / $60N/A
99
34150
deepseek-v3.2-exp
DeepSeek · MIT
1348±54
120$0.27 / $0.41163.8K
100
59133
Anthropic
claude-sonnet-4-20250514
Anthropic · Proprietary
1346±21
875$3 / $151M
101
59135
Anthropic
claude-sonnet-4-20250514-thinking-32k
Anthropic · Proprietary
1345±23
736$3 / $151M
102
56139
Stepfun
step-3.5-flash
StepFun · Apache 2.0
1344±28
554$0.10 / $0.30262.1K
103
64134
gpt-4.1-2025-04-14
OpenAI · Proprietary
1342±19
1,017$2 / $81M
104
64135
o4-mini-2025-04-16
OpenAI · Proprietary
1342±20
982$1.10 / $4.40200K
105
52144
trinity-large-thinking
Arcee AI · Apache 2.0
1341±38
259$0.25 / $0.80262.1K
106
66137
Anthropic
claude-3-7-sonnet-20250219
Anthropic · Proprietary
1339±20
920$3 / $15200K
107
60141
deepseek-r1
DeepSeek · MIT
1338±26
468$0.70 / $2.50163.8K
108
68138
deepseek-v3-0324
DeepSeek · MIT
1338±20
961$3 / $4.5032.8K
109
55150
trinity-large-preview
Arcee AI · Apache 2.0
1334±42
204$0.15 / $0.45131K
110
59147
Tencent
hunyuan-turbos-20250416
Tencent · Proprietary
1332±34
316N/AN/A
111
71144
grok-3-mini-beta
xAI · Proprietary
1330±25
591$0.30 / $0.50131.1K
112
59150
mimo-v2-omni
Xiaomi · Proprietary
1328±38
270$0.40 / $2262.1K
113
71144
glm-4.5-air
Z.ai · MIT
1328±22
785$0.13 / $0.85131.1K
114
59152
qwen3.5-35b-a3b
Alibaba · Apache 2.0
1325±41
227$0.14 / $1262.1K
115
72146
qwen3-30b-a3b-instruct-2507
Alibaba · Apache 2.0
1325±24
704$0.05 / $0.19131.1K
116
76144
gpt-4.1-mini-2025-04-14
OpenAI · Proprietary
1324±21
909$0.40 / $1.601M
117
80147
mistral-medium-2505
Mistral · Proprietary
1322±21
802$0.40 / $2131.1K
118
74148
gemini-2.0-flash-lite-preview-02-05
Google · Proprietary
1320±25
542$0.07 / $0.301M
119
73150
gpt-oss-120b
OpenAI · Apache 2.0
1320±27
502$0.03 / $0.15131.1K
120
84147
qwen3-coder-480b-a35b-instruct
Alibaba · Apache 2.0
1318±21
850$0.40 / $1.60262.1K
121
88147
gemini-1.5-pro-002
Google · Proprietary
1317±18
1,210$3.50 / $10.502.1M
122
74152
o3-mini-high
OpenAI · Proprietary
1316±30
374$1.10 / $4.40200K
123
72153
mimo-v2.5
Xiaomi · MIT
1315±34
385$0.10 / $0.281M
124
90150
Cohere
command-a-03-2025
Cohere · CC-BY-NC-4.0
1311±19
1,114$2.50 / $10256K
125
89151
qwen2.5-max
Alibaba · Proprietary
1310±22
690N/AN/A
126
84155
grok-3-mini-high
xAI · Proprietary
1310±29
419$0.25 / $1.27N/A
127
89152
qwen3-235b-a22b
Alibaba · Apache 2.0
1309±24
646$0.46 / $1.82131.1K
128
92151
gemma-3-27b-it
Google · Gemma
1309±20
951$0.08 / $0.16131.1K
129
83159
mimo-v2-flash (non-thinking)
Xiaomi · MIT
1308±32
397$0.10 / $0.30262.1K
130
98150
Anthropic
claude-3-5-sonnet-20241022
Anthropic · Proprietary
1308±14
1,965$3 / $15200K
131
102150
gpt-4o-2024-05-13
OpenAI · Proprietary
1305±15
2,674$5 / $15128K
132
74177
amazon-nova-experimental-chat-11-10
Amazon · Proprietary
1300±46
175N/AN/A
133
105153
o3-mini
OpenAI · Proprietary
1299±17
1,323$1.10 / $4.40200K
134
103156
gemini-2.0-flash-001
Google · Proprietary
1299±20
961$0.10 / $0.401M
135
81179
qwen3-next-80b-a3b-thinking
Alibaba · Apache 2.0
1297±46
180$0.10 / $0.78262.1K
136
94169
nvidia-llama-3.3-nemotron-super-49b-v1.5
Nvidia · Nvidia Open
1294±31
383$0.10 / $0.40131.1K
137
101165
mistral-small-2506
Mistral · Apache 2.0
1294±27
463$0.10 / $0.3032K
138
106163
gpt-4o-2024-08-06
OpenAI · Proprietary
1292±19
1,104$2.50 / $10128K
139
106163
gemini-advanced-0514
Google · Proprietary
1292±20
1,212N/AN/A
140
105164
gemma-3n-e4b-it
Google · Gemma
1291±23
674$0.06 / $0.1232.8K
141
106168
deepseek-v3
DeepSeek · DeepSeek
1288±24
534$1.14 / $4.56N/A
142
90182
longcat-flash-chat
Meituan · MIT
1287±44
163$0.20 / $0.80131.1K
143
111164
gemini-1.5-pro-001
Google · Proprietary
1286±17
1,935$3.50 / $10.502.1M
144
112164
Anthropic
claude-3-5-sonnet-20240620
Anthropic · Proprietary
1285±16
2,127$3 / $15200K
145
89189
minimax-m2.1-preview
MiniMax · MIT
1280±53
147$0.30 / $1.20204.8K
146
104184
qwen-plus-0125
Alibaba · Proprietary
1278±38
206$0.40 / $1.20131.1K
147
116171
o1-mini
OpenAI · Proprietary
1277±18
1,093$1.10 / $4.40N/A
148
117177
Meta
llama-3.1-405b-instruct-bf16
Meta · Llama 3.1 Community
1274±20
846$4 / $432.8K
149
126175
grok-2-2024-08-13
xAI · Proprietary
1270±16
1,478$2 / $10131.1K
150
131181
gpt-4-1106-preview
OpenAI · Proprietary
1263±18
1,422$10 / $30128K
151
111190
gpt-oss-20b
OpenAI · Apache 2.0
1263±40
233$0.03 / $0.14131.1K
152
134180
Anthropic
claude-3-opus-20240229
Anthropic · Proprietary
1262±14
3,650$15 / $75200K
153
129184
Meta
llama-4-scout-17b-16e-instruct
Meta · Llama
1261±23
715$0.40 / $0.708.2K
154
132186
gemini-1.5-flash-002
Google · Proprietary
1258±22
753$0.07 / $0.301M
155
124190
minimax-m2.7
MiniMax · Modified MIT
1257±33
418$0.18 / $0.72204.8K
156
133188
01.AI
yi-lightning
01 AI · Proprietary
1255±24
565N/AN/A
157
134186
Meta
llama-4-maverick-17b-128e-instruct
Meta · Llama 4
1254±22
862$0.63 / $1.80131.1K
158
117199
deepseek-v2.5-1210
DeepSeek · DeepSeek
1252±42
161N/AN/A
159
136186
gpt-4-0125-preview
OpenAI · Proprietary
1252±18
1,474$10 / $30128K
160
134188
qwen3-30b-a3b
Alibaba · Apache 2.0
1252±23
665$0.12 / $0.50131.1K
161
139186
gpt-4-turbo-2024-04-09
OpenAI · Proprietary
1251±16
2,112$10 / $30128K
162
140186
gpt-4o-mini-2024-07-18
OpenAI · Proprietary
1251±15
1,790$0.15 / $0.60128K
163
133190
qwq-32b
Alibaba · Apache 2.0
1250±26
571$0.50 / $116.4K
164
140193
mistral-small-3.1-24b-instruct-2503
Mistral · Apache 2.0
1243±23
714$0.10 / $0.3032K
165
134199
qwen-max-0919
Alibaba · Qwen
1242±32
312$1.60 / $6.4032.8K
166
143190
Anthropic
claude-3-5-haiku-20241022
Anthropic · Proprietary
1242±17
1,448$1 / $5200K
167
143191
Meta
llama-3.1-405b-instruct-fp8
Meta · Llama 3.1 Community
1238±16
1,606$4 / $432.8K
168
144193
grok-2-mini-2024-08-13
xAI · Proprietary
1236±18
1,225$2 / $10131.1K
169
146193
mistral-large-2407
Mistral · Mistral Research
1234±17
1,249$2 / $6131.1K
170
133206
gpt-4.1-nano-2025-04-14
OpenAI · Proprietary
1233±45
160$0.10 / $0.401M
171
140201
qwen2.5-plus-1127
Alibaba · Proprietary
1233±33
300N/AN/A
172
142201
glm-4-plus
Z.ai · Proprietary
1233±27
485$0.44 / $1.76204.8K
173
143201
athene-v2-chat
NexusFlow · NexusFlow
1232±23
584N/AN/A
174
149196
gemma-2-27b-it
Google · Gemma license
1229±15
1,808$0.65 / $0.658.2K
175
144201
amazon-nova-pro-v1.0
Amazon · Proprietary
1228±26
511$0.80 / $3.20300K
176
143201
deepseek-v2.5
DeepSeek · DeepSeek
1228±27
479N/AN/A
177
147201
minimax-m1
MiniMax · Apache 2.0
1226±23
773$0.40 / $2.201M
178
141206
magistral-medium-2506
Mistral · Proprietary
1226±37
295$2 / $540K
179
131209
nvidia-nemotron-3-nano-30b-a3b-bf16
Nvidia · NVIDIA Open Model
1225±56
127$0.06 / $0.24262.1K
180
150201
gemini-1.5-flash-001
Google · Proprietary
1224±18
1,522$0.07 / $0.301M
181
142208
glm-4-plus-0111
Z.ai · Proprietary
1221±38
201N/AN/A
182
152201
Cohere
command-r-plus
Cohere · CC-BY-NC-4.0
1219±19
1,487$2.50 / $10128K
183
150204
gpt-4-0314
OpenAI · Proprietary
1217±26
619$30 / $608.2K
184
146208
Cohere
command-r-plus-08-2024
Cohere · CC-BY-NC-4.0
1215±38
210$2.50 / $10128K
185
152204
mistral-large-2411
Mistral · MRL
1214±23
629$2 / $6128K
186
157202
Meta
llama-3.3-70b-instruct
Meta · Llama-3.3
1213±19
1,121$0.10 / $0.32131.1K
187
148208
Cohere
command-r-08-2024
Cohere · CC-BY-NC-4.0
1211±34
254$0.15 / $0.60128K
188
157205
qwen2.5-72b-instruct
Alibaba · Qwen
1211±21
788$1.20 / $1.20N/A
189
159208
Cohere
c4ai-aya-expanse-32b
Cohere · CC-BY-NC-4.0
1205±23
670N/AN/A
190
165208
gpt-4-0613
OpenAI · Proprietary
1201±20
1,241$30 / $608.2K
191
168208
Anthropic
claude-3-sonnet-20240229
Anthropic · Proprietary
1196±18
1,753$3 / $15200K
192
169208
gemma-2-9b-it
Google · Gemma license
1195±17
1,301$0.03 / $0.098.2K
193
165209
phi-4
Microsoft · MIT
1195±27
500$0.07 / $0.1416.4K
194
160211
deepseek-coder-v2
DeepSeek · DeepSeek License
1194±31
345$0.14 / $0.28128K
195
164212
gemma-2-9b-it-simpo
Princeton · MIT
1189±33
291$0.03 / $0.098.2K
196
171210
athene-70b-0725
NexusFlow · CC-BY-NC-4.0
1187±23
676N/AN/A
197
169212
amazon-nova-micro-v1.0
Amazon · Proprietary
1184±27
443$0.04 / $0.14128K
198
169212
amazon-nova-lite-v1.0
Amazon · Proprietary
1184±28
403$0.06 / $0.24300K
199
171213
nemotron-4-340b-instruct
Nvidia · NVIDIA Open Model
1180±29
434N/AN/A
200
168214
jamba-1.5-large
AI21 Labs · Jamba Open
1178±37
242$2 / $8256K
201
179212
qwen2-72b-instruct
Alibaba · Qianwen LICENSE
1175±21
901$0.90 / $0.9032.8K
202
180212
Meta
llama-3.1-70b-instruct
Meta · Llama 3.1 Community
1175±17
1,428$0.40 / $0.40131.1K
203
168217
minimax-m2.5
MiniMax · Modified MIT
1174±42
342$0.12 / $0.48204.8K
204
183212
Anthropic
claude-3-haiku-20240307
Anthropic · Proprietary
1172±17
2,063$0.25 / $1.25200K
205
180213
gemini-1.5-flash-8b-001
Google · Proprietary
1172±23
691$0.07 / $0.301M
206
185214
Cohere
command-r
Cohere · CC-BY-NC-4.0
1163±23
856$0.15 / $0.60128K
207
182219
Cohere
c4ai-aya-expanse-8b
Cohere · CC-BY-NC-4.0
1154±36
231N/AN/A
208
185219
mistral-small-24b-instruct-2501
Mistral · Apache 2.0
1151±37
269$0.05 / $0.0832.8K
209
195219
gpt-3.5-turbo-0125
OpenAI · Proprietary
1142±22
965$0.50 / $1.5016.4K
210
192224
reka-flash-21b-20240226-online
Reka AI · Proprietary
1139±36
274N/AN/A
211
194220
qwen1.5-110b-chat
Alibaba · Qianwen LICENSE
1137±27
599N/AN/A
212
196222
qwen1.5-72b-chat
Alibaba · Qianwen LICENSE
1133±28
495N/AN/A
213
206224
gemma-2-2b-it
Google · Gemma license
1121±18
1,228N/AN/A
214
204227
reka-flash-21b-20240226
Reka AI · Proprietary
1117±30
440N/AN/A
215
206227
phi-3-medium-4k-instruct
Microsoft · MIT
1116±24
683$0.17 / $0.68N/A
216
207227
mixtral-8x22b-instruct-v0.1
Mistral · Apache 2.0
1110±21
972$0.90 / $0.9065.5K
217
202232
gemini-pro-dev-api
Google · Proprietary
1106±45
173$0.35 / $1.0532.8K
218
206230
qwen1.5-32b-chat
Alibaba · Qianwen LICENSE
1100±34
324N/AN/A
219
211229
Meta
llama-3.1-8b-instruct
Meta · Llama 3.1 Community
1088±19
1,234$0.02 / $0.03131.1K
220
207235
qwen1.5-14b-chat
Alibaba · Qianwen LICENSE
1083±41
217$0.30 / $0.30N/A
221
214230
Meta
llama-3-70b-instruct
Meta · Llama 3 Community
1083±15
3,276$0.51 / $0.748.2K
222
214232
mistral-large-2402
Mistral · Proprietary
1079±23
945$4 / $1232K
223
211235
dbrx-instruct-preview
Databricks · DBRX LICENSE
1079±31
426$0.60 / $0.6032.8K
224
210235
jamba-1.5-mini
AI21 Labs · Jamba Open
1077±36
270$0.20 / $0.40256K
225
212235
01.AI
yi-1.5-34b-chat
01 AI · Apache-2.0
1076±28
506N/AN/A
226
212235
mistral-medium
Mistral · Proprietary
1070±34
337$2.70 / $8.1032K
227
217238
gemma-1.1-7b-it
Google · Gemma license
1052±30
449$0.03 / $0.098.2K
228
214240
01.AI
yi-34b-chat
01 AI · Yi License
1050±44
197$0.90 / $0.904.1K
229
218238
snowflake-arctic-instruct
Snowflake · Apache 2.0
1043±25
667N/AN/A
230
217240
starling-lm-7b-beta
Nexusflow · Apache-2.0
1042±44
181N/AN/A
231
222238
Meta
llama-3-8b-instruct
Meta · Llama 3 Community
1037±18
1,992$0.14 / $0.148.2K
232
220240
phi-3-small-8k-instruct
Microsoft · MIT
1033±32
390$0.15 / $0.60N/A
233
220241
vicuna-33b
LMSYS · Non-commercial
1024±41
195$0 / $02K
234
222240
phi-3-mini-4k-instruct-june-2024
Microsoft · MIT
1023±33
354$0.13 / $0.524.1K
235
227241
mixtral-8x7b-instruct-v0.1
Mistral · Apache 2.0
1003±22
960$0.63 / $0.6332K
236
222241
vicuna-13b
LMSYS · Llama 2 Community
1002±53
125$0.30 / $0.30N/A
237
227241
phi-3-mini-4k-instruct
Microsoft · MIT
1000±34
384$0.13 / $0.52N/A
238
227241
Meta
llama-2-70b-chat
Meta · Llama 2 Community
989±34
341$0.70 / $2.804.1K
239
230241
phi-3-mini-128k-instruct
Microsoft · MIT
981±32
385$0.13 / $0.52N/A
240
230241
Meta
llama-2-13b-chat
Meta · Llama 2 Community
961±50
175$0.25 / $0.254.1K
241
234241
mistral-7b-instruct-v0.2
Mistral · Apache-2.0
943±47
167$0.20 / $0.2032.8K

Default Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Battle Count for Each Combination of Models (without Ties)

USE CASES

  • Chat with AI
  • Build Apps & Websites
  • Write & Edit Text
  • Search the Web
  • Generate Images
  • Generate Videos
  • Chose any model
  • Compare Models Side by Side

LEADERBOARD RANKINGS

  • Overall
  • Agent
  • Text
  • WebDev
  • Image-to-WebDev
  • Text to Image
  • Image Edit
  • Text to Video
  • Image to Video
  • Video Edit
  • Vision
  • Document
  • Search

COMPANY

  • About Us
  • How It Works
  • Blog
  • Careers
  • Changelog
  • Help Center
  • FAQ

LEGAL

  • Terms
  • Privacy
  • Cookies

FOLLOW

  • X
  • LinkedIn
  • YouTube
  • Discord

© Arena Intelligence 2026