Code Arena | WebDev๐ŸงชSimulations

View overall rankings across AI models on front-end web development tasks, including agentic coding workflows that require multi-step reasoning and tool use.

May 24, 2026
48,264 votes
79 models
Rank Spread
1
16
Anthropic
Anthropic ยท Proprietary
1594+26/-26
635$5 / $251M
2
18
Anthropic
Anthropic ยท Proprietary
1592+27/-27
660$5 / $251M
3
19
Z.ai ยท MIT
1585+33/-33
390$1.40 / $4.40202.8K
4
114
Alibaba ยท Proprietary
1577+50/-50
171$1.25 / $3.751M
5
113
Anthropic
Anthropic ยท Proprietary
1559+21/-21
1,022$5 / $251M
6
115
OpenAI ยท Proprietary
1549+29/-29
522N/AN/A
7
214
Anthropic
Anthropic ยท Proprietary
1547+19/-19
1,139$5 / $251M
8
217
Alibaba ยท Proprietary
1530+35/-35
291$1.04 / $6.24262.1K
9
417
Anthropic
Anthropic ยท Proprietary
1521+18/-18
1,327$3 / $151M
10
418
Moonshot ยท Modified MIT
1520+28/-28
491$0.95 / $4262.1K
11
521
OpenAI ยท Proprietary
1511+27/-27
551N/AN/A
12
326
OpenAI ยท Proprietary
1510+51/-51
166$2.50 / $151.1M
13
422
Google ยท Proprietary
1508+39/-39
281$1.50 / $91M
14
428
Meta
Meta ยท Proprietary
1495+48/-48
173N/AN/A
15
822
Anthropic
Anthropic ยท Proprietary
1494+15/-15
2,062$5 / $25200K
16
826
Xiaomi ยท MIT
1480+26/-26
597$0.43 / $0.871M
17
1126
Google ยท Proprietary
1471+18/-18
1,189$2 / $121M
18
1028
Alibaba ยท Proprietary
1470+24/-24
699$0.33 / $1.951M
19
735
OpenAI ยท Proprietary
1470+52/-52
141$2.50 / $151.1M
20
1226
Anthropic
Anthropic ยท Proprietary
1465+14/-14
2,359$5 / $25200K
21
1132
DeepSeek ยท MIT
1460+28/-28
503$0.43 / $0.871M
22
1133
OpenAI ยท Proprietary
1458+27/-27
529N/AN/A
23
1434
Xiaomi ยท Proprietary
1446+21/-21
845$1 / $31M
24
1432
Google ยท Proprietary
1446+14/-14
3,154$2 / $121M
25
1435
Z.ai ยท MIT
1441+21/-21
916$1 / $3.20202.8K
26
1444
OpenAI ยท Proprietary
1430+34/-34
359$1.75 / $14400K
27
2037
Moonshot ยท Modified MIT
1429+17/-17
1,314$0.60 / $3N/A
28
2038
MiniMax ยท Modified MIT
1424+18/-18
1,154$0.15 / $1.15204.8K
29
1842
OpenAI ยท Proprietary
1424+25/-25
664$0.75 / $4.50400K
30
1844
Xiaomi ยท MIT
1420+28/-28
480$0.14 / $0.281M
31
2241
Google ยท Proprietary
1417+15/-15
2,099$0.50 / $31M
32
2044
MiniMax ยท Modified MIT
1412+21/-21
860$0.28 / $1.20204.8K
33
2346
Z.ai ยท MIT
1408+22/-22
867$0.40 / $1.75202.8K
34
2046
OpenAI ยท Proprietary
1404+31/-31
408$1.75 / $14400K
35
2446
Google ยท Apache 2.0
1392+30/-30
450$0.14 / $0.40262.1K
36
2746
Alibaba ยท Apache 2.0
1391+18/-18
1,121$0.39 / $2.34262.1K
37
2846
Anthropic
Anthropic ยท Proprietary
1390+13/-13
2,710$3 / $15200K
38
2846
Google ยท Proprietary
1389+13/-13
2,264$0.50 / $31M
39
3046
Anthropic
Anthropic ยท Proprietary
1386+13/-13
3,144$3 / $15200K
40
2648
Moonshot ยท Modified MIT
1385+27/-27
521$0.40 / $1.90262.1K
41
3046
MiniMax ยท MIT
1382+16/-16
1,569$0.29 / $0.95204.8K
42
2947
xAI ยท Proprietary
1380+21/-21
881$2 / $62M
43
2853
xAI ยท Proprietary
1375+30/-30
422$1.25 / $2.501M
44
3348
Anthropic
Anthropic ยท Proprietary
1374+16/-16
1,742$15 / $75200K
45
2659
Tencent
Tencent ยท tencent-hunyuan-community
1364+50/-50
168N/AN/A
46
3358
OpenAI ยท Proprietary
1353+36/-36
304$1.75 / $14400K
47
4155
DeepSeek ยท MIT
1343+18/-18
1,235$0.25 / $0.38131.1K
48
4456
Z.ai ยท MIT
1339+16/-16
1,766$0.43 / $1.74202.8K
49
4257
Alibaba ยท Apache 2.0
1338+20/-20
1,003$0.26 / $2.08262.1K
50
4458
Alibaba ยท Apache 2.0
1337+20/-20
969$0.20 / $1.56262.1K
51
4458
OpenAI ยท Proprietary
1334+18/-18
1,246$1.25 / $10400K
52
4460
OpenAI ยท Proprietary
1328+19/-19
1,078$1.75 / $14400K
53
4560
1325+19/-19
1,105$0.10 / $0.30262.1K
54
4662
Moonshot ยท Modified MIT
1311+13/-13
2,623$1.15 / $8262.1K
55
4563
OpenAI ยท Proprietary
1310+23/-23
778$1.25 / $10400K
56
4762
OpenAI ยท Proprietary
1306+14/-14
2,252$1.25 / $10400K
57
4469
Google ยท Apache 2.0
1298+55/-55
148N/AN/A
58
5263
DeepSeek ยท MIT
1298+16/-16
1,578$0.25 / $0.38131.1K
59
5165
OpenAI ยท Proprietary
1297+19/-19
1,233$1.25 / $10400K
60
5463
Anthropic
Anthropic ยท Proprietary
1293+12/-12
3,321$1 / $5200K
61
5466
DeepSeek ยท MIT
1284+20/-20
1,059$0.27 / $0.41163.8K
62
4868
Xiaomi ยท MIT
1283+36/-36
296$0.10 / $0.30262.1K
63
5968
Alibaba ยท Apache 2.0
1266+13/-13
2,646$0.40 / $1.60262.1K
64
5968
MiniMax ยท Apache 2.0
1263+17/-17
1,690$0.26 / $1204.8K
65
6172
Google ยท Proprietary
1243+21/-21
1,099$0.25 / $1.501M
66
5674
Arcee AI ยท Apache 2.0
1238+51/-51
178$0.22 / $0.85262.1K
67
6074
Alibaba ยท Apache 2.0
1228+42/-42
232$0.14 / $1262.1K
68
6175
Mistral ยท Apache 2.0
1220+44/-44
207$0.50 / $1.50N/A
69
6475
Kwai
KwaiKAT ยท Proprietary
1214+32/-32
408$0.21 / $0.83256K
70
6575
xAI ยท Proprietary
1203+20/-20
1,121$0.20 / $0.502M
71
6577
OpenAI ยท Proprietary
1188+36/-36
321$0.25 / $2400K
72
6677
Google ยท Proprietary
1182+25/-25
721$1.25 / $101M
73
6579
Alibaba ยท Proprietary
1173+55/-55
150N/AN/A
74
6679
IBM ยท Apache 2.0
1170+46/-46
247$0.05 / $0.10131.1K
75
6879
Mistral ยท Modified MIT
1137+47/-47
205N/AN/A
76
7179
xAI ยท Proprietary
1127+48/-48
209$0.20 / $1.50N/A
77
7179
xAI ยท Proprietary
1115+52/-52
181$0.20 / $0.502M
78
7379
xAI ยท Proprietary
1093+44/-44
236N/AN/A
79
7379
Mistral ยท Proprietary
1085+46/-46
239$0.40 / $2128K

Remove Style Control Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)