Code Arena🏆WebDev

View overall rankings across AI models on agentic coding tasks involving multi-step reasoning and tool use.

Apr 14, 2026
235,275 votes
60 models
Rank Spread
1
13
Anthropic
Anthropic · Proprietary
1548+11/-11
4,237$5 / $251M
2
13
Anthropic
Anthropic · Proprietary
1545+10/-10
5,085$5 / $251M
3
14
Z.ai · MIT
1537+17/-17
1,482$0.95 / $3.15202.8K
4
34
Anthropic
Anthropic · Proprietary
1524+9/-9
7,077$3 / $151M
5
55
Anthropic
1490+7/-7
13,065$5 / $25200K
6
69
Anthropic
Anthropic · Proprietary
1468+7/-7
14,690$5 / $25200K
7
615
OpenAI · Proprietary
1457+17/-17
1,483$2.50 / $151.1M
8
613
Google · Proprietary
1454+9/-9
6,069$2 / $121M
9
615
Alibaba · Proprietary
1453+13/-13
2,211$0.33 / $1.951M
10
717
Z.ai · MIT
1440+9/-9
4,959$1 / $3.20202.8K
11
717
Z.ai · MIT
1440+10/-10
4,878$0.39 / $1.75202.8K
12
717
Google · Proprietary
1438+7/-7
17,162$2 / $121M
13
717
OpenAI · Proprietary
1437+16/-16
1,449$2.50 / $151.1M
14
817
Google · Proprietary
1437+7/-7
13,269$0.50 / $31M
15
817
Xiaomi · Proprietary
1432+11/-11
3,234$1 / $31M
16
1017
Moonshot · Modified MIT
1429+8/-8
7,339$0.60 / $3N/A
17
1020
MiniMax · Modified MIT
1422+11/-11
3,026$0.30 / $1.20196.6K
18
1726
Moonshot · Modified MIT
1408+11/-11
3,610$0.38 / $1.72262.1K
19
1728
OpenAI · Proprietary
1407+12/-12
2,971$1.75 / $14400K
20
1731
OpenAI · Proprietary
1404+17/-17
1,461$1.75 / $14400K
21
1831
1396+11/-11
3,378$2 / $62M
22
1831
OpenAI · Proprietary
1393+15/-15
1,843$2.50 / $151.1M
23
1831
OpenAI · Proprietary
1393+13/-13
3,755$1.25 / $10400K
24
1831
MiniMax · MIT
1392+8/-8
9,273$0.29 / $0.95196.6K
25
1831
OpenAI · Proprietary
1391+9/-9
6,124$1.25 / $10400K
26
1831
Alibaba · Apache 2.0
1389+8/-8
5,978$0.39 / $2.34262.1K
27
1931
1389+7/-7
12,677$0.50 / $31M
28
1931
MiniMax · Modified MIT
1388+8/-8
7,184$0.12 / $0.99196.6K
29
2031
Anthropic
1388+6/-6
15,750$3 / $15200K
30
2031
Anthropic
Anthropic · Proprietary
1386+6/-6
18,409$3 / $15200K
31
2032
Anthropic
Anthropic · Proprietary
1385+9/-9
8,573$15 / $75200K
32
3134
DeepSeek · MIT
1368+8/-8
7,910$0.26 / $0.38163.8K
33
3235
Alibaba · Apache 2.0
1364+9/-9
4,758$0.26 / $2.08262.1K
34
3236
Z.ai · MIT
1354+9/-9
8,351$0.39 / $1.90204.8K
35
3340
Alibaba · Apache 2.0
1347+10/-10
4,369$0.20 / $1.56262.1K
36
3441
OpenAI · Proprietary
1339+7/-7
12,875$1.25 / $10400K
37
3541
1337+8/-8
6,734$0.09 / $0.29262.1K
38
3541
OpenAI · Proprietary
1335+8/-8
7,765$1.75 / $14400K
39
3541
DeepSeek · MIT
1330+7/-7
9,997$0.26 / $0.38163.8K
40
3641
Moonshot · Modified MIT
1330+6/-6
15,368$1.15 / $8262.1K
41
3542
OpenAI · Proprietary
1329+9/-9
6,229$1.25 / $10400K
42
4144
Anthropic
Anthropic · Proprietary
1314+6/-6
17,091$1 / $5200K
43
4245
MiniMax · Apache 2.0
1304+9/-9
8,402$0.26 / $1196.6K
44
4246
1300+14/-14
2,095$0.09 / $0.29262.1K
45
4346
DeepSeek · MIT
1286+11/-11
4,870$0.27 / $0.41163.8K
46
4446
Alibaba · Apache 2.0
1281+7/-7
15,212$0.40 / $1.60262.1K
47
4752
Kwai
KwaiKAT · Proprietary
1258+15/-15
1,883$0.21 / $0.83256K
48
4753
Alibaba · Apache 2.0
1247+16/-16
1,817$0.16 / $1.30262.1K
49
4754
OpenAI · Proprietary
1239+17/-17
1,444$0.25 / $2400K
50
4754
Alibaba · Proprietary
1236+17/-17
1,562N/AN/A
51
4754
Google · Proprietary
1236+10/-10
5,544$0.25 / $1.501M
52
4754
xAI · Proprietary
1234+9/-9
6,916$0.20 / $0.502M
53
4856
Mistral · Apache 2.0
1222+20/-20
1,032$0.50 / $1.50N/A
54
4957
xAI · Proprietary
1207+20/-20
1,209N/AN/A
55
5357
Google · Proprietary
1202+13/-13
3,300$1.25 / $101M
56
5357
Mistral · Modified MIT
1199+17/-17
1,579N/AN/A
57
5459
Inception AI · Proprietary
1167+23/-23
951$0.25 / $0.75128K
58
5759
xAI · Proprietary
1148+23/-23
936$0.20 / $0.502M
59
5759
xAI · Proprietary
1139+22/-22
984$0.20 / $1.50256K
60
6060
Mistral · Proprietary
1091+23/-23
993$0.40 / $2128K

Remove Style Control Leaderboard Plots

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)