Code ArenaReact

View overall rankings across AI models on agentic coding tasks involving multi-step reasoning and tool use.

Apr 1, 2026
98,690 votes
43 models
Rank Spread
1
12
Anthropic
Anthropic · Proprietary
1540+13/-13
3,022$5 / $251M
2
13
Anthropic
Anthropic · Proprietary
1534+12/-12
3,727$5 / $251M
3
23
Anthropic
Anthropic · Proprietary
1513+9/-9
6,046$3 / $151M
4
47
Anthropic
1477+9/-9
5,251$5 / $25200K
5
48
Anthropic
Anthropic · Proprietary
1460+9/-9
5,955$5 / $25200K
6
415
Alibaba · Proprietary
1452+20/-20
968N/AN/A
7
515
OpenAI · Proprietary
1448+18/-18
1,333N/AN/A
8
615
Google · Proprietary
1440+10/-10
4,762$2 / $121M
9
426
Z.ai · MIT
1435+60/-60
119$0.39 / $1.75202.8K
10
615
Google · Proprietary
1430+10/-10
3,988$0.50 / $31M
11
615
Z.ai · MIT
1429+11/-11
3,952$1 / $3.20202.8K
12
616
Xiaomi · Proprietary
1424+13/-13
2,527$1 / $31M
13
615
Moonshot · Modified MIT
1423+9/-9
5,591$0.60 / $3N/A
14
616
MiniMax · Proprietary
1421+13/-13
2,330$0.30 / $1.20204.8K
15
618
OpenAI · Proprietary
1415+17/-17
1,388N/AN/A
16
1223
Google · Proprietary
1400+11/-11
3,393$2 / $121M
17
1426
OpenAI · Proprietary
1395+13/-13
2,605$1.75 / $14400K
18
1425
Moonshot · Modified MIT
1395+12/-12
3,023$0.38 / $1.91262.1K
19
1526
Anthropic
Anthropic · Proprietary
1386+9/-9
5,525$3 / $15200K
20
1526
MiniMax · Modified MIT
1385+9/-9
5,770$0.12 / $1196.6K
21
1526
Anthropic
1382+10/-10
4,564$3 / $15200K
22
1527
1380+12/-12
2,639$2 / $62M
23
1627
Alibaba · Apache 2.0
1376+9/-9
4,806$0.39 / $2.34262.1K
24
1727
1374+8/-8
6,141$0.50 / $31M
25
1628
MiniMax · MIT
1372+13/-13
2,494$0.27 / $0.95196.6K
26
1528
OpenAI · Proprietary
1372+19/-19
1,027$0.75 / $4.50400K
27
2229
DeepSeek · MIT
1362+10/-10
4,047$0.26 / $0.38163.8K
28
2530
Alibaba · Apache 2.0
1354+10/-10
3,734$0.26 / $2.08262.1K
29
2731
DeepSeek · MIT
1343+10/-10
4,415$0.26 / $0.38163.8K
30
2834
Alibaba · Apache 2.0
1333+11/-11
3,461$0.20 / $1.56262.1K
31
2934
OpenAI · Proprietary
1327+9/-9
4,763$1.75 / $14400K
32
3035
Moonshot · Modified MIT
1318+9/-9
5,231$1.15 / $8262.1K
33
3035
Anthropic
Anthropic · Proprietary
1317+9/-9
5,422$1 / $5200K
34
3035
1311+12/-12
2,611$0.09 / $0.29262.1K
35
3235
OpenAI · Proprietary
1305+12/-12
2,854$1.25 / $10400K
36
3637
Alibaba · Apache 2.0
1264+10/-10
4,582$0.40 / $1.60262.1K
37
3641
1236+20/-20
912$0.09 / $0.29262.1K
38
3741
Alibaba · Apache 2.0
1228+17/-17
1,566$0.16 / $1.30262.1K
39
3741
Google · Proprietary
1226+11/-11
4,587$0.25 / $1.501M
40
3741
xAI · Proprietary
1224+17/-17
1,459$0.20 / $0.502M
41
3742
Alibaba · Proprietary
1216+18/-18
1,366N/AN/A
42
4243
Inception AI · Proprietary
1170+22/-22
957$0.25 / $0.75128K
43
4143
Mistral · Modified MIT
1163+43/-43
225N/AN/A

Remove Style Control Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Battle Count for Each Combination of Models (without Ties)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)