Code Arena · React

View overall rankings across AI models on agentic coding tasks involving multi-step reasoning and tool use.

Apr 14, 2026
108,261 votes
44 models
| Rank | Rank Spread | Organization · License | Score | Votes | Price | Context |
|---|---|---|---|---|---|---|
| 1 | 1-3 | Anthropic · Proprietary | 1542 +12/-12 | 3,508 | $5 / $25 | 1M |
| 2 | 1-3 | Anthropic · Proprietary | 1536 +11/-11 | 4,262 | $5 / $25 | 1M |
| 3 | 1-4 | Z.ai · MIT | 1532 +18/-18 | 1,282 | $0.95 / $3.15 | 202.8K |
| 4 | 3-4 | Anthropic · Proprietary | 1515 +9/-9 | 6,041 | $3 / $15 | 1M |
| 5 | 5-8 | Anthropic | 1477 +9/-9 | 5,091 | $5 / $25 | 200K |
| 6 | 5-9 | Anthropic · Proprietary | 1463 +8/-8 | 6,371 | $5 / $25 | 200K |
| 7 | 5-12 | Alibaba · Proprietary | 1454 +14/-14 | 1,911 | $0.33 / $1.95 | 1M |
| 8 | 6-15 | OpenAI · Proprietary | 1448 +18/-18 | 1,323 | $2.50 / $15 | 1.1M |
| 9 | 7-15 | Google · Proprietary | 1440 +9/-9 | 5,300 | $2 / $12 | 1M |
| 10 | 5-27 | Z.ai · MIT | 1436 +60/-60 | 119 | $0.39 / $1.75 | 202.8K |
| 11 | 7-16 | Google · Proprietary | 1430 +10/-10 | 3,986 | $0.50 / $3 | 1M |
| 12 | 8-16 | Z.ai · MIT | 1429 +10/-10 | 4,344 | $1 / $3.20 | 202.8K |
| 13 | 7-18 | OpenAI · Proprietary | 1425 +17/-17 | 1,284 | $2.50 / $15 | 1.1M |
| 14 | 8-16 | Moonshot · Modified MIT | 1423 +9/-9 | 6,164 | $0.60 / $3 | N/A |
| 15 | 8-17 | Xiaomi · Proprietary | 1422 +12/-12 | 2,855 | $1 / $3 | 1M |
| 16 | 10-20 | MiniMax · Modified MIT | 1416 +12/-12 | 2,646 | $0.30 / $1.20 | 196.6K |
| 17 | 13-23 | Google · Proprietary | 1402 +11/-11 | 3,393 | $2 / $12 | 1M |
| 18 | 14-26 | OpenAI · Proprietary | 1396 +13/-13 | 2,601 | $1.75 / $14 | 400K |
| 19 | 15-26 | Moonshot · Modified MIT | 1395 +12/-12 | 3,021 | $0.38 / $1.72 | 262.1K |
| 20 | 15-26 |  | 1394 +12/-12 | 2,980 | $2 / $6 | 2M |
| 21 | 16-27 | Anthropic · Proprietary | 1386 +9/-9 | 5,427 | $3 / $15 | 200K |
| 22 | 16-28 | OpenAI · Proprietary | 1386 +15/-16 | 1,610 | $2.50 / $15 | 1.1M |
| 23 | 16-28 | Anthropic | 1381 +10/-10 | 4,423 | $3 / $15 | 200K |
| 24 | 17-28 | Alibaba · Apache 2.0 | 1380 +9/-9 | 5,205 | $0.39 / $2.34 | 262.1K |
| 25 | 17-28 | MiniMax · Modified MIT | 1378 +9/-9 | 6,226 | $0.12 / $0.99 | 196.6K |
| 26 | 20-29 |  | 1374 +8/-8 | 6,580 | $0.50 / $3 | 1M |
| 27 | 17-29 | MiniMax · MIT | 1373 +13/-13 | 2,492 | $0.29 / $0.95 | 196.6K |
| 28 | 22-30 | DeepSeek · MIT | 1363 +10/-10 | 3,897 | $0.26 / $0.38 | 163.8K |
| 29 | 26-30 | Alibaba · Apache 2.0 | 1358 +10/-10 | 4,182 | $0.26 / $2.08 | 262.1K |
| 30 | 28-32 | DeepSeek · MIT | 1346 +9/-9 | 4,778 | $0.26 / $0.38 | 163.8K |
| 31 | 30-34 | Alibaba · Apache 2.0 | 1336 +10/-10 | 3,845 | $0.20 / $1.56 | 262.1K |
| 32 | 30-35 | OpenAI · Proprietary | 1327 +9/-9 | 4,622 | $1.75 / $14 | 400K |
| 33 | 31-36 | Anthropic · Proprietary | 1319 +8/-8 | 5,878 | $1 / $5 | 200K |
| 34 | 31-36 | Moonshot · Modified MIT | 1319 +9/-9 | 5,368 | $1.15 / $8 | 262.1K |
| 35 | 32-36 |  | 1312 +12/-12 | 2,609 | $0.09 / $0.29 | 262.1K |
| 36 | 33-36 | OpenAI · Proprietary | 1306 +12/-12 | 2,859 | $1.25 / $10 | 400K |
| 37 | 37-38 | Alibaba · Apache 2.0 | 1265 +10/-10 | 4,439 | $0.40 / $1.60 | 262.1K |
| 38 | 37-42 |  | 1237 +20/-20 | 912 | $0.09 / $0.29 | 262.1K |
| 39 | 38-42 | Alibaba · Apache 2.0 | 1229 +17/-17 | 1,565 | $0.16 / $1.30 | 262.1K |
| 40 | 38-42 | Google · Proprietary | 1226 +10/-10 | 4,870 | $0.25 / $1.50 | 1M |
| 41 | 38-42 | xAI · Proprietary | 1225 +18/-18 | 1,459 | $0.20 / $0.50 | 2M |
| 42 | 38-43 | Alibaba · Proprietary | 1217 +18/-18 | 1,365 | N/A | N/A |
| 43 | 42-44 | Mistral · Modified MIT | 1163 +43/-43 | 225 | N/A | N/A |
| 44 | 43-44 | Inception AI · Proprietary | 1156 +24/-24 | 850 | $0.25 / $0.75 | 128K |
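
The page does not say how the rank spread is derived. One plausible reading, offered here only as an assumption, is that it comes from overlapping score intervals: a model's best possible rank is one plus the number of models whose lower bound lies above its upper bound, and its worst possible rank is one plus the number of models whose upper bound lies above its lower bound. The sketch below applies that rule to the four highest-rated entries (1542 +12/-12, 1536 +11/-11, 1532 +18/-18, 1515 +9/-9). The function name `rank_spread` is hypothetical.

```python
from typing import List, Tuple

# (score, plus, minus) for the four highest-rated entries listed above.
TOP_FOUR: List[Tuple[int, int, int]] = [
    (1542, 12, 12),
    (1536, 11, 11),
    (1532, 18, 18),
    (1515, 9, 9),
]


def rank_spread(entries: List[Tuple[int, int, int]]) -> List[Tuple[int, int]]:
    """Return (best_rank, worst_rank) per entry under the ASSUMED overlap rule.

    This is an illustrative guess at the rule, not the leaderboard's
    documented method.
    """
    intervals = [(score - minus, score + plus) for score, plus, minus in entries]
    spreads = []
    for i, (low, high) in enumerate(intervals):
        others = [iv for j, iv in enumerate(intervals) if j != i]
        # Models that are certainly better: their lower bound beats our upper bound.
        best = 1 + sum(1 for other_low, _ in others if other_low > high)
        # Models that might be better: their upper bound beats our lower bound.
        worst = 1 + sum(1 for _, other_high in others if other_high > low)
        spreads.append((best, worst))
    return spreads


print(rank_spread(TOP_FOUR))  # [(1, 3), (1, 3), (1, 4), (3, 4)]
```

On these four rows the assumed rule reproduces the listed spreads (1-3, 1-3, 1-4, 3-4). Because the displayed scores and intervals are rounded, boundary cases elsewhere in the table may differ by one position.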

Leaderboard Plots

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)
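
As a rough illustration of how a bootstrapped interval is obtained, the sketch below resamples a list of battle outcomes with replacement and takes percentiles of the resampled statistic. It deliberately swaps in a simpler statistic (one model's raw win rate) for the fitted model strength shown on the leaderboard; the data, the function name `bootstrap_interval`, and its parameters are all hypothetical.

```python
import random
from typing import List, Tuple


def bootstrap_interval(
    outcomes: List[int],
    n_resamples: int = 1000,
    alpha: float = 0.05,
    seed: int = 0,
) -> Tuple[float, float]:
    """Percentile bootstrap interval for the mean of `outcomes` (1 = win, 0 = loss)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(outcomes)
    means = []
    for _ in range(n_resamples):
        # Draw n outcomes with replacement and record the resampled win rate.
        sample = [outcomes[rng.randrange(n)] for _ in range(n)]
        means.append(sum(sample) / n)
    means.sort()
    low_idx = round((alpha / 2) * (n_resamples - 1))
    high_idx = round((1 - alpha / 2) * (n_resamples - 1))
    return means[low_idx], means[high_idx]


# Hypothetical data: 60 wins and 40 losses for one model.
outcomes = [1] * 60 + [0] * 40
low, high = bootstrap_interval(outcomes)
print(f"observed win rate 0.60, 95% bootstrap interval [{low:.2f}, {high:.2f}]")
```

Fewer battles give a wider interval, which matches the pattern in the table where entries with few votes carry the largest +/- values.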

Battle Count for Each Combination of Models (without Ties)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)
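
The win-fraction and average-win-rate plots can be sketched from a list of non-tied battles, as below. This is a minimal sketch under stated assumptions: the battle records and model names are hypothetical, and "uniform sampling" is read here as giving every opponent equal weight regardless of how many battles were played against it.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Battle = Tuple[str, str, str]  # (model_a, model_b, winner); ties already excluded

# Hypothetical battles between three placeholder models.
BATTLES: List[Battle] = (
    [("a", "b", "a")] * 3
    + [("a", "b", "b")]
    + [("a", "c", "a"), ("a", "c", "c")]
    + [("b", "c", "b")] * 2
)


def win_fractions(battles: List[Battle]) -> Dict[Tuple[str, str], float]:
    """Fraction of non-tied battles that the first model of each pair won."""
    wins: Dict[Tuple[str, str], int] = defaultdict(int)
    totals: Dict[Tuple[str, str], int] = defaultdict(int)
    for model_a, model_b, winner in battles:
        loser = model_b if winner == model_a else model_a
        wins[(winner, loser)] += 1
        # Count the battle for both orderings of the pair.
        totals[(model_a, model_b)] += 1
        totals[(model_b, model_a)] += 1
    return {pair: wins[pair] / count for pair, count in totals.items()}


def average_win_rates(fractions: Dict[Tuple[str, str], float]) -> Dict[str, float]:
    """Mean win fraction per model, weighting every opponent equally."""
    per_model: Dict[str, List[float]] = defaultdict(list)
    for (model, _opponent), fraction in fractions.items():
        per_model[model].append(fraction)
    return {model: sum(vals) / len(vals) for model, vals in per_model.items()}


fractions = win_fractions(BATTLES)
averages = average_win_rates(fractions)
print(fractions[("a", "b")])  # 0.75
print(averages)               # {'a': 0.625, 'b': 0.625, 'c': 0.25}
```

Weighting opponents equally keeps a model's average from being dominated by whichever pairing happened to be sampled most often.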