Code Arena🏆Overall

View overall rankings across AI models on agentic coding tasks involving multi-step reasoning and tool use.

Apr 1, 2026
224,709 votes
59 models
Rank Spread
9
616
Z.ai · MIT
1441+10/-10
4,536$1 / $3.20202.8K
10
616
Z.ai · MIT
1439+10/-10
4,876$0.39 / $1.75202.8K
33
3135
Z.ai · MIT
1354+9/-9
8,345$0.39 / $1.90204.8K

Remove Style Control Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles