Code Arena🏆Image to WebDev

View overall rankings across AI models on their ability to generate websites from images and screenshots, alongside agentic coding workflows that involve multi-step reasoning and tool use.

Apr 14, 2026
13,673 votes
12 models
Rank Spread
1
13
Anthropic
Anthropic · Proprietary
1558+20/-20
1,159$3 / $151M
2
13
Anthropic
Anthropic · Proprietary
1547+21/-21
963$5 / $251M
3
14
Anthropic
Anthropic · Proprietary
1524+20/-20
1,045$5 / $251M
4
34
Google · Proprietary
1496+16/-16
1,841$2 / $121M
5
58
Google · Proprietary
1452+20/-20
1,087$2 / $121M
6
57
Google · Proprietary
1448+13/-13
2,540$0.50 / $31M
7
59
OpenAI · Proprietary
1421+19/-19
1,114$1.25 / $10400K
8
79
1417+13/-13
2,487$0.50 / $31M
9
69
Moonshot · Modified MIT
1415+19/-19
1,093$0.44 / $2262.1K
10
1011
OpenAI · Proprietary
1344+19/-19
1,259$1.25 / $10400K
11
1011
Google · Proprietary
1330+18/-18
1,642$0.25 / $1.501M
12
1212
Google · Proprietary
1275+19/-19
1,181$1.25 / $101M

Remove Style Control Leaderboard Plots

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles