Code Arena | WebDev 🏆 Overall

View overall rankings across AI models on front-end web development tasks, including agentic coding workflows that require multi-step reasoning and tool use.

Jul 9, 2026

457,644 votes

39 open source models

Rank	Overall rank	Model	Organization · License	Score (+/-)	Rank Spread	Votes	Price	Context
1	2	glm-5.2 (max)	Z.ai · MIT	1580 +10/-10	2-4	4,186	$1.40 / $4.40	1M
2	12	glm-5.1	Z.ai · MIT	1527 +9/-9	8-15	5,276	$1.40 / $4.40	202.8K
3	15	kimi-k2.6	Moonshot · Modified MIT	1514 +7/-7	12-17	8,812	$0.95 / $4	262.1K
4	18	minimax-m3	MiniMax · MiniMax Community License	1496 +9/-9	16-21	5,555	$0.60 / $2.40	N/A
5	22	mimo-v2.5-pro	Xiaomi · MIT	1473 +7/-7	20-25	8,848	$0.43 / $0.87	1M
6	23	kimi-k2.7-code	Moonshot · Modified MIT	1469 +10/-10	20-27	4,042	$0.72 / $3.49	262.1K
7	27	deepseek-v4-pro-thinking	DeepSeek · MIT	1457 +7/-7	23-32	8,269	$0.43 / $0.87	1M
8	29	deepseek-v4-pro	DeepSeek · MIT	1446 +8/-8	25-35	9,028	$0.43 / $0.87	1M
9	31	glm-4.7	Z.ai · MIT	1440 +10/-10	26-39	4,885	$0.40 / $1.75	202.8K
10	35	kimi-k2.5-thinking	Moonshot · Modified MIT	1432 +6/-6	31-39	14,588	$0.60 / $3	N/A
11	37	glm-5	Z.ai · MIT	1430 +8/-8	31-39	7,445	$1 / $3.20	202.8K
12	38	mimo-v2.5	Xiaomi · MIT	1429 +7/-7	31-39	7,955	$0.10 / $0.28	1M
13	39	kimi-k2.5-instant	Moonshot · Modified MIT	1408 +11/-11	39-49	3,610	$0.38 / $2.02	262.1K
14	44	qwen3.5-397b-a17b	Alibaba · Apache 2.0	1396 +6/-6	39-53	13,981	$0.39 / $2.45	256K
15	45	minimax-m2.7	MiniMax · Modified MIT	1395 +7/-7	39-55	9,997	$0.24 / $0.96	204.8K
16	48	minimax-m2.1-preview	MiniMax · MIT	1392 +8/-8	39-55	9,272	$0.30 / $1.20	204.8K
17	55	minimax-m2.5	MiniMax · Modified MIT	1382 +8/-8	43-60	7,858	$0.15 / $0.90	204.8K
18	57	gemma-4-31b	Google · Apache 2.0	1370 +8/-8	51-65	6,016	$0.14 / $0.40	262.1K
19	58	deepseek-v3.2-thinking	DeepSeek · MIT	1368 +8/-8	53-65	7,920	$0.23 / $0.34	131.1K
20	59	qwen3.5-122b-a10b	Alibaba · Apache 2.0	1364 +7/-7	55-65	8,236	$0.26 / $2.08	262.1K
21	61	hunyuan-hy3-preview	Tencent · tencent-hunyuan-community	1361 +17/-17	51-67	1,382	N/A	N/A
22	62	gemma-4-26b-a4b	Google · Apache 2.0	1361 +16/-16	52-67	1,514	N/A	N/A
23	63	qwen3.5-27b	Alibaba · Apache 2.0	1356 +8/-8	56-65	7,742	$0.20 / $1.56	262.1K
24	64	glm-4.6	Z.ai · MIT	1355 +9/-9	56-66	8,349	$0.43 / $1.74	202.8K
25	65	laguna-m.1	Poolside · Apache 2.0	1354 +11/-11	56-67	3,275	$0.20 / $0.40	262.1K
26	67	mimo-v2-flash (non-thinking)	Xiaomi · MIT	1337 +8/-8	63-72	6,728	$0.10 / $0.30	262.1K
27	69	deepseek-v3.2	DeepSeek · MIT	1332 +7/-7	66-72	10,499	$0.23 / $0.34	131.1K
28	71	kimi-k2-thinking-turbo	Moonshot · Modified MIT	1330 +6/-6	66-72	15,362	$1.15 / $8	262.1K
29	73	minimax-m2	MiniMax · Apache 2.0	1305 +9/-9	73-76	8,404	$0.26 / $1.02	204.8K
30	74	laguna-xs.2	Poolside · Apache 2.0	1303 +11/-11	73-76	3,880	$0.10 / $0.20	262.1K
31	75	mimo-v2-flash (thinking)	Xiaomi · MIT	1301 +14/-14	73-77	2,098	$0.10 / $0.30	262.1K
32	76	deepseek-v3.2-exp	DeepSeek · MIT	1288 +11/-11	73-78	4,873	$0.27 / $0.41	163.8K
33	77	qwen3-coder-480b-a35b-instruct	Alibaba · Apache 2.0	1281 +7/-7	75-78	15,214	$0.40 / $1.60	262.1K
34	78	mistral-medium-3.5	Mistral · Modified MIT	1268 +15/-15	76-84	2,163	$1.50 / $7.50	262.1K
35	81	qwen3.5-35b-a3b	Alibaba · Apache 2.0	1250 +16/-16	78-86	1,815	$0.14 / $1	262.1K
36	82	trinity-large-thinking	Arcee AI · Apache 2.0	1243 +19/-19	78-87	1,321	$0.25 / $0.80	262.1K
37	86	mistral-large-3	Mistral · Apache 2.0	1224 +20/-20	80-90	1,034	$0.50 / $1.50	N/A
38	89	devstral-2	Mistral · Modified MIT	1200 +17/-17	86-91	1,588	N/A	N/A
39	90	granite-4.1-8b	IBM · Apache 2.0	1200 +17/-17	86-91	1,771	$0.05 / $0.10	131.1K
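
Each score in the table carries an upper and a lower margin, such as 1580 with +10/-10. The following is a minimal sketch, assuming those margins bound an interval around the score (the page does not state the confidence level), of how a score cell could be parsed and two models' intervals compared. The names `Score`, `parse_score`, and `intervals_overlap` are hypothetical helpers, not part of any leaderboard API.

```python
import re
from typing import NamedTuple


class Score(NamedTuple):
    """A leaderboard score with its upper and lower margins."""
    value: int
    plus: int
    minus: int

    @property
    def low(self) -> int:
        return self.value - self.minus

    @property
    def high(self) -> int:
        return self.value + self.plus


# Accepts "1580+10/-10" as well as "1580 +10/-10".
_SCORE_RE = re.compile(r"^\s*(\d+)\s*\+(\d+)\s*/\s*-(\d+)\s*$")


def parse_score(cell: str) -> Score:
    """Parse a score cell into its value and margins."""
    match = _SCORE_RE.match(cell)
    if match is None:
        raise ValueError(f"unrecognized score cell: {cell!r}")
    value, plus, minus = (int(group) for group in match.groups())
    return Score(value, plus, minus)


def intervals_overlap(a: Score, b: Score) -> bool:
    """Return True when the two closed intervals share at least one point."""
    return a.low <= b.high and b.low <= a.high


# Values copied from the table above.
glm_5_2 = parse_score("1580+10/-10")   # interval 1570..1590
glm_5_1 = parse_score("1527+9/-9")     # interval 1518..1536
glm_5 = parse_score("1430+8/-8")       # interval 1422..1438
mimo_v2_5 = parse_score("1429+7/-7")   # interval 1422..1436

print(intervals_overlap(glm_5_2, glm_5_1))  # False
print(intervals_overlap(glm_5, mimo_v2_5))  # True
```

Under this reading, overlapping intervals are one plausible reason why neighboring models share wide, overlapping rank spreads, while the top entry's interval is clear of the next open source model's.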

Remove Style Control Leaderboard Plots

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)
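
The plot titles above mention win fractions over non-tied battles and confidence intervals obtained via bootstrapping. The page does not describe the exact procedure, so the following is only an illustrative sketch of a generic percentile bootstrap applied to one model's win rate. The function name `bootstrap_win_rate_ci`, its parameters, and the battle outcomes are all hypothetical and are not taken from the leaderboard.

```python
import random
from typing import List, Tuple


def bootstrap_win_rate_ci(
    outcomes: List[int],
    n_resamples: int = 2000,
    alpha: float = 0.05,
    seed: int = 0,
) -> Tuple[float, float]:
    """Percentile bootstrap interval for model A's win rate.

    `outcomes` holds 1 when model A won a battle and 0 when model B won;
    ties are assumed to have been removed beforehand.
    """
    if not outcomes:
        raise ValueError("outcomes must not be empty")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(outcomes)
    rates = []
    for _ in range(n_resamples):
        # Resample the battles with replacement and recompute the win rate.
        sample = rng.choices(outcomes, k=n)
        rates.append(sum(sample) / n)
    rates.sort()
    lo_idx = int((alpha / 2) * n_resamples)
    hi_idx = max(lo_idx, int((1 - alpha / 2) * n_resamples) - 1)
    return rates[lo_idx], rates[hi_idx]


# Hypothetical data: model A wins 60 of 100 non-tied battles.
battles = [1] * 60 + [0] * 40
point_estimate = sum(battles) / len(battles)
low, high = bootstrap_win_rate_ci(battles)
print(f"win rate {point_estimate:.2f}, 95% interval [{low:.2f}, {high:.2f}]")
```

A rating-based leaderboard would typically bootstrap the fitted model strengths rather than a raw win rate, but the resampling step has the same shape: redraw the battles with replacement, recompute the statistic, and read the interval off the sorted results.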

