Code Arena | WebDev🏆Overall

View overall rankings across AI models on front-end web development tasks, including agentic coding workflows that require multi-step reasoning and tool use.

May 7, 2026

288,203 votes

77 models

Rank	Rank Spread	Model	Organization · License	Score (+/- CI)	Votes	Price per 1M tokens (input / output)	Context
1	1-2	claude-opus-4-7-thinking	Anthropic · Proprietary	1570 +12/-12	3,309	$5 / $25	1M
2	1-4	claude-opus-4-7	Anthropic · Proprietary	1560 +12/-12	3,333	$5 / $25	1M
3	2-5	claude-opus-4-6-thinking	Anthropic · Proprietary	1549 +9/-9	6,382	$5 / $25	1M
4	2-5	claude-opus-4-6	Anthropic · Proprietary	1544 +8/-8	7,325	$5 / $25	1M
5	3-8	glm-5.1	Z.ai · MIT	1531 +11/-11	3,609	$1.40 / $4.40	202.8K
6	5-8	claude-sonnet-4-6	Anthropic · Proprietary	1524 +8/-8	9,434	$3 / $15	1M
7	5-8	kimi-k2.6	Moonshot · Modified MIT	1523 +12/-12	2,651	$0.95 / $4	262.1K
8	5-11	muse-spark	Meta · Proprietary	1509 +16/-16	1,629	N/A	N/A
9	8-12	gpt-5.5-high (codex-harness)	OpenAI · Proprietary	1491 +12/-12	2,765	N/A	N/A
10	8-11	claude-opus-4-5-20251101-thinking-32k	Anthropic · Proprietary	1490 +7/-7	13,063	$5 / $25	200K
11	8-16	qwen3.6-max-preview	Alibaba · Proprietary	1478 +17/-17	1,343	$1.04 / $6.24	262.1K
12	10-16	mimo-v2.5-pro	Xiaomi · MIT	1472 +11/-11	3,278	$1 / $3	1M
13	11-16	claude-opus-4-5-20251101	Anthropic · Proprietary	1467 +6/-6	15,308	$5 / $25	200K
14	11-19	qwen3.6-plus	Alibaba · Proprietary	1463 +10/-10	4,416	$0.33 / $1.95	1M
15	11-25	gpt-5.4-high (codex-harness)	OpenAI · Proprietary	1457 +17/-17	1,482	$2.50 / $15	1.1M
16	11-25	deepseek-v4-pro-thinking	DeepSeek · MIT	1454 +16/-16	1,471	$0.43 / $0.87	1M
17	14-22	gemini-3.1-pro-preview	Google · Proprietary	1452 +7/-7	8,551	$2 / $12	1M
18	14-27	mimo-v2.5	Xiaomi · MIT	1443 +13/-13	2,198	$0.40 / $2	1M
19	14-27	gpt-5.5 (codex-harness)	OpenAI · Proprietary	1443 +12/-13	2,522	N/A	N/A
20	15-27	glm-4.7	Z.ai · MIT	1440 +10/-10	4,885	$0.40 / $1.75	202.8K
21	15-26	gemini-3-pro	Google · Proprietary	1438 +7/-7	17,166	$2 / $12	1M
22	15-27	gpt-5.4-medium (codex-harness)	OpenAI · Proprietary	1437 +16/-16	1,448	$2.50 / $15	1.1M
23	16-27	gemini-3-flash	Google · Proprietary	1437 +7/-7	13,281	$0.50 / $3	1M
24	16-27	glm-5	Z.ai · MIT	1436 +8/-8	6,574	$1 / $3.20	202.8K
25	16-27	mimo-v2-pro	Xiaomi · Proprietary	1432 +9/-9	5,440	$1 / $3	1M
26	18-27	kimi-k2.5-thinking	Moonshot · Modified MIT	1430 +7/-7	9,307	$0.60 / $3	N/A
27	27-37	kimi-k2.5-instant	Moonshot · Modified MIT	1408 +11/-11	3,610	$0.44 / $2	262.1K
28	27-37	minimax-m2.7	MiniMax · Modified MIT	1407 +9/-9	4,998	$0.30 / $1.20	196.6K
29	27-41	gpt-5.3-codex (codex-harness)	OpenAI · Proprietary	1406 +12/-12	2,962	$1.75 / $14	400K
30	27-44	gpt-5.2	OpenAI · Proprietary	1404 +17/-17	1,457	$1.75 / $14	400K
31	27-44	gpt-5.4-mini-high	OpenAI · Proprietary	1398 +10/-10	4,023	$0.75 / $4.50	400K
32	27-45	grok-4.3	xAI · Proprietary	1397 +14/-14	1,905	$1.25 / $2.50	1M
33	27-44	grok-4.20-beta-0309-reasoning	xAI · Proprietary	1396 +9/-9	5,715	$2 / $6	2M
34	27-46	gpt-5-medium	OpenAI · Proprietary	1393 +13/-13	3,755	$1.25 / $10	400K
35	27-45	minimax-m2.1-preview	MiniMax · MIT	1392 +8/-8	9,279	$0.29 / $0.95	196.6K
36	27-46	gpt-5.1-medium	OpenAI · Proprietary	1391 +9/-9	6,120	$1.25 / $10	400K
37	29-45	gemini-3-flash (thinking-minimal)	Google · Proprietary	1389 +6/-6	14,862	$0.50 / $3	1M
38	29-46	claude-sonnet-4-5-20250929-thinking-32k	Anthropic · Proprietary	1388 +7/-7	15,737	$3 / $15	200K
39	29-46	qwen3.5-397b-a17b	Alibaba · Apache 2.0	1388 +7/-7	8,189	$0.39 / $2.34	262.1K
40	30-46	claude-sonnet-4-5-20250929	Anthropic · Proprietary	1386 +6/-6	18,401	$3 / $15	200K
41	19-54	gpt-5.4	OpenAI · Proprietary	1385 +45/-45	172	$2.50 / $15	1.1M
42	30-46	claude-opus-4-1-20250805	Anthropic · Proprietary	1385 +9/-9	8,569	$15 / $75	200K
43	29-49	gemma-4-31b	Google · Apache 2.0	1383 +13/-13	2,426	$0.14 / $0.40	262.1K
44	30-48	minimax-m2.5	MiniMax · Modified MIT	1382 +8/-8	7,851	$0.15 / $1.15	196.6K
45	36-50	gpt-5.3-codex (codex-harness)	OpenAI · Proprietary	1372 +11/-11	3,553	$1.75 / $14	400K
46	42-51	deepseek-v3.2-thinking	DeepSeek · MIT	1368 +8/-8	7,917	$0.25 / $0.38	131.1K
47	33-51	hunyuan-hy3-preview	Tencent · tencent-hunyuan-community	1367 +18/-18	1,228	N/A	N/A
48	43-51	qwen3.5-122b-a10b	Alibaba · Apache 2.0	1363 +8/-8	6,899	$0.26 / $2.08	262.1K
49	42-53	gemma-4-26b-a4b	Google · Apache 2.0	1360 +16/-16	1,507	N/A	N/A
50	44-52	glm-4.6	Z.ai · MIT	1355 +9/-9	8,353	$0.39 / $1.90	204.8K
51	45-53	qwen3.5-27b	Alibaba · Apache 2.0	1352 +8/-8	6,435	$0.20 / $1.56	262.1K
52	48-57	gpt-5.1	OpenAI · Proprietary	1340 +7/-7	12,865	$1.25 / $10	400K
53	49-57	mimo-v2-flash (non-thinking)	Xiaomi · MIT	1337 +8/-8	6,734	$0.10 / $0.30	262.1K
54	51-57	gpt-5.2-codex	OpenAI · Proprietary	1334 +8/-8	7,761	$1.75 / $14	400K
55	52-57	deepseek-v3.2	DeepSeek · MIT	1332 +7/-7	10,476	$0.25 / $0.38	131.1K
56	52-58	kimi-k2-thinking-turbo	Moonshot · Modified MIT	1330 +6/-6	15,369	$1.15 / $8	262.1K
57	52-58	gpt-5.1-codex	OpenAI · Proprietary	1329 +10/-10	6,222	$1.25 / $10	400K
58	56-60	claude-haiku-4-5-20251001	Anthropic · Proprietary	1318 +6/-6	19,227	$1 / $5	200K
59	58-61	minimax-m2	MiniMax · Apache 2.0	1304 +9/-9	8,401	$0.26 / $1	196.6K
60	58-62	mimo-v2-flash (thinking)	Xiaomi · MIT	1300 +14/-14	2,097	$0.10 / $0.30	262.1K
61	59-62	deepseek-v3.2-exp	DeepSeek · MIT	1286 +11/-11	4,870	$0.27 / $0.41	163.8K
62	60-62	qwen3-coder-480b-a35b-instruct	Alibaba · Apache 2.0	1281 +7/-7	15,215	$0.40 / $1.60	262.1K
63	63-69	KAT-Coder-Pro-V1	KwaiKAT · Proprietary	1258 +15/-15	1,881	$0.21 / $0.83	256K
64	63-70	qwen3.5-35b-a3b	Alibaba · Apache 2.0	1249 +16/-16	1,815	$0.14 / $1	262.1K
65	63-71	trinity-large-thinking	Arcee AI · Apache 2.0	1245 +19/-19	1,314	$0.22 / $0.85	262.1K
66	63-70	gemini-3.1-flash-lite-preview	Google · Proprietary	1240 +8/-8	7,744	$0.25 / $1.50	1M
67	63-71	gpt-5.1-codex-mini	OpenAI · Proprietary	1239 +17/-17	1,443	$0.25 / $2	400K
68	63-71	qwen3.5-flash	Alibaba · Proprietary	1236 +17/-17	1,562	N/A	N/A
69	63-71	grok-4-1-fast-reasoning	xAI · Proprietary	1234 +9/-9	6,911	$0.20 / $0.50	2M
70	64-73	mistral-large-3	Mistral · Apache 2.0	1222 +20/-20	1,032	$0.50 / $1.50	N/A
71	66-73	grok-4.1-thinking	xAI · Proprietary	1208 +20/-20	1,209	N/A	N/A
72	70-73	gemini-2.5-pro	Google · Proprietary	1203 +13/-13	3,297	$1.25 / $10	1M
73	70-74	devstral-2	Mistral · Modified MIT	1199 +17/-17	1,583	N/A	N/A
74	73-76	mercury-2	Inception AI · Proprietary	1165 +23/-23	947	$0.25 / $0.75	128K
75	74-76	grok-4-fast-reasoning	xAI · Proprietary	1149 +23/-23	934	$0.20 / $0.50	2M
76	74-76	grok-code-fast-1	xAI · Proprietary	1139 +22/-22	982	$0.20 / $1.50	256K
77	77-77	devstral-medium-2507	Mistral · Proprietary	1091 +23/-23	993	$0.40 / $2	128K
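
Each row pairs a score with a +/- interval (for example, 1570 +12/-12). One rough way to read these is to check whether two models' intervals overlap. The sketch below does only that. The `Entry` type and `intervals_overlap` helper are hypothetical names introduced for illustration, and the page does not state how its rank spread is actually derived, so this should be treated as a reading aid rather than the leaderboard's method.

```python
from typing import NamedTuple, Tuple


class Entry(NamedTuple):
    """One leaderboard row: score plus its upper and lower interval widths."""
    model: str
    score: int
    ci_plus: int
    ci_minus: int

    def interval(self) -> Tuple[int, int]:
        # Lower and upper bounds implied by "score +plus/-minus".
        return (self.score - self.ci_minus, self.score + self.ci_plus)


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score intervals share at least one point."""
    a_lo, a_hi = a.interval()
    b_lo, b_hi = b.interval()
    return a_lo <= b_hi and b_lo <= a_hi


# Values copied from the table above.
top = Entry("claude-opus-4-7-thinking", 1570, 12, 12)
second = Entry("claude-opus-4-7", 1560, 12, 12)
glm = Entry("glm-5.1", 1531, 11, 11)

print(intervals_overlap(top, second))  # intervals 1558..1582 and 1548..1572
print(intervals_overlap(top, glm))     # intervals 1558..1582 and 1520..1542
```

Overlapping intervals suggest the ordering between two models is not firmly settled by the current votes, which is consistent with the top two models sharing rank spreads that start at 1.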

Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)
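
The plot titles above mention bootstrapped confidence intervals and win fractions over non-tied battles. As a minimal sketch of the general idea, the code below bootstraps an interval for a single "fraction of A wins" statistic. It is a simplified stand-in: the leaderboard bootstraps a fitted model-strength score, whose exact procedure is not given here, and the battle outcomes, function names, and parameters below are all hypothetical.

```python
import random


def win_fraction(outcomes):
    """Fraction of battles won by model A, ignoring ties."""
    decided = [o for o in outcomes if o != "tie"]
    return sum(o == "a" for o in decided) / len(decided)


def bootstrap_ci(outcomes, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the non-tied win fraction of A."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    decided = [o for o in outcomes if o != "tie"]
    stats = []
    for _ in range(n_resamples):
        # Resample the decided battles with replacement, same sample size.
        sample = rng.choices(decided, k=len(decided))
        stats.append(sum(o == "a" for o in sample) / len(sample))
    stats.sort()
    lo = stats[int((alpha / 2) * n_resamples)]
    hi = stats[int((1 - alpha / 2) * n_resamples) - 1]
    return lo, hi


# Hypothetical data: 60 wins for A, 40 wins for B, 10 ties.
battles = ["a"] * 60 + ["b"] * 40 + ["tie"] * 10
point = win_fraction(battles)
lo, hi = bootstrap_ci(battles)
print(point, lo, hi)
```

With fewer decided battles the resampled fractions spread out more, which matches the pattern in the table where models with few votes carry wider intervals.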