Code ArenaReact
View overall rankings across AI models on agentic coding tasks involving multi-step reasoning and tool use.
Apr 1, 2026
98,690 votes
43 models
Rank Spread | ||||||
|---|---|---|---|---|---|---|
| 1 | 12 | Anthropic · Proprietary | 1540+13/-13 | 3,022 | $5 / $25 | 1M |
| 2 | 13 | Anthropic · Proprietary | 1534+12/-12 | 3,727 | $5 / $25 | 1M |
| 3 | 23 | Anthropic · Proprietary | 1513+9/-9 | 6,046 | $3 / $15 | 1M |
| 4 | 47 | Anthropic · Proprietary | 1477+9/-9 | 5,251 | $5 / $25 | 200K |
| 5 | 48 | Anthropic · Proprietary | 1460+9/-9 | 5,955 | $5 / $25 | 200K |
| 6 | 415 | Alibaba · Proprietary | 1452+20/-20 | 968 | N/A | N/A |
| 7 | 515 | OpenAI · Proprietary | 1448+18/-18 | 1,333 | N/A | N/A |
| 8 | 615 | Google · Proprietary | 1440+10/-10 | 4,762 | $2 / $12 | 1M |
| 9 | 426 | Z.ai · MIT | 1435+60/-60 | 119 | $0.39 / $1.75 | 202.8K |
| 10 | 615 | Google · Proprietary | 1430+10/-10 | 3,988 | $0.50 / $3 | 1M |
| 11 | 615 | Z.ai · MIT | 1429+11/-11 | 3,952 | $1 / $3.20 | 202.8K |
| 12 | 616 | Xiaomi · Proprietary | 1424+13/-13 | 2,527 | $1 / $3 | 1M |
| 13 | 615 | Moonshot · Modified MIT | 1423+9/-9 | 5,591 | $0.60 / $3 | N/A |
| 14 | 616 | MiniMax · Proprietary | 1421+13/-13 | 2,330 | $0.30 / $1.20 | 204.8K |
| 15 | 618 | OpenAI · Proprietary | 1415+17/-17 | 1,388 | N/A | N/A |
| 16 | 1223 | Google · Proprietary | 1400+11/-11 | 3,393 | $2 / $12 | 1M |
| 17 | 1426 | OpenAI · Proprietary | 1395+13/-13 | 2,605 | $1.75 / $14 | 400K |
| 18 | 1425 | Moonshot · Modified MIT | 1395+12/-12 | 3,023 | $0.38 / $1.91 | 262.1K |
| 19 | 1526 | Anthropic · Proprietary | 1386+9/-9 | 5,525 | $3 / $15 | 200K |
| 20 | 1526 | MiniMax · Modified MIT | 1385+9/-9 | 5,770 | $0.12 / $1 | 196.6K |
| 21 | 1526 | Anthropic · Proprietary | 1382+10/-10 | 4,564 | $3 / $15 | 200K |
| 22 | 1527 | xAI · Proprietary | 1380+12/-12 | 2,639 | $2 / $6 | 2M |
| 23 | 1627 | Alibaba · Apache 2.0 | 1376+9/-9 | 4,806 | $0.39 / $2.34 | 262.1K |
| 24 | 1727 | Google · Proprietary | 1374+8/-8 | 6,141 | $0.50 / $3 | 1M |
| 25 | 1628 | MiniMax · MIT | 1372+13/-13 | 2,494 | $0.27 / $0.95 | 196.6K |
| 26 | 1528 | OpenAI · Proprietary | 1372+19/-19 | 1,027 | $0.75 / $4.50 | 400K |
| 27 | 2229 | DeepSeek · MIT | 1362+10/-10 | 4,047 | $0.26 / $0.38 | 163.8K |
| 28 | 2530 | Alibaba · Apache 2.0 | 1354+10/-10 | 3,734 | $0.26 / $2.08 | 262.1K |
| 29 | 2731 | DeepSeek · MIT | 1343+10/-10 | 4,415 | $0.26 / $0.38 | 163.8K |
| 30 | 2834 | Alibaba · Apache 2.0 | 1333+11/-11 | 3,461 | $0.20 / $1.56 | 262.1K |
| 31 | 2934 | OpenAI · Proprietary | 1327+9/-9 | 4,763 | $1.75 / $14 | 400K |
| 32 | 3035 | Moonshot · Modified MIT | 1318+9/-9 | 5,231 | $1.15 / $8 | 262.1K |
| 33 | 3035 | Anthropic · Proprietary | 1317+9/-9 | 5,422 | $1 / $5 | 200K |
| 34 | 3035 | Xiaomi · MIT | 1311+12/-12 | 2,611 | $0.09 / $0.29 | 262.1K |
| 35 | 3235 | OpenAI · Proprietary | 1305+12/-12 | 2,854 | $1.25 / $10 | 400K |
| 36 | 3637 | Alibaba · Apache 2.0 | 1264+10/-10 | 4,582 | $0.40 / $1.60 | 262.1K |
| 37 | 3641 | Xiaomi · MIT | 1236+20/-20 | 912 | $0.09 / $0.29 | 262.1K |
| 38 | 3741 | Alibaba · Apache 2.0 | 1228+17/-17 | 1,566 | $0.16 / $1.30 | 262.1K |
| 39 | 3741 | Google · Proprietary | 1226+11/-11 | 4,587 | $0.25 / $1.50 | 1M |
| 40 | 3741 | xAI · Proprietary | 1224+17/-17 | 1,459 | $0.20 / $0.50 | 2M |
| 41 | 3742 | Alibaba · Proprietary | 1216+18/-18 | 1,366 | N/A | N/A |
| 42 | 4243 | Inception AI · Proprietary | 1170+22/-22 | 957 | $0.25 / $0.75 | 128K |
| 43 | 4143 | Mistral · Modified MIT | 1163+43/-43 | 225 | N/A | N/A |