Vision Arena 📚 Homework

View overall rankings across multimodal AI models capable of reasoning over visual inputs.

Jun 5, 2026
78,605 votes
83 models
| Rank | Rank Spread | Organization · License | Score | Votes | Price | Context |
|---|---|---|---|---|---|---|
| 1 | 1-11 | OpenAI · Proprietary | 1341±17 | 1,433 | $5 / $30 | 1.1M |
| 2 | 1-14 | Anthropic · Proprietary | 1333±14 | 1,985 | $5 / $25 | 1M |
| 3 | 1-17 | OpenAI · Proprietary | 1330±17 | 1,487 | $5 / $30 | 1.1M |
| 4 | 1-28 | Anthropic · Proprietary | 1326±31 | 366 | $5 / $25 | 1M |
| 5 | 1-18 | Anthropic · Proprietary | 1323±15 | 1,867 | $5 / $25 | 1M |
| 6 | 1-20 | Anthropic · Proprietary | 1322±15 | 1,840 | $5 / $25 | 1M |
| 7 | 1-20 | OpenAI · Proprietary | 1321±15 | 1,762 | $2.50 / $15 | 1.1M |
| 8 | 1-22 | Anthropic · Proprietary | 1316±13 | 2,380 | $5 / $25 | 1M |
| 9 | 1-24 | OpenAI · Proprietary | 1314±15 | 1,789 | $2.50 / $15 | 1.1M |
| 10 | 2-21 | Google · Proprietary | 1314±10 | 3,999 | $0.50 / $3 | 1M |
| 11 | 2-23 | Google · Proprietary | 1311±11 | 3,684 | $2 / $12 | 1M |
| 12 | 1-25 | Google · Proprietary | 1311±14 | 1,797 | $2 / $12 | 1M |
| 13 | 2-30 | Moonshot · Modified MIT | 1306±16 | 1,539 | $0.95 / $4 | 262.1K |
| 14 | 3-31 | Anthropic · Proprietary | 1302±13 | 2,410 | $3 / $15 | 1M |
| 15 | 3-34 | Meta · Proprietary | 1299±20 | 910 | N/A | N/A |
| 16 | 5-31 | Google · Apache 2.0 | 1297±10 | 4,477 | $0.14 / $0.40 | 262.1K |
| 17 | 1-44 | Anthropic · Proprietary | 1296±28 | 441 | $5 / $25 | 1M |
| 18 | 5-33 | OpenAI · Proprietary | 1294±13 | 2,394 | $1.75 / $14 | 128K |
| 19 | 8-33 |  | 1293±10 | 3,732 | $0.50 / $3 | 1M |
| 20 | 7-36 | OpenAI · Proprietary | 1291±14 | 2,201 | $0.75 / $4.50 | 400K |
| 21 | 3-53 | MiniMax · Proprietary | 1288±30 | 422 | $0.60 / $2.40 | N/A |
| 22 | 10-36 | Moonshot · Modified MIT | 1288±12 | 2,633 | $0.60 / $3 | N/A |
| 23 | 4-48 | Alibaba · Proprietary | 1288±24 | 642 | $0.40 / $1.60 | 1M |
| 24 | 12-43 | Google · Apache 2.0 | 1283±12 | 2,744 | N/A | N/A |
| 25 | 13-43 | OpenAI · Proprietary | 1283±12 | 2,810 | $1.75 / $14 | 400K |
| 26 | 12-43 | Alibaba · Apache 2.0 | 1283±13 | 2,279 | $0.39 / $2.34 | 262.1K |
| 27 | 9-50 | OpenAI · Proprietary | 1282±19 | 1,129 | $5 / $30 | 1.1M |
| 28 | 11-47 | OpenAI · Proprietary | 1282±16 | 1,211 | $1.25 / $10 | 400K |
| 29 | 12-46 | Bytedance · Proprietary | 1281±15 | 1,746 | N/A | N/A |
| 30 | 13-52 | OpenAI · Proprietary | 1276±16 | 1,348 | $1.25 / $10 | 128K |
| 31 | 16-50 | Google · Proprietary | 1275±12 | 3,014 | $0.25 / $1.50 | 1M |
| 32 | 18-49 | Google · Proprietary | 1272±9 | 5,410 | $1.25 / $10 | 1M |
| 33 | 16-54 |  | 1270±14 | 2,180 | $2 / $6 | 2M |
| 34 | 19-55 | OpenAI · Proprietary | 1267±12 | 2,919 | $1.75 / $14 | 400K |
| 35 | 14-62 |  | 1266±22 | 624 | $0.30 / $2.50 | 1M |
| 36 | 19-60 | xAI · Proprietary | 1262±17 | 1,356 | $1.25 / $2.50 | 1M |
| 37 | 21-58 |  | 1262±13 | 2,302 | $2 / $6 | 2M |
| 38 | 21-58 | Xiaomi · MIT | 1261±14 | 2,073 | $0.14 / $0.28 | 1M |
| 39 | 21-60 | Alibaba · Apache 2.0 | 1260±14 | 1,614 | $0.20 / $0.88 | 262.1K |
| 40 | 21-62 | Z.ai · Proprietary | 1259±14 | 2,152 | $1.20 / $4 | 202.8K |
| 41 | 21-62 | OpenAI · Proprietary | 1257±16 | 1,486 | $1.25 / $10 | 400K |
| 42 | 25-62 | Alibaba · Apache 2.0 | 1253±14 | 1,856 | $0.26 / $2.08 | 262.1K |
| 43 | 24-62 | OpenAI · Proprietary | 1253±15 | 1,401 | $1.25 / $10 | 400K |
| 44 | 21-63 | Moonshot · Modified MIT | 1250±24 | 563 | $0.40 / $1.90 | 262.1K |
| 45 | 28-63 | OpenAI · Proprietary | 1249±15 | 1,656 | $1.10 / $4.40 | 200K |
| 46 | 27-63 | Xiaomi · Proprietary | 1248±16 | 1,561 | $0.40 / $2 | 262.1K |
| 47 | 29-63 | OpenAI · Proprietary | 1248±16 | 1,538 | $2 / $8 | 1M |
| 48 | 32-63 | OpenAI · Proprietary | 1247±13 | 2,170 | $5 / $15 | 128K |
| 49 | 31-63 | OpenAI · Proprietary | 1247±14 | 2,023 | $2 / $8 | 200K |
| 50 | 21-66 | Baidu · Proprietary | 1242±29 | 361 | N/A | N/A |
| 51 | 35-63 | Alibaba · Apache 2.0 | 1241±14 | 1,979 | $0.20 / $1.56 | 262.1K |
| 52 | 37-63 | Google · Proprietary | 1237±9 | 4,866 | $0.30 / $2.50 | 1M |
| 53 | 25-70 | Anthropic · Proprietary | 1235±31 | 334 | $15 / $75 | 200K |
| 54 | 26-70 | Tencent · Proprietary | 1234±32 | 312 | N/A | N/A |
| 55 | 39-64 | OpenAI · Proprietary | 1232±14 | 2,177 | $0.20 / $1.25 | 400K |
| 56 | 35-66 | OpenAI · Proprietary | 1231±19 | 1,013 | $0.25 / $2 | 400K |
| 57 | 33-71 | Alibaba · Apache 2.0 | 1227±31 | 311 | $0.26 / $2.60 | 131.1K |
| 58 | 34-74 | Anthropic · Proprietary | 1221±35 | 240 | $3 / $15 | 200K |
| 59 | 44-70 | OpenAI · Proprietary | 1220±16 | 1,351 | $0.40 / $1.60 | 1M |
| 60 | 37-71 | Mistral · Modified MIT | 1220±27 | 564 | $1.50 / $7.50 | 262.1K |
| 61 | 35-76 | Anthropic · Proprietary | 1219±35 | 244 | $3 / $15 | 1M |
| 62 | 31-82 | Anthropic · Proprietary | 1216±44 | 170 | $15 / $75 | 200K |
| 63 | 39-82 | Anthropic | 1205±40 | 183 | $3 / $15 | 200K |
| 64 | 52-80 |  | 1197±22 | 620 | $0.10 / $0.40 | 1M |
| 65 | 55-82 |  | 1188±16 | 1,378 | $0.10 / $0.40 | 1M |
| 66 | 55-82 | Google · Proprietary | 1187±21 | 789 | $0.10 / $0.40 | 1M |
| 67 | 53-82 | OpenAI · Proprietary | 1181±34 | 271 | $0.05 / $0.40 | 400K |
| 68 | 53-82 | StepFun · Apache 2.0 | 1179±38 | 226 | $0.57 / $1.42 | 65.5K |
| 69 | 58-82 | Mistral · Apache 2.0 | 1174±23 | 632 | $0.10 / $0.30 | 32K |
| 70 | 60-82 | xAI · Proprietary | 1171±18 | 1,124 | $3 / $15 | 256K |
| 71 | 61-82 | xAI · Proprietary | 1170±15 | 1,950 | $0.20 / $0.50 | 2M |
| 72 | 60-82 | Google · Gemma | 1169±18 | 1,068 | $0.08 / $0.16 | 131.1K |
| 73 | 55-83 | Z.ai · MIT | 1169±38 | 222 | $0.60 / $1.80 | 65.5K |
| 74 | 62-82 | Mistral · Proprietary | 1168±13 | 2,183 | $2.70 / $8.10 | 32K |
| 75 | 55-83 | Anthropic · Proprietary | 1167±38 | 224 | $3 / $15 | 200K |
| 76 | 62-82 | Mistral · Proprietary | 1163±20 | 843 | $0.40 / $2 | 131.1K |
| 77 | 61-83 |  | 1161±24 | 549 | $0.63 / $1.80 | 131.1K |
| 78 | 62-83 |  | 1157±20 | 840 | $0.10 / $0.30 | 32K |
| 79 | 60-83 | Z.ai · MIT | 1153±36 | 283 | $0.30 / $0.90 | 131.1K |
| 80 | 63-83 |  | 1149±25 | 496 | $0.40 / $0.70 | 8.2K |
| 81 | 62-83 | Anthropic · Proprietary | 1142±38 | 232 | $0.80 / $4 | 200K |
| 82 | 63-83 | StepFun · Proprietary | 1138±35 | 250 | N/A | N/A |
| 83 | 75-83 | Ai2 · Apache 2.0 | 1089±51 | 186 | $0.20 / $0.20 | 36.9K |
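
The scores above are reported as a value with a ± margin, for example 1341±17 and 1333±14. A minimal sketch of how one might read two such entries is to expand each into a low/high range and check whether the ranges overlap. When they do, the ordering of the two models is not firmly settled by the scores alone. The helper names `parse_score` and `intervals_overlap` are hypothetical, and this is not claimed to be how the leaderboard derives its rank spread.

```python
def parse_score(text):
    """Turn a 'score±margin' string such as '1341±17' into a (low, high) pair."""
    center, margin = text.split("±")
    return int(center) - int(margin), int(center) + int(margin)


def intervals_overlap(score_a, score_b):
    """Return True when the two ± ranges share at least one point."""
    low_a, high_a = parse_score(score_a)
    low_b, high_b = parse_score(score_b)
    return low_a <= high_b and low_b <= high_a


# Values copied from the leaderboard rows (ranks 1, 2, and 83).
top = "1341±17"
second = "1333±14"
last = "1089±51"

print(intervals_overlap(top, second))
print(intervals_overlap(top, last))
```

The overlap test treats ranges that merely touch at an endpoint as overlapping, which is a conservative choice for this kind of comparison.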

Default Leaderboard Plots

Battle Count for Each Combination of Models (without Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)
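
The plot titles above refer to pairwise win fractions over non-tied battles, an average win rate against all other models, and bootstrapped confidence intervals. The following is a rough sketch of those three ideas, assuming battle records shaped as `(model_a, model_b, winner)` tuples with `winner` equal to `"a"`, `"b"`, or `"tie"`. That record shape, the function names, and the toy data are all assumptions for illustration. The sketch bootstraps the average win rate as a simple stand-in; it does not reproduce the leaderboard's rating model.

```python
import random
from collections import defaultdict


def win_fractions(battles):
    """Map (x, y) to the fraction of non-tied x-vs-y battles that x won."""
    wins = defaultdict(int)
    totals = defaultdict(int)
    for model_a, model_b, winner in battles:
        if winner == "tie":
            continue  # ties are excluded, as in the plot titles
        totals[(model_a, model_b)] += 1
        totals[(model_b, model_a)] += 1
        if winner == "a":
            wins[(model_a, model_b)] += 1
        else:
            wins[(model_b, model_a)] += 1
    return {pair: wins[pair] / count for pair, count in totals.items()}


def average_win_rate(battles):
    """Average each model's win fraction over its opponents, weighting opponents equally."""
    per_model = defaultdict(list)
    for (model, _opponent), fraction in win_fractions(battles).items():
        per_model[model].append(fraction)
    return {model: sum(vals) / len(vals) for model, vals in per_model.items()}


def bootstrap_interval(battles, model, rounds=200, alpha=0.05, seed=0):
    """Percentile interval for one model's average win rate, resampling battles with replacement."""
    rng = random.Random(seed)
    battles = list(battles)
    samples = []
    for _ in range(rounds):
        resampled = [rng.choice(battles) for _ in battles]
        rate = average_win_rate(resampled).get(model)
        if rate is not None:  # the model may be missing from a resample
            samples.append(rate)
    if not samples:
        raise ValueError("model never appeared in a non-tied resampled battle")
    samples.sort()
    low = samples[int((alpha / 2) * len(samples))]
    high = samples[min(len(samples) - 1, int((1 - alpha / 2) * len(samples)))]
    return low, high


# Toy data with made-up model names, for illustration only.
toy_battles = [
    ("x", "y", "a"), ("x", "y", "a"), ("x", "y", "b"),
    ("y", "z", "a"), ("x", "z", "tie"), ("z", "x", "b"),
]
print(average_win_rate(toy_battles))
print(bootstrap_interval(toy_battles, "x"))
```

Averaging per-opponent fractions, instead of pooling all battles, keeps a frequently matched pair from dominating the result, which is one reading of "uniform sampling" in the last plot title.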