Vision Arena🔍Entity Recognition

View overall rankings across multimodal AI models capable of reasoning over visual inputs.

Jun 5, 2026
4,666 votes
35 models
Rank Spread
1
113
Google · Proprietary
1302±35
243$2 / $121M
2
113
Google · Proprietary
1299±37
224$0.50 / $31M
3
116
Google · Proprietary
1288±39
207$2 / $121M
4
122
1269±36
209$0.50 / $31M
5
120
Google · Proprietary
1257±21
873$1.25 / $101M
6
123
OpenAI · Proprietary
1256±32
434$1.25 / $10400K
7
131
Moonshot · Modified MIT
1243±43
162$0.60 / $3N/A
8
131
Google · Proprietary
1242±48
132$0.25 / $1.501M
9
131
OpenAI · Proprietary
1240±49
108$1.25 / $10400K
10
131
Alibaba · Apache 2.0
1240±51
112$0.39 / $2.34262.1K
11
131
xAI · Proprietary
1237±34
341$3 / $15256K
12
331
OpenAI · Proprietary
1231±29
505$2 / $8200K
13
135
1227±67
72$2 / $62M
14
331
OpenAI · Proprietary
1226±30
294$5 / $15128K
15
431
Google · Proprietary
1217±24
587$0.30 / $2.501M
16
431
OpenAI · Proprietary
1213±32
398$1.10 / $4.40200K
17
135
OpenAI · Proprietary
1211±59
99$1.75 / $14128K
18
432
OpenAI · Proprietary
1204±40
283$0.25 / $2400K
19
632
OpenAI · Proprietary
1200±32
362$0.40 / $1.601M
20
335
xAI · Proprietary
1197±61
79$0.20 / $0.502M
21
535
OpenAI · Proprietary
1192±43
141$1.25 / $10400K
22
535
Alibaba · Apache 2.0
1191±43
148$0.20 / $0.88262.1K
23
435
OpenAI · Proprietary
1189±49
123$1.75 / $14400K
24
735
OpenAI · Proprietary
1188±31
383$1.25 / $10128K
25
735
1188±31
422$0.10 / $0.401M
26
735
OpenAI · Proprietary
1183±30
404$2 / $81M
27
735
OpenAI · Proprietary
1176±44
138$1.75 / $14400K
28
735
Google · Apache 2.0
1175±46
151$0.14 / $0.40262.1K
29
735
Google · Gemma
1175±32
326$0.08 / $0.16131.1K
30
735
Google · Apache 2.0
1152±59
92N/AN/A
31
735
Alibaba · Apache 2.0
1146±64
80$0.20 / $1.56262.1K
32
1635
Mistral · Apache 2.0
1135±40
212$0.10 / $0.3032K
33
1835
Mistral · Proprietary
1132±30
443$2.70 / $8.1032K
34
1835
1126±38
248$0.10 / $0.3032K
35
1835
Mistral · Proprietary
1123±39
201$0.40 / $2131.1K

Default Leaderboard Plots

Battle Count for Each Combination of Models (without Ties)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Fraction of Model A Wins for All Non-tied A vs. B Battles