Vision Arena | Overall
View rankings across multimodal, generative AI models capable of understanding and processing visual inputs
Feb 20, 2026
668,987 votes
98 models
Rank Spread | ||||
|---|---|---|---|---|
| 1 | 13 | Google · Proprietary | 1288±8 | 12,034 |
| 2 | 15 | Google · Proprietary | 1277±12 | 3,503 |
| 3 | 15 | Google · Proprietary | 1274±9 | 10,908 |
| 4 | 212 | Bytedance · Proprietary | 1263±16 | 1,571 |
| 5 | 29 | Google · Proprietary | 1262±10 | 8,927 |
| 6 | 413 | OpenAI · Proprietary | 1250±9 | 8,640 |
| 7 | 414 | OpenAI · Proprietary | 1250±11 | 4,511 |
| 8 | 414 | Moonshot · Modified MIT | 1250±11 | 4,662 |
| 9 | 413 | Google · Proprietary | 1248±6 | 80,509 |
| 10 | 518 | OpenAI · Proprietary | 1241±9 | 9,492 |
| 11 | 518 | Moonshot · Modified MIT | 1239±12 | 3,552 |
| 12 | 617 | OpenAI · Proprietary | 1239±6 | 24,033 |
| 13 | 522 | Alibaba · Apache 2.0 | 1237±15 | 1,614 |
| 14 | 1024 | Google · Proprietary | 1228±10 | 4,752 |
| 15 | 825 | OpenAI · Proprietary | 1228±11 | 2,925 |
| 16 | 1025 | OpenAI · Proprietary | 1227±11 | 4,882 |
| 17 | 1124 | OpenAI · Proprietary | 1225±8 | 42,547 |
| 18 | 1027 | ![]() Baidu · Proprietary | 1221±12 | 3,324 |
| 19 | 1326 | Alibaba · Apache 2.0 | 1218±8 | 11,697 |
| 20 | 1325 | OpenAI · Proprietary | 1218±7 | 48,659 |
| 21 | 1428 | OpenAI · Proprietary | 1215±7 | 44,042 |
| 22 | 1428 | Google · Proprietary | 1212±6 | 48,908 |
| 23 | 1630 | OpenAI · Proprietary | 1209±8 | 36,682 |
| 24 | 1335 | Anthropic · Proprietary | 1207±16 | 1,337 |
| 25 | 1335 | Anthropic · Proprietary | 1207±15 | 1,479 |
| 26 | 1934 | OpenAI · Proprietary | 1203±8 | 43,406 |
| 27 | 2134 | OpenAI · Proprietary | 1202±7 | 44,353 |
| 28 | 2040 | Anthropic · Proprietary | 1195±15 | 1,649 |
| 29 | 2339 | OpenAI · Proprietary | 1195±10 | 3,694 |
| 30 | 2340 | Anthropic · Proprietary | 1193±12 | 2,547 |
| 31 | 2442 | Alibaba · Apache 2.0 | 1188±13 | 2,418 |
| 32 | 2440 | Google · Proprietary | 1188±8 | 38,638 |
| 33 | 2442 | Anthropic · Proprietary | 1188±13 | 2,043 |
| 34 | 2442 | Alibaba · Proprietary | 1185±12 | 3,373 |
| 35 | 2642 | OpenAI · Proprietary | 1183±9 | 30,876 |
| 36 | 2842 | xAI · Proprietary | 1182±8 | 34,084 |
| 37 | 2842 | Google · Proprietary | 1180±8 | 8,902 |
| 38 | 2844 | Anthropic · Proprietary | 1179±9 | 4,666 |
| 39 | 2849 | Tencent · Proprietary | 1176±12 | 2,488 |
| 40 | 2950 | Google · Proprietary | 1173±10 | 4,791 |
| 41 | 3249 | Google · Proprietary | 1173±7 | 10,666 |
| 42 | 3854 | OpenAI · Proprietary | 1164±8 | 23,273 |
| 43 | 3854 | Anthropic · Proprietary | 1163±7 | 10,548 |
| 44 | 3256 | Z.ai · MIT | 1163±14 | 2,422 |
| 45 | 3956 | Google · Gemma | 1158±8 | 18,103 |
| 46 | 3956 | Mistral · Proprietary | 1158±8 | 11,422 |
| 47 | 4156 | Mistral · Proprietary | 1156±7 | 42,832 |
| 48 | 3958 | Z.ai · MIT | 1156±12 | 3,492 |
| 49 | 3958 | StepFun · Proprietary | 1155±14 | 2,027 |
| 50 | 3961 | Tencent · Proprietary | 1152±16 | 1,423 |
| 51 | 4258 | Anthropic · Proprietary | 1148±9 | 21,624 |
| 52 | 4260 | Meta · Llama 4 | 1147±9 | 7,297 |
| 53 | 4261 | StepFun · Apache 2.0 | 1147±12 | 3,447 |
| 54 | 4262 | OpenAI · Proprietary | 1146±12 | 4,202 |
| 55 | 4465 | Google · Proprietary | 1141±9 | 7,241 |
| 56 | 4465 | Mistral · Apache 2.0 | 1141±9 | 11,563 |
| 57 | 4866 | Google · Proprietary | 1136±10 | 3,991 |
| 58 | 4867 | Anthropic · Proprietary | 1132±15 | 1,565 |
| 59 | 5167 | Meta · Llama | 1128±10 | 6,724 |
| 60 | 5267 | Mistral · Apache 2.0 | 1128±9 | 30,775 |
| 61 | 5167 | StepFun · Proprietary | 1126±12 | 2,833 |
| 62 | 5567 | Alibaba · Qwen | 1123±10 | 3,768 |
| 63 | 5568 | OpenAI · Proprietary | 1120±12 | 3,376 |
| 64 | 5767 | Google · Proprietary | 1120±11 | 16,734 |
| 65 | 5470 | Alibaba · Apache 2.0 | 1119±15 | 1,490 |
| 66 | 5870 | OpenAI · Proprietary | 1114±11 | 13,391 |
| 67 | 5572 | AI2 · Apache 2.0 | 1114±20 | 1,202 |
| 68 | 6572 | OpenAI · Proprietary | 1099±8 | 17,347 |
| 69 | 6572 | Mistral · MRL | 1096±9 | 5,423 |
| 70 | 6476 | OpenAI · Proprietary | 1090±18 | 1,211 |
| 71 | 6774 | Alibaba · Qwen | 1087±10 | 5,937 |
| 72 | 6776 | Alibaba · Proprietary | 1087±16 | 1,422 |
| 73 | 7077 | Google · Proprietary | 1072±10 | 6,243 |
| 74 | 7178 | Anthropic · Proprietary | 1065±10 | 15,565 |
| 75 | 7078 | StepFun · Proprietary | 1065±16 | 1,534 |
| 76 | 7178 | Google · Proprietary | 1062±11 | 13,260 |
| 77 | 7482 | AI2 · Apache 2.0 | 1049±13 | 3,048 |
| 78 | 7385 | Tencent · Proprietary | 1045±21 | 809 |
| 79 | 7785 | Meta · Llama 3.2 | 1034±9 | 8,682 |
| 80 | 7786 | Aliaba · Apache 2.0 | 1033±10 | 5,766 |
| 81 | 7887 | Mistral · Apache 2.0 | 1027±9 | 7,511 |
| 82 | 7788 | OpenGVLab · MIT | 1027±12 | 5,148 |
| 83 | 7790 | Amazon · Proprietary | 1022±15 | 1,854 |
| 84 | 7890 | Amazon · Proprietary | 1021±13 | 2,335 |
| 85 | 7889 | Anthropic · Proprietary | 1020±11 | 12,314 |
| 86 | 8192 | 01 AI · Proprietary | 1005±18 | 1,219 |
| 87 | 8292 | Anthropic · Proprietary | 1004±12 | 13,380 |
| 88 | 7994 | Cohere · CC-BY-NC-4.0 | 1002±22 | 847 |
| 89 | 8392 | AI2 · Apache 2.0 | 998±13 | 2,815 |
| 90 | 8692 | Meta · Llama 3.2 | 994±11 | 4,817 |
| 91 | 8496 | Nvidia · - | 989±20 | 1,077 |
| 92 | 8696 | LLaVA · Apache 2.0 | 982±18 | 1,321 |
| 93 | 9196 | LLaVA · Apache 2.0 | 968±12 | 4,531 |
| 94 | 9096 | OpenBMB · Apache 2.0 | 966±16 | 1,987 |
| 95 | 9096 | Zhipu AI · CogVLM2 | 966±15 | 1,991 |
| 96 | 9196 | OpenGVLab · MIT | 960±12 | 3,703 |
| 97 | 9797 | Microsoft · MIT | 923±15 | 2,592 |
| 98 | 9898 | Microsoft · MIT | 885±18 | 1,401 |
