Vision Arena
View rankings across multimodal, generative AI models capable of understanding and processing visual inputs
Feb 6, 2026
654,886 votes
94 models
| Rank | Rank Spread | Organization · License | Score | Votes |
|---|---|---|---|---|
| 1 | 1–2 | Google · Proprietary | 1289±9 | 11,297 |
| 2 | 1–3 | Google · Proprietary | 1277±9 | 9,175 |
| 3 | 2–9 | OpenAI · Proprietary | 1257±14 | 2,749 |
| 4 | 3–8 | Google · Proprietary | 1256±10 | 7,313 |
| 5 | 3–9 | OpenAI · Proprietary | 1252±10 | 7,299 |
| 6 | 3–10 | Moonshot · Modified MIT | 1251±13 | 2,979 |
| 7 | 3–10 | Google · Proprietary | 1246±6 | 79,747 |
| 8 | 6–14 | OpenAI · Proprietary | 1235±6 | 23,313 |
| 9 | 4–15 | OpenAI · Proprietary | 1235±9 | 7,974 |
| 10 | 3–22 | Moonshot · Modified MIT | 1231±17 | 1,663 |
| 11 | 8–22 | Google · Proprietary | 1225±10 | 5,293 |
| 12 | 8–22 | OpenAI · Proprietary | 1225±11 | 2,925 |
| 13 | 8–23 | OpenAI · Proprietary | 1223±14 | 3,013 |
| 14 | 8–22 | OpenAI · Proprietary | 1222±7 | 43,264 |
| 15 | 9–25 | Baidu · Proprietary | 1216±11 | 3,623 |
| 16 | 10–23 | OpenAI · Proprietary | 1216±7 | 49,181 |
| 17 | 10–25 | Google · Proprietary | 1213±6 | 48,047 |
| 18 | 10–25 | OpenAI · Proprietary | 1213±7 | 44,463 |
| 19 | 10–27 | Alibaba · Apache 2.0 | 1211±8 | 10,750 |
| 20 | 10–27 | OpenAI · Proprietary | 1208±8 | 37,581 |
| 21 | 10–32 | Anthropic · Proprietary | 1206±15 | 1,495 |
| 22 | 10–34 | Anthropic · Proprietary | 1205±16 | 1,361 |
| 23 | 16–31 | OpenAI · Proprietary | 1201±8 | 43,674 |
| 24 | 16–32 | OpenAI · Proprietary | 1199±7 | 44,239 |
| 25 | 14–37 | Anthropic · Proprietary | 1195±15 | 1,676 |
| 26 | 19–37 | OpenAI · Proprietary | 1192±10 | 3,694 |
| 27 | 19–37 | Anthropic · Proprietary | 1191±12 | 2,579 |
| 28 | 21–37 | Google · Proprietary | 1188±8 | 39,110 |
| 29 | 21–38 | Tencent · Proprietary | 1187±12 | 2,869 |
| 30 | 21–39 | Alibaba · Apache 2.0 | 1186±12 | 2,664 |
| 31 | 21–39 | Anthropic · Proprietary | 1186±13 | 2,066 |
| 32 | 24–39 | xAI · Proprietary | 1182±8 | 34,737 |
| 33 | 24–39 | OpenAI · Proprietary | 1181±9 | 31,410 |
| 34 | 22–40 | Alibaba · Proprietary | 1181±12 | 3,454 |
| 35 | 25–39 | Google · Proprietary | 1178±8 | 8,902 |
| 36 | 25–41 | Anthropic · Proprietary | 1177±9 | 4,674 |
| 37 | 25–45 | Google · Proprietary | 1173±10 | 5,330 |
| 38 | 29–46 | Google · Proprietary | 1170±7 | 9,875 |
| 39 | 35–51 | OpenAI · Proprietary | 1162±8 | 23,273 |
| 40 | 29–53 | Z.ai · MIT | 1161±14 | 2,611 |
| 41 | 36–51 | Anthropic · Proprietary | 1161±7 | 10,568 |
| 42 | 37–53 | Google · Gemma | 1156±8 | 18,534 |
| 43 | 38–53 | Mistral · Proprietary | 1155±8 | 11,519 |
| 44 | 37–55 | Z.ai · MIT | 1154±12 | 3,576 |
| 45 | 37–55 | StepFun · Proprietary | 1152±14 | 2,037 |
| 46 | 37–56 | Tencent · Proprietary | 1151±16 | 1,440 |
| 47 | 39–55 | Mistral · Proprietary | 1150±7 | 41,998 |
| 48 | 39–55 | Anthropic · Proprietary | 1146±9 | 21,624 |
| 49 | 39–55 | Meta · Llama 4 | 1145±9 | 7,410 |
| 50 | 39–58 | OpenAI · Proprietary | 1144±11 | 4,325 |
| 51 | 39–58 | StepFun · Apache 2.0 | 1144±12 | 3,558 |
| 52 | 41–60 | Mistral · Apache 2.0 | 1139±9 | 11,713 |
| 53 | 41–61 | Google · Proprietary | 1139±9 | 7,241 |
| 54 | 44–63 | Google · Proprietary | 1133±10 | 3,991 |
| 55 | 44–63 | Anthropic · Proprietary | 1130±15 | 1,583 |
| 56 | 50–63 | Mistral · Apache 2.0 | 1126±9 | 30,955 |
| 57 | 50–63 | Meta · Llama | 1125±10 | 6,826 |
| 58 | 49–63 | StepFun · Proprietary | 1123±12 | 2,833 |
| 59 | 52–63 | Alibaba · Qwen | 1121±10 | 3,768 |
| 60 | 53–63 | OpenAI · Proprietary | 1118±12 | 3,376 |
| 61 | 54–64 | Google · Proprietary | 1117±11 | 16,734 |
| 62 | 52–66 | Alibaba · Apache 2.0 | 1116±15 | 1,490 |
| 63 | 54–66 | OpenAI · Proprietary | 1112±11 | 13,391 |
| 64 | 62–68 | OpenAI · Proprietary | 1097±7 | 17,347 |
| 65 | 62–68 | Mistral · MRL | 1093±9 | 5,423 |
| 66 | 61–72 | OpenAI · Proprietary | 1088±18 | 1,211 |
| 67 | 64–70 | Alibaba · Qwen | 1085±9 | 5,937 |
| 68 | 64–72 | Alibaba · Proprietary | 1084±16 | 1,422 |
| 69 | 66–73 | Google · Proprietary | 1070±10 | 6,243 |
| 70 | 67–74 | Anthropic · Proprietary | 1063±10 | 15,565 |
| 71 | 66–74 | StepFun · Proprietary | 1063±16 | 1,534 |
| 72 | 67–74 | Google · Proprietary | 1059±11 | 13,260 |
| 73 | 70–78 | AI2 · Apache 2.0 | 1047±13 | 3,048 |
| 74 | 69–81 | Tencent · Proprietary | 1043±21 | 809 |
| 75 | 73–81 | Meta · Llama 3.2 | 1032±8 | 8,682 |
| 76 | 73–82 | Alibaba · Apache 2.0 | 1031±10 | 5,766 |
| 77 | 74–83 | Mistral · Apache 2.0 | 1025±9 | 7,511 |
| 78 | 73–84 | OpenGVLab · MIT | 1024±12 | 5,148 |
| 79 | 73–86 | Amazon · Proprietary | 1020±15 | 1,854 |
| 80 | 74–86 | Amazon · Proprietary | 1019±13 | 2,335 |
| 81 | 74–85 | Anthropic · Proprietary | 1019±11 | 12,314 |
| 82 | 77–88 | 01 AI · Proprietary | 1003±18 | 1,219 |
| 83 | 78–88 | Anthropic · Proprietary | 1002±12 | 13,380 |
| 84 | 76–90 | Cohere · CC-BY-NC-4.0 | 1000±22 | 847 |
| 85 | 79–88 | AI2 · Apache 2.0 | 996±13 | 2,815 |
| 86 | 82–88 | Meta · Llama 3.2 | 991±11 | 4,817 |
| 87 | 80–92 | Nvidia · - | 987±20 | 1,077 |
| 88 | 82–92 | LLaVA · Apache 2.0 | 980±18 | 1,321 |
| 89 | 87–92 | LLaVA · Apache 2.0 | 966±12 | 4,531 |
| 90 | 86–92 | OpenBMB · Apache 2.0 | 964±15 | 1,987 |
| 91 | 86–92 | Zhipu AI · CogVLM2 | 964±15 | 1,991 |
| 92 | 87–92 | OpenGVLab · MIT | 957±12 | 3,703 |
| 93 | 93–93 | Microsoft · MIT | 921±15 | 2,592 |
| 94 | 94–94 | Microsoft · MIT | 883±18 | 1,401 |
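
When comparing two entries, it can help to check whether their score ranges overlap. The sketch below assumes that a cell such as `1289±9` denotes a symmetric margin around the score; the page does not state how the margins or the rank spread are computed, so this is only a reading aid, not the leaderboard's own method. The helper names `parse_score` and `intervals_overlap` are hypothetical.

```python
def parse_score(cell: str) -> tuple[float, float]:
    """Split a cell like '1289±9' into (score, margin)."""
    score, margin = cell.split("±")
    return float(score), float(margin)


def intervals_overlap(a: str, b: str) -> bool:
    """Return True when the two score±margin ranges share at least one point.

    Assumption: the margin is symmetric, so the range is
    [score - margin, score + margin].
    """
    score_a, margin_a = parse_score(a)
    score_b, margin_b = parse_score(b)
    return abs(score_a - score_b) <= margin_a + margin_b


# Ranks 1 and 2: [1280, 1298] and [1268, 1286] overlap.
print(intervals_overlap("1289±9", "1277±9"))   # True
# Ranks 1 and 3: [1280, 1298] and [1243, 1271] do not overlap.
print(intervals_overlap("1289±9", "1257±14"))  # False
```

Under this assumption, overlapping ranges mean the score gap between two entries is no larger than their combined margins, so their relative order is less certain than the rank alone suggests.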
