Search Arena

View overall rankings across LLMs with integrated web search.

Feb 25, 2026
247,944 votes
22 models
Rank  Rank Spread  Organization  License      Score    Votes   Price          Context
1     1-1          Anthropic     Proprietary  1255±10  3,607   $5 / $25       1M
2     2-5          xAI           Proprietary  1225±8   4,687   N/A            N/A
3     2-7          OpenAI        Proprietary  1219±6   20,150  $1.75 / $14    400K
4     2-7          Google        Proprietary  1218±6   25,311  N/A            N/A
5     2-7          Google        Proprietary  1214±5   31,966  $2 / $12       N/A
6     3-7          OpenAI        Proprietary  1210±6   23,283  $1.25 / $10    400K
7     3-7          Anthropic     Proprietary  1203±10  3,602   $3 / $15       1M
8     8-11         OpenAI        Proprietary  1183±6   20,045  $1.75 / $14    400K
9     8-11         xAI           Proprietary  1181±5   26,758  $0.20 / $0.50  2M
10    8-11         xAI           Proprietary  1173±4   42,193  $0.20 / $0.50  2M
11    8-11         Anthropic     Proprietary  1170±6   15,488  $5 / $25       200K
12    12-18        OpenAI        Proprietary  1143±5   20,407  $2 / $8        200K
13    12-17        Google        Proprietary  1143±4   45,483  $1.25 / $10    1M
14    12-18        xAI           Proprietary  1142±5   19,018  $3 / $15       256K
15    12-18        Perplexity    Proprietary  1141±5   28,673  $1 / $1        127.1K
16    12-20        Anthropic     Proprietary  1138±7   14,385  $3 / $15       1M
17    12-20        Anthropic     Proprietary  1138±4   44,888  $15 / $75      200K
18    13-20        OpenAI        Proprietary  1133±5   20,519  $1.25 / $10    400K
19    16-20        Perplexity    Proprietary  1131±5   28,131  $1 / $1        127.1K
20    16-20        Anthropic     Proprietary  1129±5   30,695  $15 / $75      200K
21    21-22        Diffbot       Apache 2.0   1024±8   6,378   N/A            N/A
22    21-22        OpenAI        Proprietary  1006±11  3,375   $30 / $60      8.2K

Default Leaderboard Plots

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)
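
The plot titles above refer to two quantities that are easy to illustrate: the fraction of non-tied battles that model A wins against model B, and a bootstrapped confidence interval. The Python sketch below is only an illustration under stated assumptions, not the leaderboard's actual pipeline. The battle record format `(model_a, model_b, winner)`, the function names, and the sample data are all hypothetical. It also bootstraps the pairwise win fraction rather than the model strength score shown in the table, since the page does not state how that score is fitted.

```python
import random
from collections import defaultdict


def win_fractions(battles):
    """Return {(a, b): fraction of non-tied a-vs-b battles won by a}.

    Each battle is a hypothetical record (model_a, model_b, winner),
    where winner is "a", "b", or "tie". Ties are skipped.
    """
    wins = defaultdict(int)
    totals = defaultdict(int)
    for model_a, model_b, winner in battles:
        if winner == "tie":
            continue
        # Count the battle for both orderings of the pair.
        totals[(model_a, model_b)] += 1
        totals[(model_b, model_a)] += 1
        if winner == "a":
            wins[(model_a, model_b)] += 1
        else:
            wins[(model_b, model_a)] += 1
    return {pair: wins[pair] / n for pair, n in totals.items()}


def bootstrap_interval(battles, pair, rounds=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the win fraction of one pair."""
    rng = random.Random(seed)
    samples = []
    for _ in range(rounds):
        # Resample battles with replacement, same size as the original set.
        resampled = rng.choices(battles, k=len(battles))
        frac = win_fractions(resampled).get(pair)
        if frac is not None:  # pair can vanish if only ties were drawn
            samples.append(frac)
    if not samples:
        raise ValueError("pair never appeared in any resample")
    samples.sort()
    lower = samples[int((alpha / 2) * len(samples))]
    upper = samples[int((1 - alpha / 2) * len(samples)) - 1]
    return lower, upper


# Made-up data: 6 wins for model_x, 2 for model_y, 2 ties.
demo_battles = (
    [("model_x", "model_y", "a")] * 6
    + [("model_x", "model_y", "b")] * 2
    + [("model_x", "model_y", "tie")] * 2
)
demo_fractions = win_fractions(demo_battles)
print(demo_fractions[("model_x", "model_y")])  # 0.75
print(bootstrap_interval(demo_battles, ("model_x", "model_y")))
```

Averaging a model's entries in `win_fractions` over all opponents would give a quantity in the spirit of the "Average Win Rate" plot, assuming each opponent is weighted equally.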