Search Arena

View overall rankings across LLMs with integrated web search.

Mar 31, 2026
357,742 votes
25 models
Rank Spread
1
11
Anthropic
Anthropic · Proprietary
1251±5
19,512$5 / $251M
2
28
Google · Proprietary
1220±7
8,740N/AN/A
3
28
Anthropic
Anthropic · Proprietary
1218±5
19,712$3 / $151M
4
210
OpenAI · Proprietary
1214±5
30,450$1.75 / $14400K
5
210
Google · Proprietary
1213±5
41,934N/AN/A
6
210
1208±7
8,400$2 / $62M
7
210
Google · Proprietary
1208±5
37,396$2 / $12N/A
8
210
xAI · Proprietary
1207±6
20,720N/AN/A
9
410
OpenAI · Proprietary
1206±5
35,128$1.25 / $10400K
10
410
OpenAI · Proprietary
1204±8
10,115$2.50 / $151.1M
11
1114
xAI · Proprietary
1177±5
43,389$0.20 / $0.502M
12
1114
OpenAI · Proprietary
1177±5
37,039$1.75 / $14400K
13
1114
Anthropic
Anthropic · Proprietary
1175±5
29,875$5 / $25200K
14
1114
xAI · Proprietary
1172±4
43,020$0.20 / $0.502M
15
1520
OpenAI · Proprietary
1143±5
20,729$2 / $8200K
16
1520
Google · Proprietary
1143±4
57,810$1.25 / $101M
17
1521
Anthropic
Anthropic · Proprietary
1142±6
26,427$3 / $151M
18
1521
xAI · Proprietary
1141±6
19,321$3 / $15256K
19
1522
Anthropic
Anthropic · Proprietary
1139±4
57,404$15 / $75200K
20
1522
Perplexity · Proprietary
1139±5
29,154$1 / $1127.1K
21
1723
OpenAI · Proprietary
1132±5
20,855$1.25 / $10400K
22
1823
Perplexity · Proprietary
1130±5
28,656$1 / $1127.1K
23
2123
Anthropic
Anthropic · Proprietary
1128±5
31,162$15 / $75200K
24
2425
Diffbot · Apache 2.0
1022±8
6,409N/AN/A
25
2425
OpenAI · Proprietary
1006±11
3,416$30 / $608.2K

Default Leaderboard Plots

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles