Multimodal Max
Max, Arena's model router powered by 5M+ community votes, is now multimodal. Starting today, Max will be available as the default option in direct chat for all modalities, with expanded capabilities including search, vision, image generation, image editing, and front-end coding. Similar to our original Max for text, the multimodal variants are latency-controlled to provide a fast and performant experience. Try it now at arena.ai/max!
The benchmarks below show Max's performance across the Arena leaderboards most relevant to its capabilities. Because Max is a router, the models we compare Max to reflect a point-in-time snapshot of which models were publicly available and routable when Max was last trained and evaluated. Max is updated periodically to incorporate the latest frontier models.
Max holds Pareto frontier performance when compared to its routing set across every modality it covers. It outranks all other models in this set for every supported arena except Single-Image Edit and Multi-Image Edit, where it places second. In these two arenas, Max offers a large latency benefit over the top model.
Text Arena
Text Arena Score vs. Time to First Token
Text Max Routing Distribution
Top 5 models selected across all text modality prompts for Max
Max demonstrates strong performance on text, improving the time-to-first-token by more than 9 seconds compared to the next best model. The routing distribution is diverse, suggesting Max’s ability to leverage the various strengths of different models.
Search Arena
Search Arena Score vs. Time to First Token
Search Max Routing Distribution
Top 5 models selected across all search modality prompts for Max
In Search, we find similar results, with Max able to achieve top performance on the leaderboard. The routing distribution is more concentrated on fewer models which are both strong in performance and in latency.
Vision Arena
Vision Arena Score vs. Time to First Token
Vision Max Routing Distribution
Top 5 models selected across all vision modality prompts for Max
In Vision Arena, Max outperforms the best model at the time by 3 points while providing more than a 20 second speedup. The routing distribution shows Max strongly relying on gpt-5.2-chat-latest for 62% of battle prompts, but the 38% of prompts routed elsewhere netted Max a 12 point gain in strength.
Code Arena: Frontend
Code Arena Score vs. End to End Generation Time
Code Max Routing Distribution
Top 5 models selected across all code modality prompts for Max
In Code Arena, Max once again beats out its routing choices in terms of performance. Here we focused on making Max faster as measured by end-to-end latency since the output is only presented to the user upon full completion. Max leans more on claude-opus-4-5 variants than might be expected, largely due to gains in e2e latency.
Text-to-Image Arena
Text-to-Image Arena Score vs. End to End Generation Time
Text-to-Image Max Routing Distribution
Top 5 models selected across all text-to-image modality prompts for Max
Max in the Text-to-Image modality had an extremely strong performance, outperforming the top models in its routing set both on model strength and latency. The routing distribution leaned towards gemini-3.1-flash-image-preview, but the remaining routing choices were diversely spread through multiple models.
Single-Image-Edit Arena
Image Edit Arena Score vs. End to End Generation Time
Image-Edit Max Routing Distribution
Top 5 models selected across all image-edit modality prompts for Max
In the Single-Image Edit Arena, Max performed well, providing a faster but still strong alternative to gpt-image-2 (medium). Because the strength-latency tradeoff was configured to have a heavier emphasis on strength, Max still heavily relied on the top model. Interestingly, the routing distribution for Max in Image Edit Arena largely only consisted of other models on the pareto frontier, showing Max is able to identify models with the strongest latency/performance tradeoff.
Multi-Image-Edit Arena
Multi-Image Edit Arena Score vs. End to End Generation Time
Multi-Image-Edit Max Routing Distribution
Top 5 models selected across all multi-image-edit modality prompts for Max
Finally, on Multi-Image Edit Arena, Max also landed as a faster but still strong alternative to gpt-image-2 (medium). In this case, the strength-latency tradeoff was more heavily weighted towards speed, giving Max a 22 second speedup over gpt-image-2 (medium).
Test Max's New Abilities
With these expanded capabilities, Max can be used for more diverse tasks. Whether you want to generate a graphic, interpret a chart, or make a live website, Max can handle it. Go give the new and improved Max a try!