Saltar al contenido
Agentic

BrowseComp

Web browsing comprehensive benchmark.

8 modelos publicaron score
# Modelo Empresa Score
1 GPT-5.5 Pro OpenAI 90.1
2 Claude Mythos 5 Anthropic 88.0
3 Claude Mythos Preview Anthropic 86.9
4 Gemini 3.1 Pro Google DeepMind 85.9
5 GPT-5.5 OpenAI 84.4
6 MiniMax M3 MiniMax 83.5
7 Kimi K2.6 Moonshot AI 83.2
8 Claude Opus 4.7 Anthropic 79.3