Saltar al contenido

Modelos

Tabla comparativa de los 128 modelos frontier × 31 benchmarks. Click en cualquier header para ordenar. Heatmap por columna (rojo = peor del set filtrado, verde = mejor). Frontier Index ranking en home.

Filter by tier
36 modelos · 31 benchmarks
Categorias:(todas — click para filtrar)
Tabla comparativa de modelos y benchmarks (desplazable)
Modelo MMLUMMLU-ProGPQA-DiamondBBHARC-AGI-2Humanitys-Last-ExamMMMUHumanEvalMBPP+SWE-bench-VerifiedSWE-bench-ProCyberGymLiveCodeBenchAider-polyglotTerminal-Bench-HardTerminal-Bench-2MATH-500AIME-2024AIME-2025GSM8KFrontierMathSimpleQAIFEvalArena-HardMGSMTAU-benchOSWorldBrowseCompGDPvalArena-ELOLiveBench
GPT-5.5
OpenAI · 2026-04
93.685.058.682.735.478.784.484.9
Claude Fable 5
Anthropic · 2026-06
53.095.080.084.385.0
Claude Opus 4.8
Anthropic · 2026-05
93.649.888.669.283.4
Claude Sonnet 4.6
Anthropic · 2026-02
89.960.479.695.672.5
Gemini 3.1 Pro
Google DeepMind · 2026-02
94.377.144.480.668.585.9
Grok 4.3
xAI · 2026-04
Grok 4.20
xAI · 2026-03
Muse Spark
Meta · 2026-04
58.0
Mistral Medium 3.5
Mistral AI · 2026-04
77.686.3
Command A+
Cohere · 2026-05
75.125.0
Reka Flash 3.1open
Reka · 2025-07
53.5
Jamba 1.7 Largeopen
AI21 Labs · 2025-07
57.739.0
DeepSeek V4 Proopen
DeepSeek · 2026-04
87.590.137.776.880.655.493.592.6
Qwen3.7-Max
Alibaba · 2026-05
89.692.441.480.460.691.669.794.3
GLM-5.2open
Zhipu AI · 2026-06
91.240.562.181.0
GLM-5
Zhipu AI · 2026-02
86.030.577.8
GLM-5.1open
Zhipu AI · 2026-03
86.231.077.858.463.5
ERNIE 5.1
Baidu · 2026-05
ERNIE 5.0
Baidu · 2026-01
Doubao Seed 2.0 Pro
ByteDance · 2026-02
87.088.985.476.587.893.3
MiMo V2.5 Pro
Xiaomi · 2026-04
78.957.268.4
MiMo V2.5open
Xiaomi · 2026-04
56.165.8
MiniMax M3open
MiniMax · 2026-06
59.066.070.183.5
Nemotron 3 Ultra 550B-A55B
Nvidia · 2026-06
86.887.026.771.989.054.0
Nemotron 3 Superopen
Nvidia · 2026-03
83.779.218.360.581.290.273.9
AFM Server
Apple · 2025-07
80.089.1
Amazon Nova 2 Omni
Amazon · 2025-12
Nova 2 Pro
Amazon · 2025-12
Samsung Gauss 2.3
Samsung · 2025-09
Kimi K2.7-Codeopen
Moonshot AI · 2026-06
Kimi K2.6open
Moonshot AI · 2026-04
90.534.780.258.689.666.773.183.2
EXAONE 4.5 33Bopen
LG AI Research · 2026-04
83.380.581.492.9
K-EXAONE 236B-A23Bopen
LG AI Research · 2026-01
83.879.113.649.480.792.8
Hunyuan Hy3-previewopen
Tencent · 2026-04
87.274.454.4
Ring-2.6-1Topen
Ant Group · 2026-05
88.366.274.0
Ling-2.6-1Topen
Ant Group · 2026-04