Knowledge
SimpleQA
Benchmark de factualidad de respuestas cortas.
3 modelos publicaron score
| # | Modelo | Empresa | Score |
|---|---|---|---|
| 1 | Gemini 3 Pro | Google DeepMind | 72.1 |
| 2 | GPT-5.2 | OpenAI | 58.0 |
| 3 | Mistral Large 3 | Mistral AI | 23.8 |
Benchmark de factualidad de respuestas cortas.
| # | Modelo | Empresa | Score |
|---|---|---|---|
| 1 | Gemini 3 Pro | Google DeepMind | 72.1 |
| 2 | GPT-5.2 | OpenAI | 58.0 |
| 3 | Mistral Large 3 | Mistral AI | 23.8 |