Reasoning

ARC-AGI-2

Updated ARC challenge — novel and tough abstract reasoning.

9 models have a published score
#  Model                Company          Score
1  GPT-5.5              OpenAI           85.0
2  Gemini 3 Deep Think  Google DeepMind  84.6
3  Gemini 3.1 Pro       Google DeepMind  77.1
4  Claude Opus 4.6      Anthropic        68.8
5  Ring-2.6-1T          Ant Group        66.2
6  Claude Sonnet 4.6    Anthropic        60.4
7  GPT-5.2              OpenAI           52.9
8  Claude Opus 4.5      Anthropic        37.6
9  Gemini 3 Pro         Google DeepMind  31.1