Coding
Aider-polyglot
Code editing benchmark across multiple languages.
4 models published a score
| # | Model | Company | Score |
|---|---|---|---|
| 1 | Claude Opus 4.5 | Anthropic | 89.4 |
| 2 | Grok 4 | xAI | 79.6 |
| 3 | DeepSeek V3.2 | DeepSeek | 74.5 |
| 4 | GLM-4.6 | Zhipu AI | 39.1 |
Code editing benchmark across multiple languages.
| # | Model | Company | Score |
|---|---|---|---|
| 1 | Claude Opus 4.5 | Anthropic | 89.4 |
| 2 | Grok 4 | xAI | 79.6 |
| 3 | DeepSeek V3.2 | DeepSeek | 74.5 |
| 4 | GLM-4.6 | Zhipu AI | 39.1 |