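Find the AI model that best fits your needs, using a composite score based on community votes and performance tests.
See how the major large language models rank overall on text processing, linguistic precision, and understanding of cultural context.
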
| Rank | Model | Score | 95% CI (±) | Votes | Organization | License |
|---|---|---|---|---|---|---|
| 1 | gemini-2.5-pro | 1452 | ±4 | 62,764 | Google | Proprietary |
| 2 | claude-sonnet-4-5-20250929-thinking-32k | 1449 | ±6 | 13,853 | Anthropic | Proprietary |
| 3 | claude-opus-4-1-20250805-thinking-16k | 1448 | ±5 | 29,426 | Anthropic | Proprietary |
| 4 | claude-sonnet-4-5-20250929 | 1444 | ±7 | 8,318 | Anthropic | Proprietary |
| 5 | gpt-4.5-preview-2025-02-27 | 1442 | ±6 | 14,644 | OpenAI | Proprietary |
| 6 | claude-opus-4-1-20250805 | 1439 | ±4 | 41,950 | Anthropic | Proprietary |
| 7 | chatgpt-4o-latest-20250326 | 1438 | ±4 | 48,510 | OpenAI | Proprietary |
| 8 | gpt-5-high | 1436 | ±5 | 30,974 | OpenAI | Proprietary |
| 9 | o3-2025-04-16 | 1434 | ±4 | 59,391 | OpenAI | Proprietary |
| 10 | qwen3-max-preview | 1432 | ±5 | 25,932 | Alibaba | Proprietary |
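
One way to read the 95% CI column: when two models' score ranges overlap, the gap between their ranks may not be meaningful. The sketch below is illustrative only. It assumes each interval is symmetric (score ± the listed value), and the `Entry` type and `intervals_overlap` helper are hypothetical names, not part of the leaderboard. An overlap check is a rough heuristic, not a formal significance test.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    model: str
    score: int
    ci: int  # half-width of the 95% CI, as listed in the table (assumed symmetric)


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True if [score - ci, score + ci] for a and b share any point."""
    return a.score - a.ci <= b.score + b.ci and b.score - b.ci <= a.score + a.ci


# Values copied from the table above.
gemini = Entry("gemini-2.5-pro", 1452, 4)
sonnet_thinking = Entry("claude-sonnet-4-5-20250929-thinking-32k", 1449, 6)
qwen = Entry("qwen3-max-preview", 1432, 5)

print(intervals_overlap(gemini, sonnet_thinking))  # True: [1448, 1456] vs [1443, 1455]
print(intervals_overlap(gemini, qwen))             # False: [1448, 1456] vs [1427, 1437]
```

Under this reading, ranks 1 and 2 are too close to separate with confidence, while ranks 1 and 10 are clearly apart.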