现在有各种各样的大模型榜单,但是大伙好像对榜单的认可度不高?而且还有和体感不符的情况?
那么现在要找编码能力强的模型那个榜单更有参考价值呢?
附上自己在看的榜单:
AI Model & API Providers Analysis | Artificial Analysis
Comparison and analysis of AI models and API hosting providers. Independent benchmarks across key performance metrics including quality, price, output speed & latency.
WebDev AI Leaderboard - Best AI Models for Web Development
View overall rankings across AI models on front-end web development tasks, including agentic coding workflows that require multi-step reasoning and tool use.
DeepSWE
DeepSWE measures frontier coding agents on original, long-horizon software engineering tasks.


正在处理:2a4a7846-b75f-496f-bdc7-d43cf60fcad8.png…
5 个帖子 - 4 位参与者