Anthropic's New Claude 4 Model Leads Software Engineering Benchmarks
Anthropic introduced its new Claude 4 models on 22 May, Claude Opus 4 and Claude Sonnet 4, setting new standards for coding, advanced reasoning and AI agents. Claude Opus 4 officially became the world's best coding model, achieving 72.5% on the SWE-bench benchmark and 43.2% on