How We're Using AI to Build Software (And Everything Else)
To me, the real inflection point came at the end of May 2025. That’s when Anthropic—the company founded by ex-OpenAI researchers—released Claude 4, a new family of large language models. For software development, it is said that the previous models produced errors in 30-40% of generated code. Claude 4 is rumoured to have dropped that to 6-10%. It certainly matches my own observations… This drop is not yet another incremental improvement. To me, it’s a phase change. When your error rate drops by that much, it fundamentally changes what’s possible. ...