This is an interesting take, and the “tooling” around pure LLM-based code generation is what really matters.
AFAIK, Replit and Claude Code have ways to reduce the rate of these kinds of errors, but I haven’t dug into how.