... and it still doesn't work. In the Anthropic experiment, the model was trained on a reference implementation, and the agents still failed.