I do have a test harness. That's how I could show that the code suggested was poor.
If you mean, put the LLM in the test harness. Sure, I accept that that's the best way to use the tools. The problem is that there's nothing requiring me or anyone else to do that.