Hacker News

epolanski, yesterday at 2:23 PM

+1. Half the time I see such posts, the answer is "harness".

Put the LLM in a situation where it can test and reason about its results.
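
As a rough illustration of what that kind of loop could look like, here is a minimal sketch. Everything in it is an assumption made for the example: `generate` stands in for whatever model call you actually use, `fake_model` is a stub so the snippet runs on its own, and `run_tests` checks a single toy function named `add`.

```python
from typing import Callable, Optional


def run_tests(source: str) -> Optional[str]:
    """Return None if the candidate passes, otherwise a failure message."""
    namespace: dict = {}
    try:
        # exec is only acceptable here because the input is a toy example;
        # real model output should run in a sandbox or subprocess.
        exec(source, namespace)
        assert namespace["add"](2, 3) == 5, "add(2, 3) should be 5"
    except Exception as exc:
        return f"{type(exc).__name__}: {exc}"
    return None


def harness_loop(
    generate: Callable[[str], str], task: str, max_rounds: int = 3
) -> Optional[str]:
    """Ask for code, test it, and feed any failure back into the next prompt."""
    prompt = task
    for _ in range(max_rounds):
        candidate = generate(prompt)
        failure = run_tests(candidate)
        if failure is None:
            return candidate
        prompt = f"{task}\n\nYour last attempt failed:\n{failure}"
    return None  # no passing candidate within the round limit


def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for an LLM: wrong at first, right after feedback."""
    if "failed" in prompt:
        return "def add(a, b):\n    return a + b"
    return "def add(a, b):\n    return a - b"


result = harness_loop(fake_model, "Write add(a, b).")
```

The point of the structure is that the model sees the test failure rather than a human relaying it, so a bad first answer gets a chance to be corrected before anyone reads it.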


Replies

JetSetIlly, yesterday at 2:29 PM

I do have a test harness. That's how I could show that the code suggested was poor.

If you mean put the LLM in the test harness, then sure, I accept that that's the best way to use the tools. The problem is that nothing requires me or anyone else to do that.