Hacker News

icedchai, last Thursday at 8:30 PM

You don't think these errors compound? Generated code involves hundreds of little decisions. Yes, it "usually" works.
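
To put rough numbers on the compounding worry: a minimal sketch, assuming each decision is independently correct with the same probability `p`. The independence assumption, the function name, and the values of `p` and `n` are illustrative choices, not measurements of any real model.

```python
def all_correct_probability(p: float, n: int) -> float:
    """Chance that n independent decisions are all correct,
    given each one is correct with probability p (an assumption)."""
    return p ** n


if __name__ == "__main__":
    # Hypothetical per-decision accuracy of 99% across growing task sizes.
    for n in (10, 100, 500):
        print(n, round(all_correct_probability(0.99, n), 3))
```

Under these assumptions, even a high per-decision accuracy leaves a long chain of unchecked decisions more likely than not to contain at least one error.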


Replies

russfink, last Thursday at 11:28 PM

LLMs: sometimes wrong but never in doubt.

dyauspitr, last Thursday at 8:37 PM

Not in my experience. With a proper TDD setup, it does better than most programmers at a company, who anecdotally introduce a bug every 2-3 tasks.

FeepingCreature, last Friday at 9:07 AM

Errors compounding is a meme. In iterated as well as verifiable domains, errors dilute instead of compounding, because the LLM has repeated chances to notice its failure.
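
A minimal sketch of the dilution argument, under strong assumptions: a verifier (tests, a compiler, a checker) catches every failed step, and each retry is an independent draw with the same success probability `p`. The function names and numbers are hypothetical, and how close real verifiers come to this ideal is exactly what is in dispute.

```python
def step_success(p: float, attempts: int) -> float:
    """A verified step fails only if every one of its attempts fails."""
    return 1 - (1 - p) ** attempts


def task_success(p: float, n: int, attempts: int) -> float:
    """Chance that all n verified steps eventually succeed."""
    return step_success(p, attempts) ** n


if __name__ == "__main__":
    # Hypothetical: 99% per-attempt accuracy, 100 steps, 1 to 3 attempts each.
    for attempts in (1, 2, 3):
        print(attempts, round(task_success(0.99, 100, attempts), 4))
```

With a single attempt per step this reduces to plain compounding. Each extra verified attempt shrinks the per-step failure rate geometrically, which is the sense in which errors dilute. If the verifier misses failures, or retries repeat the same mistake, the benefit is smaller than this model suggests.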