logoalt Hacker News

judahmeekyesterday at 1:08 PM4 repliesview on HN

Something is missing in the common test suite if this can occur, right?


Replies

pronyesterday at 1:14 PM

First, it's not "can occur" but does occur 100% of the time. Second, sure, it does mean something is missing, but how do you test for "this codebase can withstand at least two years of evolution"?

stronglikedanyesterday at 9:20 PM

Maybe, and that unknown unknown is the biggest footgun with LLMs in just about every regard.

bluGillyesterday at 1:18 PM

You have to fight to get agents to write tests in my experience. It can be done, but they don't. I've yet to figure out how get any any agent to use TDD - that is write a test and then verify it fails - once in a while I can get it to write one test that way, but it then writes far more code to make it pass than the test justifies and so is still missing coverage of important edge cases.

show 1 reply
yosef123yesterday at 1:43 PM

You can spend a lot of time perfecting the test suite to meet your specific requirements and needs, but I think that would take quite a while, and at that point, why not just write the code yourself? I think the most viable approach of today's AI is still to let it code and steer it when it makes a decision you don't like, as it goes along.