Hacker News

pianopatrick today at 5:49 AM

I wonder, based on your experience, how hard would it be to improve your system to have an AI agent review the software and suggest tickets?

Like, can an AI agent use a browser, attempt to use the software, find bugs and create a ticket? Can an AI agent use a browser, try to use the software and suggest new features?


Replies

vidarh today at 6:11 PM

Yes, they can, and they do a reasonably good job at it. Hand them Playwright or similar, and point them at the software. The caveat is that they're often "lazy", and it takes some practice to coax them into being thorough. Hot tip: have one write a list of things to probe and test, and tell it to use sub-agents to address each item; otherwise they tend to decide very quickly that it's too tedious and start taking shortcuts.
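A rough sketch of that checklist-plus-sub-agents pattern, in case it helps. Everything here is hypothetical: `stub_subagent`, `probe_all`, `to_tickets` and the checklist items are made-up names for illustration, and the stub only pretends to test something. In a real setup each sub-agent call would be an agent driving a browser (for example through Playwright) against one checklist item.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Finding:
    item: str     # the checklist item that was probed
    passed: bool  # True if the probe found no problem
    notes: str    # free-form notes reported by the sub-agent


def stub_subagent(item: str) -> Finding:
    # Hypothetical stand-in for a real sub-agent. A real one would drive a
    # browser against the app; this one just flags items mentioning "empty".
    if "empty" in item:
        return Finding(item, False, "form accepted blank input")
    return Finding(item, True, "ok")


def probe_all(
    checklist: List[str],
    run_subagent: Callable[[str], Finding],
    max_workers: int = 4,
) -> List[Finding]:
    # One sub-agent call per checklist item, so no single agent is asked to
    # grind through the whole list. pool.map keeps results in input order.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_subagent, checklist))


def to_tickets(findings: List[Finding]) -> List[str]:
    # Only failed probes become ticket drafts.
    return [f"[bug] {f.item}: {f.notes}" for f in findings if not f.passed]


# Step 1: the list of things to probe (in practice, written by the first agent).
checklist = [
    "log in with valid credentials",
    "submit the signup form with empty fields",
    "reset a password",
]

# Step 2: fan out one sub-agent per item, then collect ticket drafts.
findings = probe_all(checklist, stub_subagent)
tickets = to_tickets(findings)
print(tickets)
```

The point of the structure is that thoroughness is enforced by the loop rather than by the agent's patience: every item gets its own run and its own result, so a skipped item is visible as a missing finding.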

ramon156 today at 6:48 AM

I think it's more important to pin down where a human must be in the loop for this not to become a mess. Or have we skipped that step entirely?

mlsu today at 7:09 AM

Perhaps we can give the AI a bit of money and make it the customer; then we can all safely get off the computer and go outside :)

stingraycharles today at 9:29 AM

AI agents can absolutely use web browsers to do these things, but the hard part is accurately defining the acceptance criteria.

smokeyfish today at 6:22 AM

Datadog have a feature like that.