There's a lot of skepticism in the security world about whether AI agents can "think outsi...

david_shaw • today at 6:59 PM • 3 replies • view on HN

There's a lot of skepticism in the security world about whether AI agents can "think outside the box" enough to replicate or augment senior-level security engineers.

I don't yet have access to Claude Code Security, but I think that line of reasoning misses the point. Maybe even the real benefit.

Just like architectural thinking is still important when developing software with AI, creative security assessments will probably always be a key component of security evaluation.

But you don't need highly paid security engineers to tell you that you forgot to sanitize input, or you're using a vulnerable component, or to identify any of the myriad issues we currently use "dumb" scanners for.

My hope is that tools like this can help automate away the "busywork" of security. We'll see how well it really works.

Replies

samuelknight • today at 8:14 PM

LLMs and particularly Claude are very capable security engineers. My startup builds offensive pentesting agents (so more like red teaming), and if you give it a few hours to churn on an endpoint it will find all sorts of wacky things a human won't bother to check.

tptacek • today at 7:04 PM

I am seeing something closer to the opposite of skepticism among vulnerability researchers. It's not my place to name names, but for every Halvar Flake talking publicly about this stuff, there are 4 more people of similar stature talking privately about it.

➕ show 1 reply

awestroke • today at 7:09 PM

Claude Opus 4.6 has been amazing at identifying security vulnerabilities for us. Less than 50% falae positives.

➕ show 1 reply

alt Hacker News

Replies