OpenClaw user here. Genuinely curious to see if this works and how easy it turns out to be in practi...

gleipnircode • yesterday at 5:49 PM • 1 reply • view on HN

OpenClaw user here. Genuinely curious to see if this works and how easy it turns out to be in practice.

One thing I'd love to hear opinions on: are there significant security differences between models like Opus and Sonnet when it comes to prompt injection resistance? Any experiences?

Replies

datsci_est_2015 • yesterday at 5:59 PM

> One thing I'd love to hear opinions on: are there significant security differences between models like Opus and Sonnet when it comes to prompt injection resistance?

Is this a worthwhile question when it’s a fundamental security issue with LLMs? In meatspace, we fire Alice and Bob if they fail too many phishing training emails, because they’ve proven they’re a liability.

You can’t fire an LLM.

➕ show 3 replies

alt Hacker News

Replies