Hacker News

lufenialif2, yesterday at 12:18 AM

Curious how you secure something that has data exfiltration as a feature.


Replies

CuriouslyC, yesterday at 1:01 AM

Mitigate prompt injection to the best of your ability, implement a policy layer over all capabilities, and isolate capabilities within the system so that if one part gets compromised you can quarantine the result safely. It's really not much different from securing human systems. If you want more details, there are a lot of AI security articles; I like https://sibylline.dev/articles/2026-02-15-agentic-security/ as a simple primer.
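To make the policy-layer and isolation ideas a bit more concrete, here is a rough Python sketch. Everything in it is hypothetical and for illustration only: the `Capability` and `PolicyLayer` classes, the caller names, and the `read_file` / `send_email` capabilities are assumptions, not taken from the linked article or any real framework. It shows one possible shape: every capability call goes through a single allowlist check, and a capability suspected of being compromised can be quarantined so later calls are refused.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Set


@dataclass
class Capability:
    """A single tool the agent can use (hypothetical structure)."""
    name: str
    run: Callable[[str], str]
    quarantined: bool = False


@dataclass
class PolicyLayer:
    """Gatekeeper that every capability call must pass through."""
    # Maps a caller (one component of the agent system) to the
    # capability names it is allowed to invoke.
    allowed: Dict[str, Set[str]]
    capabilities: Dict[str, Capability] = field(default_factory=dict)

    def register(self, cap: Capability) -> None:
        self.capabilities[cap.name] = cap

    def quarantine(self, name: str) -> None:
        # Mark a capability as untrusted; invoke() will refuse it.
        self.capabilities[name].quarantined = True

    def invoke(self, caller: str, name: str, arg: str) -> str:
        cap = self.capabilities.get(name)
        if cap is None:
            raise PermissionError(f"unknown capability: {name}")
        # Deny by default: callers only get what is explicitly listed.
        if name not in self.allowed.get(caller, set()):
            raise PermissionError(f"{caller} may not use {name}")
        if cap.quarantined:
            raise PermissionError(f"{name} is quarantined")
        return cap.run(arg)


# Hypothetical setup: the component that reads untrusted content
# cannot send email, and the component that sends email cannot read files.
policy = PolicyLayer(
    allowed={"summarizer": {"read_file"}, "mailer": {"send_email"}}
)
policy.register(Capability("read_file", lambda path: f"contents of {path}"))
policy.register(Capability("send_email", lambda body: f"sent: {body}"))
```

With this setup, `policy.invoke("summarizer", "read_file", "notes.txt")` succeeds, while `policy.invoke("summarizer", "send_email", "...")` raises `PermissionError`, so a prompt-injected summarizer has no direct path to exfiltrate what it read. A real system would also need process or network isolation behind each capability; an in-process check like this only illustrates the policy idea.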
