I don’t think we have the right mental models of LMM security yet. The lethal trifecta identifies ma...

black_knight • today at 4:58 AM • 0 replies • view on HN

I don’t think we have the right mental models of LMM security yet. The lethal trifecta identifies many of the dangerous situations, but only describes the negative space of a solution.

Speculation: I think we must accept that prompt injection happens, and structure the security of the rest of the system around that. Data given to an LLM becomes an agent, so maybe we must give permissions to this data, instead of to the LLM. Not sure exactly how this would look like in practice!

alt Hacker News