Prompt injection attacks are very much a thing. It doesn't matter how good the agent is, its vulnerable, and you don't know what you don't know.
Where are we at with SOTA or reliable prompt injection detection mechanisms?
Where are we at with SOTA or reliable prompt injection detection mechanisms?