How are y’all defending your agents on the input side?

Posted by RJSabouhi@reddit | LocalLLaMA | 15 comments

Question for people building agents. I understand the discussion around output safety, but what are you doing for input-side defense? I mean stuff like prompt injection, memory poisoning, adversarial retrieved context, malicious external feeds, speaker / identity confusion, and long-term contamination of system state. If your agent has memory, tools, retrieval, or persistent state, how are you preventing bad inputs from warping the system upstream? I'm asking about actual implementations, not theory.