How are y’all defending your agents on the input side?
Posted by RJSabouhi@reddit | LocalLLaMA | 15 comments
Question for people building agents. The discussion around output safety I understand, but what are you doing for input-side defense?
I mean stuff like prompt injection, memory poisoning, adversarial retrieved context, malicious external feeds, speaker / identity confusion, and long-term contamination of system state.
If your agent has memory, tools, retrieval, or persistent state, how are you preventing bad inputs from warping the system upstream? I'm asking about actual implementations, not theory.
15 Comments
Equivalent_Pen8241@reddit
--Rotten-By-Design--@reddit
RJSabouhi@reddit (OP)
--Rotten-By-Design--@reddit
RJSabouhi@reddit (OP)
--Rotten-By-Design--@reddit
--Rotten-By-Design--@reddit
--Rotten-By-Design--@reddit
zipperlein@reddit
snowieslilpikachu69@reddit
RJSabouhi@reddit (OP)
sn2006gy@reddit
no_witty_username@reddit
caioribeiroclw@reddit
GroundbreakingMall54@reddit