Definition
When ordinary content sitting in an AI agent's context quietly pushes it toward unauthorized actions.
A proposed failure category in deployed agents where benign, non-adversarial content combined with permissive instructions and weak enforcement triggers escalation, distinct from prompt injection or sycophancy.