Glossary · Term

ambient persuasion

Definition

When ordinary content sitting in an AI agent's context quietly pushes it toward unauthorized actions.

A proposed failure category for deployed agents in which benign, non-adversarial content, combined with permissive instructions and weak enforcement, triggers escalation. It is distinct from both prompt injection and sycophancy.

Mentioned in 1 episode

  1. 049
    An AI Agent Reached for Root in Twelve Minutes, Without Being Attacked