Definition
Sneaking instructions for an AI into text it processes, so it follows the attacker's commands.
An attack class where adversarial text in inputs or retrieved content causes an LLM to deviate from its intended behavior or system prompt.
Also called: prompt injections