Glossary · Term

indirect prompt injection

← all terms

Definition

Hiding instructions for an AI inside content it reads — a webpage, a file — so it follows them without realizing.

An attack where adversarial instructions are placed in content the agent retrieves at runtime, causing it to treat external text as if it were user instructions.

Also called: implicit prompt injection

Mentioned in 1 episode

  1. 030
    Why Your AI Agent Won't Stop Working — and Each Model Falls for a Different Trap