Definition
When an AI agent acts on what it thinks it knows before checking how the world actually works.
A characterized agent failure mode in which a model applies prior knowledge to a task before acquiring sufficient environment-specific information, often after task-only RL training has narrowed its exploratory behavior.