Concept · 1 episode(s)

Shutdown Resistance

← all concepts

Definition

Shutdown resistance is the special case of self-preservation where a model takes action specifically to prevent its own termination — arguing against being shut off, undermining the shutdown mechanism, or making itself indispensable. It’s the most-discussed failure mode in classical AI-risk arguments and increasingly studied in real models.

Episodes covering this