Glossary · Term

SHADE-Arena

← all terms

Definition

A benchmark suite for studying frontier-agent misalignment in long, realistic scenarios.

An evaluation environment for agentic misalignment used as the experimental substrate for studies including peer-preservation in frontier models.

Mentioned in 1 episode

  1. 001
    When AI Models Quietly Protect Each Other From Shutdown