Glossary · Term

Model Spec

← all terms

Definition

A document describing how an AI assistant is supposed to behave and what it should value.

A natural-language specification used by labs (e.g., OpenAI, Anthropic) to describe an assistant's intended character, values, and behavior policies, typically used to guide post-training.

Also called: Constitution, spec

Mentioned in 5 episodes

  1. 075
    Growing Code and Proof Together: Verified Systems in Ten Hours Instead of a Year
  2. 058
    Why Upgrading Your AI Auditor to a Smarter Model Can Make Your System Less Safe
  3. 034
    Catching Multi-Agent Deadlocks Before Deployment With a 40-Year-Old Tool
  4. 022
    Training the Model Spec Directly: An Alignment Lever Aimed at the Say-Do Gap
  5. 014
    Why a Constrained Pipeline Beat a Full Coding Agent at Finding Bugs 30-to-1

Related concepts