Concept · 1 episode(s)

Model Spec

← all concepts

Definition

A model spec is a written statement of how a model is supposed to behave: its priorities, its rules, the trade-offs it should make when those rules conflict. Public model specs (like Anthropic’s and OpenAI’s) double as policy documents and as training targets.

Episodes covering this

Worth reading next

Papers we haven't done a deep dive on yet, but would recommend on this topic.