Definition
The principal–agent problem is the classic mismatch where a principal hires an agent to act on their behalf, but the agent has different interests and better information. It’s the economics frame for most AI alignment concerns: the user is the principal, the AI is the agent, and the gap between intent and behavior is the problem.