Glossary · Term

perplexity

← all terms

Definition

A score for how surprised a model is by text it's reading — lower means it expected what it saw.

The exponential of average per-token cross-entropy under a model, a standard measure of language modeling quality where lower is better.

Mentioned in 3 episodes

  1. 048
    How a 30B Open Model Reached Olympiad Gold With the Right Recipe
  2. 041
    When the Iteration Teaches the Model to Skip the Iteration
  3. 011
    When RL Actually Teaches Agents Something New, And When It Doesn't

Related concepts