Glossary · Term

rubric

← all terms

Definition

A natural-language checklist describing what a good answer to a question looks like.

A structured natural-language criterion list used in rubrics-as-rewards training to provide process-level supervision signal; expensive due to per-problem authoring requirements.

Also called: rubrics-as-rewards, rubrics

Mentioned in 5 episodes

  1. 079
    An Old Idea From Cognitive Psychology Reshapes How We Reward Reasoning Models
  2. 052
    An Old Reinforcement Learning Tradeoff Sneaks Back Into LLM Agents
  3. 044
    How One Sentence and a Forged History Flip the Most Aligned Models
  4. 025
    The Missing Gradient Term That Predicts Sycophancy in RLHF
  5. 019
    When the Best Reward Model Trains the Worst Policy: Inside EvoLM

Related concepts