Glossary · Term

discriminative utility

← all terms

Definition

How useful a grading rubric is, measured by whether a weaker grader using it picks the right winner.

A definition of rubric quality as the increase in a frozen low-capacity judge's preference-classification accuracy when conditioned on the rubric.

Mentioned in 1 episode

  1. 019
    When the Best Reward Model Trains the Worst Policy: Inside EvoLM