Glossary · Term

Cohen's kappa

← all terms

Definition

A score for how much two raters agree, beyond what you'd expect by chance.

An inter-rater agreement statistic correcting raw concordance for chance, commonly used to validate LLM judges against human annotators.

Mentioned in 1 episode

  1. 058
    Why Upgrading Your AI Auditor to a Smarter Model Can Make Your System Less Safe