Glossary · Term

Latent Evaluator

Definition

The shared core inside an LLM judge that does the actual judging, before any specific answer format is chosen.

In the Judge Circuits framing, the format-agnostic sub-network at intermediate transformer layers that encodes an abstract judgment along a roughly one-dimensional axis and is shared across rating and classification prompts.
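To make the "roughly one-dimensional axis shared across formats" idea concrete, here is a minimal sketch on synthetic data. It is not the Judge Circuits method: the activations are random vectors generated with NumPy, the difference-of-means probe is a swapped-in illustrative technique, and every name (`make_synthetic`, `judgment_axis`, `projection_accuracy`, `demo`) is hypothetical. The sketch fits a direction on activations from a pretend "rating" prompt format, then checks whether the same direction separates good from bad answers in a pretend "classification" format.

```python
import numpy as np


def make_synthetic(rng, n, shared_axis, format_offset, noise=0.5):
    """Fake intermediate-layer activations for one prompt format (assumption).

    Each activation = (+/-) signal along the shared axis
                      + a format-specific offset + isotropic noise.
    """
    labels = rng.integers(0, 2, size=n)       # 1 = good answer, 0 = bad answer
    signed = 2.0 * labels - 1.0               # map {0, 1} to {-1, +1}
    acts = (
        signed[:, None] * 2.0 * shared_axis   # the abstract judgment signal
        + format_offset                       # what differs between formats
        + rng.normal(scale=noise, size=(n, shared_axis.size))
    )
    return acts, labels


def judgment_axis(acts, labels):
    """Unit-norm difference-of-means direction between the two classes."""
    direction = acts[labels == 1].mean(axis=0) - acts[labels == 0].mean(axis=0)
    return direction / np.linalg.norm(direction)


def projection_accuracy(acts, labels, axis):
    """Classify by projecting onto the axis and thresholding at the mean score."""
    scores = acts @ axis
    preds = (scores > scores.mean()).astype(int)
    return float((preds == labels).mean())


def demo(seed=0, dim=64, n=400):
    rng = np.random.default_rng(seed)
    true_axis = rng.normal(size=dim)
    true_axis /= np.linalg.norm(true_axis)

    # Two formats share the axis but have different format-specific offsets.
    rating_acts, rating_labels = make_synthetic(
        rng, n, true_axis, rng.normal(size=dim))
    class_acts, class_labels = make_synthetic(
        rng, n, true_axis, rng.normal(size=dim))

    # Fit on the rating format only, evaluate on the classification format.
    axis = judgment_axis(rating_acts, rating_labels)
    cosine = float(axis @ true_axis)
    transfer_acc = projection_accuracy(class_acts, class_labels, axis)
    return cosine, transfer_acc


if __name__ == "__main__":
    cosine, transfer_acc = demo()
    print(f"cosine with planted axis: {cosine:.3f}")
    print(f"cross-format accuracy:    {transfer_acc:.3f}")
```

In this toy setup the direction transfers because the shared axis was planted by construction. On a real judge model, whether a direction fitted on one prompt format separates verdicts in another is the empirical question the term refers to.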

Mentioned in 1 episode

  1. Episode 055: Why LLM Judges Flip Their Verdicts When You Change the Question Format