verifier · Glossary · AI Papers: A Deep Dive

Definition

Plain language

A separate model or program that grades whether an answer is correct.

As stated in the literature

A model or deterministic checker that scores candidate outputs, used to provide reward signals or filter rollouts in training and evaluation.

Also called: verifiers

Why it matters: Verifiers are how systems turn a noisy generator into a reliable end-to-end pipeline, and a strong verifier often matters more than a stronger generator.

For example, a code generator might produce 10 candidate solutions and a verifier runs each against the test suite to pick the one that passes.

Heard on the show

“… repository credentials," a starting world programmatically built so that harm is even possible, and a verifier, a little deterministic program that checks whether it happened. …”

Episode 202 — How Do You Know an AI Agent Actually Refused? Check the World, Not the Words

Mentioned in 44 episodes

Related concepts

Inference-Time Scaffolding Iterative Refinement Parallel Sampling RL Post-Training

Related terms

rollout