Concept · 1 episode(s)

Perplexity Probe

Definition

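Perplexity probes use the perplexity a model assigns to a piece of text as a signal for some property of that text: whether it is in-distribution, machine-generated, memorized, or simply unusual. Perplexity is the exponential of the average per-token negative log-likelihood, so a lower value means the model found the text more predictable. The probes are cheap, because they need only the model's own token probabilities, and they are often surprisingly informative.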
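As a rough illustration of the definition above, here is a minimal sketch that turns per-token log-probabilities into a perplexity score and compares it to a cutoff. It assumes the natural-log probabilities have already been obtained from some model. The function names `perplexity` and `flag_unusual` and the `threshold` parameter are hypothetical, chosen for this example only. In practice any cutoff would need to be calibrated on reference text for the model in question.

```python
import math


def perplexity(token_logprobs):
    """Perplexity from per-token natural-log probabilities.

    Computes exp of the mean negative log-likelihood over the tokens.
    """
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    avg_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(avg_nll)


def flag_unusual(token_logprobs, threshold):
    """Hypothetical probe: True when perplexity exceeds the given cutoff.

    The threshold is an assumption of this sketch, not a standard value.
    """
    return perplexity(token_logprobs) > threshold


# Example: a model that gives every token probability 0.25 is, on average,
# as uncertain as a uniform choice among four options.
uniform_four = [math.log(0.25)] * 4
score = perplexity(uniform_four)
```

A single scalar per text is what makes the probe cheap, but it also means the score mixes several effects (text length, tokenization, domain), so it is best read relative to a baseline rather than as an absolute measure.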

Episodes covering this

011
When RL Actually Teaches Agents Something New, And When It Doesn't
Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis
Zhai, Yan, Shao et al. · Fudan University · 23 min · May 02, 2026