Concept · 1 episode(s)

Entropy Regularization

Definition

Entropy regularization adds a term to an RL or training objective that rewards keeping the policy’s output distribution spread out, preventing premature collapse to a single confident behavior. It’s a standard way to keep exploration alive without resorting to explicit exploration bonuses.

Episodes covering this

010
When Reward Climbs But Reasoning Goes Generic: Diagnosing Template Collapse in Agentic RL
RAGEN-2: Reasoning Collapse in Agentic RL
Wang, Gui, Jin et al. · Northwestern University·22 min·May 02, 2026