Definition
A measure of how much one probability distribution differs from another.
Kullback-Leibler divergence, an asymmetric measure of relative entropy used in RL training to penalize policy drift from a reference distribution.
Also called: KL, K-L, KL-divergence