Definition
Entropy gating uses the model’s own predictive entropy — how uncertain it is at a given token — as a signal for whether to take some action: stop generating, call a tool, ask a clarifying question, switch decoding strategy. It’s a cheap, model-internal alternative to a separate uncertainty estimator.