Definition
A knob that controls how creative or how predictable a model's output is.
A positive scalar that divides the logits before the softmax during sampling; values above 1 flatten the distribution and produce more diverse outputs, while values below 1 sharpen it and make generation more deterministic.
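
As a rough illustration of the technical definition, the following Python sketch divides a list of logits by the scalar (called `temperature` here) before applying softmax. The function name `softmax_with_temperature`, its parameter names, and the example logits are hypothetical, chosen only for illustration; real frameworks may implement this step differently.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Return the probabilities given by softmax(logits / temperature).

    Illustrative helper, not taken from any particular library.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Divide each logit by the temperature before normalizing.
    scaled = [x / temperature for x in logits]
    # Subtract the maximum for numerical stability; the result is unchanged.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for a three-token vocabulary.
example_logits = [2.0, 1.0, 0.0]
for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(example_logits, t)
    print(t, [round(p, 3) for p in probs])
```

With these example logits, the probability of the top-scoring token shrinks as the temperature rises and the probability of the lowest-scoring token grows, which is the flattening described above.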