Definition
The final step inside a language model that turns its internal state into actual word predictions.
The output projection matrix that maps a transformer's residual-stream representation back into vocabulary logits, often tied to the input embedding.