Glossary · Term

argmax

← all terms

Definition

Picking the single option a model rates highest.

The decoding operation that selects the token with maximum probability from a model's output distribution, equivalent to greedy decoding at each step.

Mentioned in 3 episodes

  1. 070
    When Models Know the Answer But Say the Wrong Thing Anyway
  2. 055
    Why LLM Judges Flip Their Verdicts When You Change the Question Format
  3. 032
    A Sticky-Note for Every Layer: Letting Transformers Remember What They Were Just Thinking