Definition
The initial, expensive training phase in which a model learns from a huge amount of text.
The first stage of training a language model on a large unlabeled corpus via self-supervised objectives like next-token prediction.
Also called: pretrained, pre-training, pretrain
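
Example
To make "self-supervised" concrete, here is a minimal sketch of how next-token prediction gets its training labels from raw text alone: each token's target is simply the token that follows it. The toy corpus, the helper names (next_token_pairs, average_cross_entropy), and the uniform stand-in model are assumptions made for illustration. Real pretraining uses a neural network and a corpus many orders of magnitude larger.

```python
import math


def next_token_pairs(tokens):
    """Pair each token with the token that follows it (input, target)."""
    return list(zip(tokens[:-1], tokens[1:]))


def average_cross_entropy(pairs, predict):
    """Mean negative log-probability assigned to the true next token."""
    total = sum(math.log(predict(context)[target]) for context, target in pairs)
    return -total / len(pairs)


# Toy stand-in for a large unlabeled corpus (assumption for illustration).
corpus = "the cat sat on the mat".split()
vocab = sorted(set(corpus))

# No human labels are needed: the targets come from the text itself.
pairs = next_token_pairs(corpus)


def uniform_model(context):
    # Hypothetical untrained model: every vocabulary token is equally likely.
    return {token: 1.0 / len(vocab) for token in vocab}


# Training would adjust the model to push this loss down.
loss = average_cross_entropy(pairs, uniform_model)
print(pairs)
print(loss)
```

With the uniform model, the loss equals the natural log of the vocabulary size, which is the baseline a model starts near before it has learned anything from the text.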