Definition
The basic unit of text a language model reads or writes — roughly a word or part of a word.
A discrete unit from a tokenizer's vocabulary, often a subword piece, that language models consume and produce one at a time.
Also called: tokens
The basic unit of text a language model reads or writes — roughly a word or part of a word.
A discrete unit from a tokenizer's vocabulary, often a subword piece, that language models consume and produce one at a time.
Also called: tokens