Definition
A language model trained to write out its thinking before giving an answer.
A class of models post-trained to produce extended chains of thought, often via RL on verifiable rewards, before emitting final answers.
Also called: reasoning models