reasoning model · Glossary · AI Papers: A Deep Dive

Definition

Plain language

A language model trained to write out its thinking before giving an answer.

As stated in the literature

A class of models post-trained to produce extended chains of thought, often via RL on verifiable rewards, before emitting final answers.

Also called: reasoning models

Why it matters: Extended chains of thought substantially improve accuracy on hard problems, at the cost of more tokens and higher latency per query.

For example, when asked a tricky math problem, the model first writes several paragraphs of step-by-step working before producing its final boxed answer.
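To make "verifiable reward" concrete, here is a minimal sketch of how a final boxed answer might be scored during RL post-training. It assumes the answer is wrapped in a LaTeX-style `\boxed{...}` and that an exact string match against a reference is good enough. The function names `extract_boxed` and `verifiable_reward` are hypothetical, chosen for illustration, and real training pipelines typically normalize answers far more carefully.

```python
from typing import Optional

BOX_OPEN = "\\boxed{"


def extract_boxed(text: str) -> Optional[str]:
    # Find the last "\boxed{" so that earlier scratch work is ignored.
    start = text.rfind(BOX_OPEN)
    if start == -1:
        return None
    i = start + len(BOX_OPEN)
    depth = 1  # we are inside the opening brace of \boxed{
    chars = []
    while i < len(text):
        ch = text[i]
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                # Matching close brace found: return the contents.
                return "".join(chars).strip()
        chars.append(ch)
        i += 1
    return None  # braces never balanced, so there is no usable answer


def verifiable_reward(completion: str, reference: str) -> float:
    # Assumed scoring rule: 1.0 for an exact match with the reference, else 0.0.
    answer = extract_boxed(completion)
    if answer is not None and answer == reference.strip():
        return 1.0
    return 0.0


if __name__ == "__main__":
    completion = (
        "First, 12 * 12 = 144. Then 144 + 25 = 169, and sqrt(169) = 13.\n"
        "Final answer: \\boxed{13}"
    )
    print(verifiable_reward(completion, "13"))
```

Only the boxed answer is checked here; the step-by-step working before it is not graded. That is why this kind of reward counts as "verifiable": it can be computed automatically, without a human judging the reasoning.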

Heard on the show

“On GPQA Diamond, a graduate-level science exam, the reasoning model wins by about nineteen points.”

Episode 197 — Twin Problems Suggest AI Reasoning Gains Are Mostly Better Fact Recall

Mentioned in 31 episodes

Related concepts

Iterative Training · Reasoning Collapse · RL Post-Training

Related terms

chain of thought · post-training · reinforcement learning · verifiable reward