chain of thought · Glossary · AI Papers: A Deep Dive

Definition

Plain language

When a model writes out its reasoning step by step before giving an answer.

As stated in the literature

A prompting and generation strategy where the model produces intermediate reasoning tokens before its final answer, often improving accuracy on multi-step tasks.

Also called: CoT, chain-of-thought, chain-of-thought reasoning, chains of thought

Why it matters: Letting the model spend tokens on intermediate steps often turns problems it would otherwise fumble into ones it can solve reliably.

For example, asked how many tennis balls fit in a suitcase, the model writes out estimates of a ball's volume and the suitcase's volume before giving a final number.

Heard on the show

“Writing your reasoning down offloads what you'd otherwise carry in the workspace — the authors' reading, not a dissected mechanism, but it would explain a lot about why chain-of-thought works.”

Episode 203 — The Thought a Model Doesn't Say — and the Lens That Reads It

Mentioned in 38 episodes

Related concepts

CoT Faithfulness Deliberative Alignment Reasoning Collapse RL for Reasoning

Related terms

token