Glossary · Term

reasoning model

← all terms

Definition

A language model trained to write out its thinking before giving an answer.

A class of models post-trained to produce extended chains of thought, often via RL on verifiable rewards, before emitting final answers.

Also called: reasoning models

Mentioned in 12 episodes

  1. 079
    An Old Idea From Cognitive Psychology Reshapes How We Reward Reasoning Models
  2. 077
    Reading a Model's Confidence Curve to Decide When Chain-of-Thought Is Worth It
  3. 076
    Same Model, Organized Differently: How an Agent Architecture Beat Frontier Systems at Research Math
  4. 074
    How a Fifteen-Hundred-Dollar Training Run Matched Llama and Gemma on Reasoning
  5. 058
    Why Upgrading Your AI Auditor to a Smarter Model Can Make Your System Less Safe
  6. 057
    How Uber Caught 206 Leaked Credentials With an LLM-Powered Security Stack
  7. 054
    When Models Learn the Monitor Exists, the Reasoning Trace Stops Being a Window
  8. 048
    How a 30B Open Model Reached Olympiad Gold With the Right Recipe
  9. 041
    When the Iteration Teaches the Model to Skip the Iteration
  10. 036
    Sparse Attention Was the Wrong Frame. Treat It as Geometry Instead.
  11. 015
    The Audit Number Isn't What You Think: Sycophancy and the Case Against Single-Prompt Bias Tests
  12. 009
    How Two Silent Library Bugs Quietly Invalidated a Wave of Reasoning Papers

Related concepts