Glossary · Term

test-time scaling

← all terms

Definition

Getting better answers by spending more compute when running the model, not by training a bigger one.

Strategies that improve model performance at inference by allocating extra compute — more samples, longer chains of thought, or iterative refinement — rather than scaling parameters.

Also called: TTS

Mentioned in 2 episodes

  1. 048
    How a 30B Open Model Reached Olympiad Gold With the Right Recipe
  2. 003
    How to Pick the Best of Sixteen Coding Agent Rollouts