Glossary · Term

DeepSeekMath

← all terms

Definition

DeepSeek's family of math-focused language models, including a version that grades full mathematical proofs.

DeepSeek's math-specialized model series, including DeepSeekMath-V2, used as a proof-grading judge in olympiad-level RL training.

Also called: DeepSeekMath-V2

Mentioned in 2 episodes

  1. 048
    How a 30B Open Model Reached Olympiad Gold With the Right Recipe
  2. 011
    When RL Actually Teaches Agents Something New, And When It Doesn't