IMO ProofBench

Definition

A benchmark that grades the quality of an AI's mathematical proofs, not just whether the final answer is right.

An evaluation suite that scores complete proofs for correctness and rigor on olympiad-style problems.

Mentioned in 1 episode

  1. 048
    How a 30B Open Model Reached Olympiad Gold With the Right Recipe