Glossary · Term

AnswerBench

← all terms

Definition

A test set focusing on whether models can give correct final numerical answers to math problems.

An evaluation suite emphasizing final-answer correctness on math problems, used alongside IMO ProofBench and other proof-quality benchmarks.

Mentioned in 1 episode

  1. 048
    How a 30B Open Model Reached Olympiad Gold With the Right Recipe