Definition
FrontierMath is a benchmark of original, expert-level mathematics problems designed to resist the training-data contamination and score saturation that plague other math evals. Its problems are written by research mathematicians and are intended to remain genuinely hard for the foreseeable future.
Episodes covering this
Worth reading next
Papers we haven't done a deep dive on yet, but would recommend on this topic.