
GPQA-Diamond


Definition

A small set of extremely hard graduate-level science questions used to test AI reasoning.

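The hardest subset (198 questions) of GPQA, a benchmark of expert-written, expert-validated multiple-choice questions in physics, chemistry, and biology. The questions are "Google-proof": skilled non-experts struggle to answer them even with unrestricted web access.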

Also called: GPQA, G-P-Q-A-Diamond

Mentioned in 3 episodes

  1. 079: An Old Idea From Cognitive Psychology Reshapes How We Reward Reasoning Models
  2. 058: Why Upgrading Your AI Auditor to a Smarter Model Can Make Your System Less Safe
  3. 032: A Sticky-Note for Every Layer: Letting Transformers Remember What They Were Just Thinking