Definition
Adversarial review is the practice of evaluating a system — code, a model output, a security claim — with the explicit goal of finding ways it’s wrong, rather than confirming it’s right. It assumes a competent attacker and asks “how would I break this?” instead of “does this look correct?”