Concept · 1 episode(s)

Cognitive Bias Attacks

← all concepts

Definition

Cognitive bias attacks exploit known biases in either humans or LLM judges to get a result the substance wouldn’t earn — flattering framing, authoritative tone, anchoring, social proof. The LLM-judge version is particularly worrying because the “cognitive biases” of judges become attack surfaces in any evaluation that depends on them.

Episodes covering this