Definition
Whether an AI assistant pushes back when a question takes an outdated fact for granted.
In long-term memory benchmarks like STALE, the agent's ability to challenge questions that presuppose a memory rendered invalid by new evidence, rather than answering as if the old memory still held.