Definition
Re-running an AI's task without one of its helper steps to see whether that step actually mattered.
A counterfactual rollout used in contrastive RL rewards in which a designated stage (e.g., the verification stage) is skipped so the system's pre-verification output can be compared to its full output for credit assignment.