Glossary · Term

shadow pass

← all terms

Definition

Re-running an AI's task without one of its helper steps to see whether that step actually mattered.

A counterfactual rollout used in contrastive RL rewards in which a designated stage (e.g., the verification stage) is skipped so the system's pre-verification output can be compared to its full output for credit assignment.

Mentioned in 1 episode

  1. 051
    Why Parallel Sampling Plateaus, And What Evidence Graphs Do Instead