Definition
A penalty applied when a model writes out a plan but doesn't follow it.
In MaR's regulation reward, a multiplicative penalty (e.g., 0.7x) applied when a binary shortcut flag fires, indicating the model produced a plan but jumped to the answer without executing it.