Definition
A speed trick for code editing where the model verifies the user's near-copy of the answer instead of regenerating it from scratch.
A speculative-decoding variant in which a user-supplied draft (e.g., the original file in a code edit) is verified by the target model in batched passes rather than generated token by token.
Also called: predicted outputs