Definition
A speed trick where multiple AI runs that share the same opening context compute that opening only once.
In Forge, an exact (non-approximate) optimization that computes a shared prompt prefix once for many sampled rollouts, then branches into their divergent continuations, yielding up to roughly 40x training speedup.
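
Example
A minimal Python sketch of the idea, not Forge's actual implementation. The names ToyModel, State, rollouts_naive, and rollouts_shared are hypothetical, and the rolling hash is only a stand-in for whatever per-token state a real model caches (for example a key/value cache). It shows that processing the shared prefix once and branching from the saved state gives the same results as recomputing the prefix for every rollout, while touching far fewer tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class State:
    """Toy stand-in for cached prefix state (hypothetical, e.g. a KV cache)."""
    digest: int
    length: int


class ToyModel:
    """Hypothetical model that counts how many tokens it has processed."""

    def __init__(self):
        self.tokens_processed = 0

    def step(self, state, token):
        # One unit of "expensive" work per token.
        self.tokens_processed += 1
        return State((state.digest * 31 + token) % 1_000_003, state.length + 1)

    def encode(self, tokens, state=State(0, 0)):
        # Fold the tokens into the state, starting from `state`.
        for token in tokens:
            state = self.step(state, token)
        return state


def rollouts_naive(model, prefix, continuations):
    # Baseline: every rollout re-processes the full prefix.
    return [model.encode(prefix + c) for c in continuations]


def rollouts_shared(model, prefix, continuations):
    # Prefix sharing: process the prefix once, then branch from its state.
    prefix_state = model.encode(prefix)
    return [model.encode(c, prefix_state) for c in continuations]


if __name__ == "__main__":
    prefix = list(range(100))                           # shared opening context
    continuations = [[i] * 5 for i in range(8)]         # 8 divergent rollouts

    naive_model, shared_model = ToyModel(), ToyModel()
    naive = rollouts_naive(naive_model, prefix, continuations)
    shared = rollouts_shared(shared_model, prefix, continuations)

    print(naive == shared)                  # identical final states
    print(naive_model.tokens_processed)     # tokens touched without sharing
    print(shared_model.tokens_processed)    # tokens touched with sharing
```

With a 100-token prefix and eight 5-token continuations, the baseline processes 8 x 105 = 840 tokens and the shared version processes 100 + 8 x 5 = 140, with identical outputs. The saving grows with the prefix length and the number of rollouts per prefix; the figure in this toy example says nothing about the speedup Forge itself reports.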