Definition
Storing rare past successes and showing them to a model again during training.
A reinforcement-learning technique that maintains a buffer of past trajectories—often rare successful ones—and samples from it during training to improve learning on sparse-reward tasks.
Also called: replay buffer