Definition
Rollout sampling is the practice of generating many independent trajectories from a policy in order to estimate its expected behavior — reward, success rate, distribution over outcomes. It’s the workhorse evaluation technique for any non-deterministic policy.
Episodes covering this
Worth reading next
Papers we haven't done a deep dive on yet, but would recommend on this topic.