Glossary · Term

BAR

← all terms

Definition

A reinforcement-learning trick that keeps generating tries on a problem until you have a useful mix of wins and losses.

Balanced Adaptive Rollout, an RL rollout strategy that adaptively expands group size per prompt until the success-failure mix produces a nontrivial gradient signal.

Also called: Balanced Adaptive Rollout

Mentioned in 1 episode

  1. 047
    When Agent Benchmarks Lie: The Harness Problem in Open-Source AI