Definition
Reward shaping adds extra reward terms intended to guide a policy toward useful behavior — intermediate progress bonuses, exploration incentives, penalty terms. Done well it accelerates learning; done badly it teaches the policy to chase the shape rather than the goal.