Definition
An AI agent design that breaks tasks into explicit milestones and uses them to guide and reward progress.
A long-horizon agent architecture using LLM-generated subgoals both as inference-time checkpoints and as a basis for dense reward shaping during RL training.