Definition
An automated checker that judges whether an AI agent has reached each milestone in a task.
A separate LLM-based call within an agent framework that reads the agent's action history, compares it against a list of subgoals, and updates a binary progress vector (one entry per subgoal) that the next decision step consumes.
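The update step in the definition above can be sketched in Python. This is a minimal illustration under stated assumptions, not any particular framework's implementation: the names `update_progress`, `Judge`, and `keyword_judge` are hypothetical, the LLM call is replaced by a plain callable so the example runs offline, and the choice to keep a subgoal marked complete once it has been met is an assumption rather than part of the definition.

```python
from typing import Callable, List, Sequence

# Hypothetical judge signature: (action_history, subgoal) -> True if the subgoal is met.
# In a real framework this would wrap an LLM call; here it is any callable.
Judge = Callable[[Sequence[str], str], bool]


def update_progress(
    history: Sequence[str],
    subgoals: Sequence[str],
    progress: Sequence[int],
    judge: Judge,
) -> List[int]:
    """Return a new binary progress vector, one entry per subgoal."""
    if len(progress) != len(subgoals):
        raise ValueError("progress and subgoals must have the same length")
    updated = list(progress)
    for i, subgoal in enumerate(subgoals):
        # Assumption: a completed subgoal stays completed, so it is not re-judged.
        if updated[i] == 0 and judge(history, subgoal):
            updated[i] = 1
    return updated


def keyword_judge(history: Sequence[str], subgoal: str) -> bool:
    """Toy stand-in for the LLM call: a subgoal is met if any action mentions it."""
    return any(subgoal in action for action in history)


subgoals = ["open_file", "edit_file", "run_tests"]
history = ["open_file(main.py)", "edit_file(main.py)"]
progress = update_progress(history, subgoals, [0, 0, 0], keyword_judge)
print(progress)  # [1, 1, 0]
```

The returned vector is what the next decision step would read. Swapping `keyword_judge` for a function that prompts a model leaves the rest of the loop unchanged.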