trajectory · Glossary · AI Papers: A Deep Dive

Definition

Plain language

The full record of what an AI agent did from start to finish on a task.

As stated in the literature

A sequence of states, actions, and observations produced by an agent over the course of a task, used as the unit of training data in agentic RL.

Also called: trajectories

Why it matters: Trajectories are the raw material of agent RL — both for credit assignment during training and for human review during debugging.

For example, an agent's trajectory on a flight-booking task includes every web page it viewed, every click, and every observation it received along the way.
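To make the flight-booking example concrete, here is a minimal sketch of how such a record could be stored. The `Step` and `Trajectory` classes, their field names, and the sample page and click strings are all hypothetical, chosen only to mirror the states, actions, and observations in the definition above; they are not taken from any particular framework.

```python
from dataclasses import dataclass, field
from typing import Any, List


@dataclass
class Step:
    """One step: the state the agent was in, the action it took, what it observed next."""
    state: Any
    action: Any
    observation: Any


@dataclass
class Trajectory:
    """Hypothetical container: the ordered steps an agent produced on one task."""
    task: str
    steps: List[Step] = field(default_factory=list)

    def record(self, state: Any, action: Any, observation: Any) -> None:
        # Append the next (state, action, observation) triple in order.
        self.steps.append(Step(state, action, observation))

    def actions(self) -> List[Any]:
        # The actions alone, in the order the agent took them.
        return [step.action for step in self.steps]


# Illustrative data only: a three-step flight-booking run.
traj = Trajectory(task="book a flight")
traj.record("search page", "type 'SFO to JFK'", "results list shown")
traj.record("results list", "click first result", "fare details shown")
traj.record("fare details", "click 'Book'", "confirmation page shown")
```

Keeping the steps in one ordered list is what lets the same record serve both uses named above: a training loop can iterate over it, and a human can replay it step by step when debugging.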

Heard on the show

“And a real robotics engineer, faced with that, would replay the run, inspect the camera overlays, look at the trajectories, figure out which subsystem broke — and then remember the fix for next time.”

Episode 194 — How a Robot Builds a Debugging Notebook It Can Read, Edit, and Hand to Another Robot

Mentioned in 87 episodes

Related concepts

Agentic RL, Mixed-Policy Training, Monte Carlo Tree Search, Process Reward Models, Reward Variance, Rollout Sampling, Rollout Summarization, Termination Poisoning, TracIn

Related terms

agent, reinforcement learning