Definition
Training examples generated by AI rather than collected from the real world.
Data produced by simulation or by other models — often used to scale training in agentic, math, and code domains where verifiable rewards or rubric trees can be constructed.