Definition
A pipeline that builds verified AI-agent training data by running tools first and writing the question afterward.
A synthetic data system that explores real APIs via a tool compatibility graph, records execution graphs, and back-chains tasks whose ground-truth answers are extracted from observed outputs.