Definition
MiniMax's custom training system for teaching AI agents through long, varied real-world trajectories.
MiniMax's RL infrastructure for agent training, with decoupled model, agent, and trainer modules, windowed FIFO scheduling, and prefix tree merging for shared-context efficiency.