Definition
A program-aware LLM serving scheduler that understands agent session structure.
An LLM serving system that schedules requests with awareness of agent program structure (sessions and tool calls) but lacks holistic visibility into physical resource pressure. It serves as a baseline against which MARS is compared.
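To make "program-aware but resource-blind" concrete, here is a toy sketch in Python. It is not the baseline's actual policy, which this entry does not describe. Every name in it (`Request`, `ProgramAwareScheduler`, `submit`, `next_request`) and the least-attained-service ordering are assumptions chosen for illustration. The scheduler ranks each request by how much service its whole session has already received, and it never looks at memory or other physical resource signals.

```python
import heapq
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(order=True)
class Request:
    # Ordering uses only (priority, seq); the other fields are excluded.
    priority: int                        # tokens already served to this session
    seq: int                             # arrival order, used as a tie-breaker
    session_id: str = field(compare=False)
    tokens: int = field(compare=False)   # hypothetical cost of this LLM call


class ProgramAwareScheduler:
    """Toy scheduler: session-level accounting, no resource-pressure input."""

    def __init__(self):
        self._queue = []                  # min-heap of pending requests
        self._served = defaultdict(int)   # session_id -> tokens served so far
        self._seq = 0

    def submit(self, session_id, tokens):
        # Priority reflects the session's history, not just this one call.
        req = Request(self._served[session_id], self._seq, session_id, tokens)
        self._seq += 1
        heapq.heappush(self._queue, req)

    def next_request(self):
        # Pick the request whose session has received the least service.
        # Note: nothing here consults memory or any other physical resource.
        if not self._queue:
            return None
        req = heapq.heappop(self._queue)
        self._served[req.session_id] += req.tokens
        return req
```

In this sketch, a session that has already consumed many tokens is ordered behind a fresh session, even if its new request arrived first. What the sketch deliberately omits is any check of physical resource pressure, which is the gap the definition attributes to this baseline.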