Glossary · Term

Autellix

← all terms

Definition

A program-aware LLM serving scheduler that understands agent session structure.

An LLM serving system that schedules requests with awareness of agent program structure (sessions, tool calls), but without holistic visibility into physical resource pressure; a baseline against which MARS is compared.

Mentioned in 1 episode

  1. 016
    Why Your Coding Agent Stalls While the GPU Runs Hot