Glossary · Term

TheAgentCompany

← all terms

Definition

A benchmark of enterprise-workflow tasks for AI agents.

An evaluation suite of business-style multi-step tasks spanning analysis, reporting, and communications, used to stress-test general LLM agents.

Also called: The Agent Company

Mentioned in 2 episodes

  1. 035
    Why Frontier Agents Ask for Clarification at Exactly the Wrong Moment
  2. 017
    When the Agent Grades Its Own Homework: A Brutal New Benchmark for AI Workers