Glossary · Term

LHAW

Definition

A benchmark of deliberately underspecified long-horizon tasks used to study when LLM agents should ask for clarification versus proceed under ambiguity.

Mentioned in 1 episode

  1. 035
    Why Frontier Agents Ask for Clarification at Exactly the Wrong Moment