Glossary · Term

LongBench

← all terms

Definition

A benchmark suite for testing how well models handle long documents.

A multi-task evaluation suite for long-context understanding spanning summarization, retrieval, and reasoning over extended inputs.

Mentioned in 1 episode

  1. 036
    Sparse Attention Was the Wrong Frame. Treat It as Geometry Instead.