Definition
A benchmark suite for testing whether sequence models can handle very long inputs.
A standardized benchmark for evaluating long-context sequence models on tasks like ListOps, image classification, and document matching, used in AIRA-Design's attention-mechanism experiments.
Also called: LRA