Glossary · Term

long-context

Definition

Models or tasks where the input can be tens or hundreds of thousands of words long.

The regime where transformer input sequences are large enough that attention cost, memory pressure, and degradation of recall become primary engineering and modeling concerns.

Mentioned in 7 episodes

090
How MiniMax-M2 Bets That Sparsity Plus Verifiable Rewards Can Match Frontier Agents
085
Why Long-Context Models Might Need Compute, Not Capacity, Before Eviction
079
An Old Idea From Cognitive Psychology Reshapes How We Reward Reasoning Models
036
Sparse Attention Was the Wrong Frame. Treat It as Geometry Instead.
033
Echo: The Paper Arguing You Never Needed a KV Cache for Retrieval
028
Teaching a Model to Hire Copies of Itself: Recursive Agent Optimization
016
Why Your Coding Agent Stalls While the GPU Runs Hot

Related concepts

Hybrid SSM/Attention Long Context