Definition
Long-context models accept and reason over very large inputs — hundreds of thousands or millions of tokens. The headline number on the spec sheet is rarely the same as the effective context: useful long-context work involves architecture, training, and serving choices all the way down.
Episodes covering this
Worth reading next
Papers we haven't done a deep dive on yet, but would recommend on this topic.