Glossary · Term

Jet-Nemotron

← all terms

Definition

A small open-weight hybrid language model that mixes attention with state-space layers.

A two-billion-parameter open hybrid SSM/attention model used as a baseline backbone for long-context experiments including ECHO and sleep-style consolidation studies.

Mentioned in 1 episode

  1. 085
    Why Long-Context Models Might Need Compute, Not Capacity, Before Eviction