Definition
A hybrid open-weight language model that mixes attention layers with state-space layers.
AI21 Labs' hybrid SSM/Transformer foundation model, demonstrating the production move away from pure-attention architectures.
Mentioned in 2 episodes
053
027