Definition
A compact open-weight language model that runs the same layers several times per forward pass, gaining effective depth for reasoning without adding parameters.
A 1.4B-parameter looped attention model used as a backbone in depth-recurrence and consolidation studies; it can be retrofitted with state-space model (SSM) layers for hybrid experiments.
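As a rough illustration of what "looped" means here, the sketch below applies one shared block repeatedly, so effective depth grows with the loop count while the parameter set stays fixed. It is a toy under stated assumptions, not the model's actual architecture: `make_block`, `looped_forward`, and the scalar `weight` and `bias` are hypothetical stand-ins for a real attention block and its weights.

```python
from typing import Callable, List

Vector = List[float]


def make_block(weight: float, bias: float) -> Callable[[Vector], Vector]:
    """Toy stand-in for a block of layers: one shared (weight, bias) pair."""
    def block(x: Vector) -> Vector:
        # Residual-style update: each pass refines x instead of replacing it.
        return [xi + weight * xi + bias for xi in x]
    return block


def looped_forward(x: Vector, block: Callable[[Vector], Vector], n_loops: int) -> Vector:
    """Apply the same block n_loops times (depth recurrence with shared weights)."""
    for _ in range(n_loops):
        x = block(x)
    return x


# One block, i.e. one set of parameters, reused at two different depths.
block = make_block(weight=0.5, bias=1.0)
h1 = looped_forward([1.0, 2.0], block, n_loops=1)
h3 = looped_forward([1.0, 2.0], block, n_loops=3)
```

Here `h3` comes from three passes through the same two parameters that produced `h1`; only the compute changes, not the model size.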