Definition
A model that mixes different architectural pieces, like attention and state-space layers together.
A sequence model that combines attention layers with state-space or linear recurrent layers, trading off expressive retrieval against constant-state efficiency.
Also called: hybrid SSM/attention models, hybrid models