Glossary · Term

HRM-Text

Definition

A small recurrent language model that mixes fast and slow internal modules.

A recurrent language-modeling architecture built from two modules, one fast and one slow. It combines MagicNorm, PrefixLM attention, truncated backpropagation, and a response-only loss, and reaches Llama/Gemma-class reasoning performance at the 1B scale for roughly $1,500 in training cost.
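Two of the listed ingredients, PrefixLM attention and response-only loss, are easy to picture with a toy example. The sketch below is a generic illustration of those two ideas, not HRM-Text's actual implementation: the function names, the `prefix_len` parameter, and the list-based representation are all assumptions made for illustration. In a PrefixLM mask, prompt (prefix) positions are visible to every position, while response positions are only visible causally. A response-only loss averages per-token losses over the response positions and ignores the prompt.

```python
def prefix_lm_mask(seq_len, prefix_len):
    """Build a PrefixLM attention mask (hypothetical helper).

    mask[i][j] is True when position i may attend to position j.
    Prefix positions (j < prefix_len) are visible to everyone;
    all other positions are visible only causally (j <= i).
    """
    return [
        [j < prefix_len or j <= i for j in range(seq_len)]
        for i in range(seq_len)
    ]


def response_only_loss(token_losses, prefix_len):
    """Average per-token losses over response positions only (hypothetical helper).

    token_losses[t] is assumed to be the loss for predicting token t;
    the first prefix_len entries belong to the prompt and are ignored.
    """
    response = token_losses[prefix_len:]
    if not response:
        raise ValueError("sequence has no response tokens")
    return sum(response) / len(response)


if __name__ == "__main__":
    # 2 prompt tokens followed by 2 response tokens.
    for row in prefix_lm_mask(seq_len=4, prefix_len=2):
        print(["x" if allowed else "." for allowed in row])
    print(response_only_loss([5.0, 5.0, 1.0, 3.0], prefix_len=2))
```

With a two-token prompt, the first prompt token can attend to the second one (bidirectional within the prefix), but no position can attend to a later response token.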

Mentioned in 1 episode

  1. 074
    How a Fifteen-Hundred-Dollar Training Run Matched Llama and Gemma on Reasoning