Definition
A model's internal working memory at a given moment — the numbers it carries between steps.
The intermediate vector representation maintained by a neural network at each token position or recurrent step, capturing accumulated context.
Also called: hidden states