weights · Glossary · AI Papers: A Deep Dive

Definition

Plain language

The numbers inside a neural network that define what it has learned.

As stated in the literature

The trainable parameters of a neural network, typically the matrices and biases in its layers, learned via gradient descent.

Also called: weight

Why it matters: Whether a model's weights are public or private fundamentally shapes who can use, study, fine-tune, or audit it.

For example, a 7-billion-parameter model's 'weights' are the 7 billion specific numbers that, together with its architecture, define how it responds to every prompt.

Heard on the show

“Second, the linear probe, which reads them in the weakest possible way: multiply by a fixed set of weights, add them up, and output a guess.”

Episode 204 — The Length Estimate Hiding Inside a Word-by-Word Model

Mentioned in 116 episodes

Related concepts

AI & Security Attention Analysis Black-Box Optimization Context Fatigue Generation-Time Specialization In-Context Learning Inference-Time Scaffolding LoRA RAG Reproducibility Transformer Attention Weight Exfiltration

Related terms

gradient neural network parameter