Definition
The dominant neural network design behind modern language models, built around attention.
The architecture introduced in 2017 combining self-attention with feedforward blocks and residual connections, underlying nearly all current frontier LLMs.
Also called: transformers, Transformer