Definition
How much text a model can pay attention to at one time.
The maximum sequence length, measured in tokens, that a transformer can attend over in a single forward pass, bounded by the model architecture and by available memory.
Also called: context windows
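Example (illustrative sketch): the snippet below shows two consequences of the definition, under stated assumptions. Input longer than the limit must be cut down before the forward pass, and in a naive implementation of full self-attention the score matrix for one head has one entry per pair of tokens, so it grows with the square of the sequence length. The helper names `fit_to_window` and `attention_score_entries`, and the limit of 4 tokens, are hypothetical and chosen only for illustration. Keeping the most recent tokens is one possible truncation policy, not the only one.

```python
def fit_to_window(token_ids, max_len):
    """Keep only the most recent max_len tokens (hypothetical helper)."""
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    return token_ids[-max_len:]


def attention_score_entries(seq_len):
    """Entries in one head's full attention score matrix: seq_len x seq_len."""
    return seq_len * seq_len


tokens = list(range(10))            # stand-in for 10 token ids
window = fit_to_window(tokens, 4)   # assume the model's limit is 4 tokens
print(window)                       # [6, 7, 8, 9]
print(attention_score_entries(len(window)))  # 16
```

Doubling the window from 4 to 8 tokens raises the score-matrix size from 16 to 64 entries, which is one reason memory bounds the usable length.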