Definition
An index that uses geometry to skip over irrelevant past tokens during attention.
A halfspace-range-searching-based attention sparsification system using bounding-ball pruning across factored subspaces, with completeness guarantees above a threshold and constant-memory index state.