Glossary · Term

TransformerLens

← all terms

Definition

A popular Python library that lets researchers peek inside transformer models layer by layer.

An open-source library providing hooks and tooling for inspecting and intervening on internal activations of transformer language models.

Mentioned in 1 episode

  1. 018
    Language Models Compute the Rational Move, Then Override It