Definition
LLM behavior analysis is the broad project of characterizing what models do across inputs — capabilities, failure modes, biases, persona shifts — treating the model as a black-box object of empirical study. It’s how most safety-relevant claims about a model actually get grounded.
Episodes covering this
Worth reading next
Papers we haven't done a deep dive on yet, but would recommend on this topic.