Definition
A score for how surprised a model is by text it's reading — lower means it expected what it saw.
The exponential of average per-token cross-entropy under a model, a standard measure of language modeling quality where lower is better.
A score for how surprised a model is by text it's reading — lower means it expected what it saw.
The exponential of average per-token cross-entropy under a model, a standard measure of language modeling quality where lower is better.