Definition
A single score that ranks AI models by how well they do across many standard tests.
An aggregate capability metric maintained by Epoch AI combining performance on benchmarks like MMLU and GPQA, used as the capability axis in recent inverse-scaling analyses of LLM forecasting.
Also called: ECI