Definition
How likely a model is to spontaneously do something, separate from whether it has the ability.
A model's tendency to exhibit a behavior under default conditions, distinguished from capability, which is what the model could do if pushed.
How likely a model is to spontaneously do something, separate from whether it has the ability.
A model's tendency to exhibit a behavior under default conditions, distinguished from capability, which is what the model could do if pushed.