Definition
Whether a model is able to do something at all, given the right prompting or setup.
The maximum performance a model can reach on a task under favorable conditions, contrasted with propensity to do it spontaneously.
Mentioned in 40 episodes
Related concepts
Agent Scaffolding
Agentic Vuln Discovery
AI Efficiency & Cost
Capability Elicitation
Capability vs. Efficiency
Capability vs. Propensity
Creation-Audit Loop
Exploit Generation
GDP-Weighted Evaluation
Inference-Time Scaffolding
Knowledge Distillation
Math Benchmarks
Recursive Agent Optimization
Sandbagging
Seed-and-Amplify
Step Amplification Factor
Structural Transfer
Test-Time Compute
Training Methods
Web Agents
Workflow Search