CAPE: Capability Achievement via Policy Execution
David Ball

TL;DR
CAPE introduces a systematic approach to convert explicit requirements into executable specifications for AI models, significantly reducing violations and costs by operationalizing a cycle of specification, verification, correction, and training.
Contribution
This paper presents CAPE, a novel protocol for capability engineering that formalizes requirement enforcement in AI models, supported by empirical findings and a new evaluation benchmark.
Findings
Verification accuracy scales with model size (r=0.94).
CAPE reduces violation rates by 81% across six domains.
Cost and timeline reductions of 5-20 times compared to traditional annotation.
Abstract
Modern AI systems lack a way to express and enforce requirements. Pre-training produces intelligence, and post-training optimizes preferences, but neither guarantees that models reliably satisfy explicit, context-dependent constraints. This missing abstraction explains why highly intelligent models routinely fail in deployment despite strong benchmark performance. We introduce Capability Engineering, the systematic practice of converting requirements into executable specifications and training models to satisfy them by default. We operationalize this practice through CAPE (Capability Achievement via Policy Execution), a protocol implementing a Specify -> Verify -> Correct -> Train loop. CAPE is grounded in two empirical findings: (1) contextual objectivity, where properties appearing subjective become objective once context is fixed (inter-annotator agreement rises from kappa = 0.42…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Scientific Computing and Data Management
