CAPE: Capability Achievement via Policy Execution

David Ball

arXiv:2512.14761·cs.SE·December 18, 2025

CAPE: Capability Achievement via Policy Execution

David Ball

PDF

Open Access 1 Video

TL;DR

CAPE introduces a systematic approach to convert explicit requirements into executable specifications for AI models, significantly reducing violations and costs by operationalizing a cycle of specification, verification, correction, and training.

Contribution

This paper presents CAPE, a novel protocol for capability engineering that formalizes requirement enforcement in AI models, supported by empirical findings and a new evaluation benchmark.

Findings

01

Verification accuracy scales with model size (r=0.94).

02

CAPE reduces violation rates by 81% across six domains.

03

Cost and timeline reductions of 5-20 times compared to traditional annotation.

Abstract

Modern AI systems lack a way to express and enforce requirements. Pre-training produces intelligence, and post-training optimizes preferences, but neither guarantees that models reliably satisfy explicit, context-dependent constraints. This missing abstraction explains why highly intelligent models routinely fail in deployment despite strong benchmark performance. We introduce Capability Engineering, the systematic practice of converting requirements into executable specifications and training models to satisfy them by default. We operationalize this practice through CAPE (Capability Achievement via Policy Execution), a protocol implementing a Specify -> Verify -> Correct -> Train loop. CAPE is grounded in two empirical findings: (1) contextual objectivity, where properties appearing subjective become objective once context is fixed (inter-annotator agreement rises from kappa = 0.42…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

CAPE: Capability Achievement via Policy Execution· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Scientific Computing and Data Management