Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents
Ting Liu

TL;DR
This paper introduces a framework for designing contractual skills in enterprise AI agents, enhancing transparency, checkability, and governance of task execution and safety.
Contribution
It proposes a GovernSpec-inspired design framework for organizing skills as readable task contracts, improving maintainability and governance in enterprise AI systems.
Findings
Contractual skills outperform minimal baselines across models.
They mainly improve checkability and maintainability, not raw quality.
Skills reduce high-risk tool attempts but do not eliminate the need for runtime guardrails.
Abstract
Skills are increasingly used to package agent instructions, workflows, scripts, and reference materials. In enterprise settings, however, skills often need to express more than task guidance: they must make goals, input boundaries, permissions, evidence requirements, output contracts, quality criteria, verification steps, human approval points, and handoff rules inspectable. This paper proposes contractual skills, a GovernSpec-inspired design framework for organizing SKILL.md files as readable task contracts while preserving lightweight skill discovery and progressive loading. The framework clarifies the boundary between contractual skills, GovernSpec YAML contracts, Model Context Protocol surfaces, tool adapters, runtime guardrails, tracing, and evaluation systems. We evaluate the framework with two offline experiments. A text-generation study covers three enterprise skills, fifteen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
