Formal Skill: Programmable Runtime Skills for Efficient and Accurate LLM Agents
Xi Zhang, Meijun Gao, Yuntian Zhao, Xinyu Tan, Yilun Yao, Feiyu Wang, Yanshu Wang, Dingsiyi, Tong Yang

TL;DR
This paper introduces Formal Skill, a runtime-native abstraction for reusable, executable skills in LLM agents, enhancing reliability, efficiency, and enforceability in real-world workspace applications.
Contribution
It proposes a novel Formal Skill abstraction with JSON schemas and Python executors, implemented in FairyClaw, improving token efficiency and task performance.
Findings
FairyClaw achieves competitive scores on Harness-Bench.
Formal Skill reduces token usage significantly.
Strong results on tasks emphasizing Formal Skill role.
Abstract
Large Language Model (LLM) agents increasingly act inside real workspaces, where tools and skills determine whether model reasoning becomes reliable action. Existing skills remain largely informal: Markdown skills and instruction packs encode procedures as long natural-language documents, while function calling, Model Context Protocol (MCP) servers, and framework tools structure individual actions but usually leave workflow state, policy enforcement, and completion discipline outside the skill itself. We introduce Formal Skill, a runtime-native abstraction that represents reusable capability with JSON metadata and action schemas, reliable Python executors, hook-governed control logic, Formal Skill routing, and skill-local runtime state. By moving reusable procedure from repeated prompt text into executable state machines and hook policies, Formal Skill gives agents a token-efficient and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
