Semia: Auditing Agent Skills via Constraint-Guided Representation Synthesis
Hongbo Wen, Ying Li, Hanzhi Liu, Chaofan Shou, Yanju Chen, Yuan Tian, and Yu Feng

TL;DR
Semia is a static auditing tool that uses constraint-guided synthesis to verify security properties of agent skills expressed in a hybrid language, identifying critical risks in real-world skills.
Contribution
It introduces CGRS, a novel loop for synthesizing faithful representations of skills, enabling effective security analysis of complex agent capabilities.
Findings
Semia successfully audits 13,728 real-world skills from public marketplaces.
More than half of the skills contain at least one critical semantic security risk.
Semia outperforms existing signature-based and LLM baselines with 97.7% recall and 90.6% F1.
Abstract
An agent skill is a configuration package that equips an LLM-driven agent with a concrete capability, such as reading email, executing shell commands, or signing blockchain transactions. Each skill is a hybrid artifact-a structured half declares executable interfaces, while a prose half dictates when and how those interfaces fire-and the prose is reinterpreted probabilistically on every invocation. Conventional static analyzers parse the structured half but ignore the prose; LLM-based tools read the prose but cannot reproducibly prove that a tainted input reaches a high-impact sink. We present Semia, a static auditor for agent skills. Semia lifts each skill into the Skill Description Language (SDL), a Datalog fact base that captures LLM-triggered actions, prose-defined conditions, and human-in-the-loop checkpoints. Synthesizing a fact base that is both structurally sound and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
