Semia: Auditing Agent Skills via Constraint-Guided Representation Synthesis

Hongbo Wen; Ying Li; Hanzhi Liu; Chaofan Shou; Yanju Chen; Yuan Tian; and Yu Feng

arXiv:2605.00314·cs.CR·May 4, 2026

Semia: Auditing Agent Skills via Constraint-Guided Representation Synthesis

Hongbo Wen, Ying Li, Hanzhi Liu, Chaofan Shou, Yanju Chen, Yuan Tian, and Yu Feng

PDF

TL;DR

Semia is a static auditing tool that uses constraint-guided synthesis to verify security properties of agent skills expressed in a hybrid language, identifying critical risks in real-world skills.

Contribution

It introduces CGRS, a novel loop for synthesizing faithful representations of skills, enabling effective security analysis of complex agent capabilities.

Findings

01

Semia successfully audits 13,728 real-world skills from public marketplaces.

02

More than half of the skills contain at least one critical semantic security risk.

03

Semia outperforms existing signature-based and LLM baselines with 97.7% recall and 90.6% F1.

Abstract

An agent skill is a configuration package that equips an LLM-driven agent with a concrete capability, such as reading email, executing shell commands, or signing blockchain transactions. Each skill is a hybrid artifact-a structured half declares executable interfaces, while a prose half dictates when and how those interfaces fire-and the prose is reinterpreted probabilistically on every invocation. Conventional static analyzers parse the structured half but ignore the prose; LLM-based tools read the prose but cannot reproducibly prove that a tainted input reaches a high-impact sink. We present Semia, a static auditor for agent skills. Semia lifts each skill into the Skill Description Language (SDL), a Datalog fact base that captures LLM-triggered actions, prose-defined conditions, and human-in-the-loop checkpoints. Synthesizing a fact base that is both structurally sound and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.