Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale

Yi Liu; Weizhe Wang; Ruitao Feng; Yao Zhang; Guangquan Xu; Gelei Deng; Yuekang Li; and Leo Zhang

arXiv:2601.10338·cs.CR·January 16, 2026

Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale

Yi Liu, Weizhe Wang, Ruitao Feng, Yao Zhang, Guangquan Xu, Gelei Deng, Yuekang Li, and Leo Zhang

PDF

Open Access

TL;DR

This study empirically analyzes security vulnerabilities in AI agent skills at scale, revealing widespread risks and proposing detection methods, highlighting the need for improved security measures in AI ecosystems.

Contribution

It provides the first large-scale empirical security analysis of agent skills, introduces a vulnerability taxonomy, and develops a validated detection framework with an open dataset.

Findings

01

26.1% of skills contain vulnerabilities

02

Data exfiltration and privilege escalation are most common

03

Skills with executable scripts are 2.12x more likely to be vulnerable

Abstract

The rise of AI agent frameworks has introduced agent skills, modular packages containing instructions and executable code that dynamically extend agent capabilities. While this architecture enables powerful customization, skills execute with implicit trust and minimal vetting, creating a significant yet uncharacterized attack surface. We conduct the first large-scale empirical security analysis of this emerging ecosystem, collecting 42,447 skills from two major marketplaces and systematically analyzing 31,132 using SkillScan, a multi-stage detection framework integrating static analysis with LLM-based semantic classification. Our findings reveal pervasive security risks: 26.1% of skills contain at least one vulnerability, spanning 14 distinct patterns across four categories: prompt injection, data exfiltration, privilege escalation, and supply chain risks. Data exfiltration (13.3%) and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Information and Cyber Security · Security and Verification in Computing