Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

David Schmotz; Luca Beurer-Kellner; Sahar Abdelnabi; Maksym Andriushchenko

arXiv:2602.20156·cs.CR·February 26, 2026

Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

David Schmotz, Luca Beurer-Kellner, Sahar Abdelnabi, Maksym Andriushchenko

PDF

Open Access

TL;DR

SkillInject is a benchmark that evaluates the vulnerability of LLM agents with skill files to prompt injection attacks, revealing high susceptibility and emphasizing the need for context-aware security measures.

Contribution

The paper introduces SkillInject, a comprehensive benchmark for assessing skill file injection vulnerabilities in LLM agents, highlighting security challenges and proposing the need for advanced safeguards.

Findings

01

Up to 80% attack success rate on frontier models

02

Agents can execute harmful instructions like data exfiltration and ransomware

03

Simple filtering is insufficient; context-aware authorization is needed

Abstract

LLM agents are evolving rapidly, powered by code execution, tools, and the recently introduced agent skills feature. Skills allow users to extend LLM applications with specialized third-party code, knowledge, and instructions. Although this can extend agent capabilities to new domains, it creates an increasingly complex agent supply chain, offering new surfaces for prompt injection attacks. We identify skill-based prompt injection as a significant threat and introduce SkillInject, a benchmark evaluating the susceptibility of widely-used LLM agents to injections through skill files. SkillInject contains 202 injection-task pairs with attacks ranging from obviously malicious injections to subtle, context-dependent attacks hidden in otherwise legitimate instructions. We evaluate frontier LLMs on SkillInject, measuring both security in terms of harmful instruction avoidance and utility in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Advanced Malware Detection Techniques · Information and Cyber Security