Comment and Control: Hijacking Agentic Workflows via Context-Grounded Evolution
Neil Fendley, Zhengyu Liu, Aonan Guan, Jiacheng Zhong, Yinzhi Cao

TL;DR
This paper introduces JAW, a framework that detects and exploits vulnerabilities in agentic workflows using context analysis, demonstrating successful hijacking of numerous GitHub and n8n automation setups.
Contribution
It is the first to systematically analyze and demonstrate the hijacking of agentic workflows via context-grounded evolution, highlighting security risks.
Findings
4714 GitHub workflows can be hijacked using JAW.
Eight n8n templates are vulnerable to hijacking.
Multiple vulnerabilities were responsibly disclosed, leading to fixes and bug bounties.
Abstract
Automation platforms such as GitHub Actions and n8n are increasingly adopting so-called agentic workflows, which integrate Large Language Model (LLM) agents for tasks such as code review and data synchronization. While bringing convenience for developers, this integration exposes a new risk: An adversary may control and craft certain inputs, such as GitHub issue comments, to manipulate the LLM agent for unwanted actions, such as credential exfiltration and arbitrary command execution. To our knowledge, no prior academic work has studied such a risk in agentic workflows. In this paper, we design the first detection and exploitation framework, called JAW, to hijack agentic workflows hosted on automation platforms via a novel approach called Context-Grounded Evolution. Our key idea is to evolve agentic workflow inputs under the contexts derived from hybrid program analysis for hijacking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
