Ghost in the Agent: Redefining Information Flow Tracking for LLM Agents
Yuandao Cai, Wensheng Tang, Cheng Wen, Shengchao Qin

TL;DR
NeuroTaint is a novel framework for tracking information flow in LLM agents, addressing the limitations of traditional taint analysis by considering semantic and causal influences.
Contribution
We introduce NeuroTaint, the first taint tracking system tailored for LLM agents that captures semantic transformations and causal influences in information flow.
Findings
NeuroTaint outperforms FIDES in source-sink detection across 400 scenarios.
It remains effective on established security benchmarks like InjecAgent and ToolEmu.
Operates efficiently offline with modest auditing costs.
Abstract
Autonomous Large Language Model (LLM) agents are increasingly deployed to conduct complex tasks by interacting with external tools, APIs, and memory stores. However, processing untrusted external data exposes these agents to severe security threats, such as indirect prompt injection and unauthorized tool execution. Securing these systems requires effective information flow tracking. Yet, traditional taint analysis that is designed for program memory states fundamentally fails when applied to LLMs, where data propagation is governed by probabilistic natural language reasoning. In this paper, we present NeuroTaint, the first comprehensive taint tracking framework tailored for the unique information flow characteristics of LLM agents. Our key insight is that taint propagation in LLM agents must be understood not only as explicit content transfer, but also as semantic transformation, causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
