DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents
Hao Li, Xiaogeng Liu, Hung-Chun Chiu, Dianqi Li, Ning Zhang, Chaowei Xiao

TL;DR
DRIFT is a dynamic, rule-based framework that enhances the security of LLM agents by enforcing adaptable policies and isolating injection risks, effectively defending against prompt injection attacks while preserving utility.
Contribution
This paper introduces DRIFT, a novel framework that dynamically updates security policies and isolates injection threats, addressing limitations of static defenses in securing LLM agents.
Findings
DRIFT effectively defends against prompt injection attacks.
Maintains high utility across diverse models.
Demonstrates robustness and adaptability in empirical evaluations.
Abstract
Large Language Models (LLMs) are increasingly central to agentic systems due to their strong reasoning and planning capabilities. By interacting with external environments through predefined tools, these agents can carry out complex user tasks. Nonetheless, this interaction also introduces the risk of prompt injection attacks, where malicious inputs from external sources can mislead the agent's behavior, potentially resulting in economic loss, privacy leakage, or system compromise. System-level defenses have recently shown promise by enforcing static or predefined policies, but they still face two key challenges: the ability to dynamically update security rules and the need for memory stream isolation. To address these challenges, we propose Dynamic Rule-based Isolation Framework for Trustworthy agentic systems (DRIFT), which enforces the dynamic security policy and injection isolation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Security and Resilience · Security and Verification in Computing · Network Security and Intrusion Detection
