PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization

Runpeng Geng; Yanting Wang; Chenlong Yin; Minhao Cheng; Ying Chen; Jinyuan Jia

arXiv:2511.10720·cs.CR·November 17, 2025

PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization

Runpeng Geng, Yanting Wang, Chenlong Yin, Minhao Cheng, Ying Chen, Jinyuan Jia

PDF

Open Access

TL;DR

PISanitizer is a novel method that detects and sanitizes prompt injections in long-context LLMs by leveraging attention mechanisms, effectively preventing attacks while preserving utility and outperforming existing defenses.

Contribution

This work introduces PISanitizer, a new prompt sanitization technique specifically designed for long-context LLMs, addressing limitations of prior short-context defenses.

Findings

01

Successfully prevents prompt injection attacks in long-context LLMs

02

Maintains high utility of LLM outputs after sanitization

03

Outperforms existing prompt injection defenses in effectiveness and efficiency

Abstract

Long context LLMs are vulnerable to prompt injection, where an attacker can inject an instruction in a long context to induce an LLM to generate an attacker-desired output. Existing prompt injection defenses are designed for short contexts. When extended to long-context scenarios, they have limited effectiveness. The reason is that an injected instruction constitutes only a very small portion of a long context, making the defense very challenging. In this work, we propose PISanitizer, which first pinpoints and sanitizes potential injected tokens (if any) in a context before letting a backend LLM generate a response, thereby eliminating the influence of the injected instruction. To sanitize injected tokens, PISanitizer builds on two observations: (1) prompt injection attacks essentially craft an instruction that compels an LLM to follow it, and (2) LLMs intrinsically leverage the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques