InfoFlood: Jailbreaking Large Language Models with Information Overload
Advait Yadav, Haibo Jin, Man Luo, Jun Zhuang, Haohan Wang

TL;DR
This paper introduces InfoFlood, a novel attack that exploits a new vulnerability in LLMs where excessive linguistic complexity can bypass safety mechanisms, revealing a significant weakness in current AI safety defenses.
Contribution
The paper identifies a new vulnerability called Information Overload and proposes InfoFlood, an effective method to bypass safety measures by transforming malicious prompts into complex, overloaded queries.
Findings
InfoFlood achieves up to 3x higher success rates than baseline attacks.
Common safety defenses fail to prevent InfoFlood attacks.
Information Overload significantly weakens LLM safety mechanisms.
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains. However, their potential to generate harmful responses has raised significant societal and regulatory concerns, especially when manipulated by adversarial techniques known as "jailbreak" attacks. Existing jailbreak methods typically involve appending carefully crafted prefixes or suffixes to malicious prompts in order to bypass the built-in safety mechanisms of these models. In this work, we identify a new vulnerability in which excessive linguistic complexity can disrupt built-in safety mechanisms-without the need for any added prefixes or suffixes-allowing attackers to elicit harmful outputs directly. We refer to this phenomenon as Information Overload. To automatically exploit this vulnerability, we propose InfoFlood, a jailbreak attack that transforms malicious queries into complex,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital and Cyber Forensics · Privacy-Preserving Technologies in Data · Artificial Intelligence in Law
MethodsLLaMA
