InfoFlood: Jailbreaking Large Language Models with Information Overload

Advait Yadav; Haibo Jin; Man Luo; Jun Zhuang; Haohan Wang

arXiv:2506.12274·cs.CR·June 17, 2025

InfoFlood: Jailbreaking Large Language Models with Information Overload

Advait Yadav, Haibo Jin, Man Luo, Jun Zhuang, Haohan Wang

PDF

Open Access

TL;DR

This paper introduces InfoFlood, a novel attack that exploits a new vulnerability in LLMs where excessive linguistic complexity can bypass safety mechanisms, revealing a significant weakness in current AI safety defenses.

Contribution

The paper identifies a new vulnerability called Information Overload and proposes InfoFlood, an effective method to bypass safety measures by transforming malicious prompts into complex, overloaded queries.

Findings

01

InfoFlood achieves up to 3x higher success rates than baseline attacks.

02

Common safety defenses fail to prevent InfoFlood attacks.

03

Information Overload significantly weakens LLM safety mechanisms.

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains. However, their potential to generate harmful responses has raised significant societal and regulatory concerns, especially when manipulated by adversarial techniques known as "jailbreak" attacks. Existing jailbreak methods typically involve appending carefully crafted prefixes or suffixes to malicious prompts in order to bypass the built-in safety mechanisms of these models. In this work, we identify a new vulnerability in which excessive linguistic complexity can disrupt built-in safety mechanisms-without the need for any added prefixes or suffixes-allowing attackers to elicit harmful outputs directly. We refer to this phenomenon as Information Overload. To automatically exploit this vulnerability, we propose InfoFlood, a jailbreak attack that transforms malicious queries into complex,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics · Privacy-Preserving Technologies in Data · Artificial Intelligence in Law

MethodsLLaMA