TL;DR
LingoLoop is an attack method that exploits linguistic cues and structural patterns to trap multimodal large language models in endless, resource-consuming loops, revealing critical vulnerabilities.
Contribution
The paper introduces LingoLoop, a novel attack leveraging POS-awareness and output constraining mechanisms to induce MLLMs into repetitive, verbose loops, exposing their security weaknesses.
Findings
LingoLoop can generate outputs up to 367 times longer than normal.
The attack consistently drives models to their maximum generation limits.
It significantly increases energy consumption during inference.
Abstract
Multimodal Large Language Models (MLLMs) have shown great promise but require substantial computational resources during inference. Attackers can exploit this by inducing excessive output, leading to resource exhaustion and service degradation. Prior energy-latency attacks aim to increase generation time by broadly shifting the output token distribution away from the EOS token, but they neglect the influence of token-level Part-of-Speech (POS) characteristics on EOS and sentence-level structural patterns on output counts, limiting their efficacy. To address this, we propose LingoLoop, an attack designed to induce MLLMs to generate excessively verbose and repetitive sequences. First, we find that the POS tag of a token strongly affects the likelihood of generating an EOS token. Based on this insight, we propose a POS-Aware Delay Mechanism to postpone EOS token generation by adjusting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
