LoopLLM: Transferable Energy-Latency Attacks in LLMs via Repetitive Generation

Xingyu Li; Xiaolei Liu; Cheng Liu; Yixiao Xu; Kangyi Ding; Bangzhou Xin; Jia-Li Yin

arXiv:2511.07876·cs.CR·November 12, 2025

LoopLLM: Transferable Energy-Latency Attacks in LLMs via Repetitive Generation

Xingyu Li, Xiaolei Liu, Cheng Liu, Yixiao Xu, Kangyi Ding, Bangzhou Xin, Jia-Li Yin

PDF

Open Access 1 Video

TL;DR

LoopLLM is a novel attack framework that exploits repetitive generation in large language models to induce high energy and latency costs, demonstrating high transferability and effectiveness across multiple models.

Contribution

It introduces a new attack method leveraging low-entropy decoding loops and gradient aggregation for improved transferability and effectiveness.

Findings

01

Achieves over 90% of maximum output length in attacks.

02

Outperforms existing methods by a significant margin.

03

Improves transferability to commercial LLMs by around 40%.

Abstract

As large language models (LLMs) scale, their inference incurs substantial computational resources, exposing them to energy-latency attacks, where crafted prompts induce high energy and latency cost. Existing attack methods aim to prolong output by delaying the generation of termination symbols. However, as the output grows longer, controlling the termination symbols through input becomes difficult, making these methods less effective. Therefore, we propose LoopLLM, an energy-latency attack framework based on the observation that repetitive generation can trigger low-entropy decoding loops, reliably compelling LLMs to generate until their output limits. LoopLLM introduces (1) a repetition-inducing prompt optimization that exploits autoregressive vulnerabilities to induce repetitive generation, and (2) a token-aligned ensemble optimization that aggregates gradients to improve cross-model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

LoopLLM: Transferable Energy-Latency Attacks in LLMs via Repetitive Generation· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Topic Modeling