Prompt-Induced Over-Generation as Denial-of-Service: A Black-Box Attack-Side Benchmark

Manu; Yi Guo; Kanchana Thilakarathna; Nirhoshan Sivaroopan; Jo Plested; Tim Lynar; Jack Yang; Wangli Yang

arXiv:2512.23779·cs.CR·January 21, 2026

Prompt-Induced Over-Generation as Denial-of-Service: A Black-Box Attack-Side Benchmark

Manu, Yi Guo, Kanchana Thilakarathna, Nirhoshan Sivaroopan, Jo Plested, Tim Lynar, Jack Yang, Wangli Yang

PDF

Open Access

TL;DR

This paper introduces a black-box benchmark for prompt-based over-generation attacks on large language models, comparing two novel attack methods that induce denial-of-service conditions by causing excessive token generation.

Contribution

It presents a new benchmark and evaluates two prompt-only attack algorithms, EOGen and RL-GOAL, for black-box over-generation attacks on LLMs, highlighting their effectiveness.

Findings

01

EOGen achieves an OGF of 1.39 with 25.2% success rate.

02

RL-GOAL nearly doubles severity with an OGF of 2.70 and 64.3% success.

03

RL-GOAL causes non-termination in 46% of trials.

Abstract

Large Language Models (LLMs) can be driven into over-generation, emitting thousands of tokens before producing an end-of-sequence (EOS) token. This degrades answer quality, inflates latency and cost, and can be weaponized as a denial-of-service (DoS) attack. Recent work has begun to study DoS-style prompt attacks, but typically focuses on a single attack algorithm or assumes white-box access, without an attack-side benchmark that compares prompt-based attackers in a black-box, query-only regime with a known tokenizer. We introduce such a benchmark and study two prompt-only attackers. The first is an Evolutionary Over-Generation Prompt Search (EOGen) that searches the token space for prefixes that suppress EOS and induce long continuations. The second is a goal-conditioned reinforcement learning attacker (RL-GOAL) that trains a network to generate prefixes conditioned on a target length.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Network Security and Intrusion Detection