Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models

Shuqiang Wang; Wei Cao; Jiaqi Weng; Jialing Tao; Licheng Pan; Hui Xue; Zhixuan Chu

arXiv:2605.13338·cs.CR·May 15, 2026

Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models

Shuqiang Wang, Wei Cao, Jiaqi Weng, Jialing Tao, Licheng Pan, Hui Xue, Zhixuan Chu

PDF

TL;DR

This paper presents a hierarchical genetic algorithm-based attack that induces excessive reasoning in large language models, exposing a new vulnerability related to resource exhaustion and denial-of-service risks.

Contribution

It introduces an automated black-box framework using hierarchical genetic algorithms to systematically induce overthinking in large reasoning models, significantly increasing their output length.

Findings

01

Achieves up to 26.1x increase in output length on the MATH benchmark.

02

Demonstrates high transferability of adversarial inputs across different models.

03

Outperforms baseline methods in inducing overthinking behaviors.

Abstract

Large Reasoning Models (LRMs) are increasingly integrated into systems requiring reliable multi-step inference, yet this growing dependence exposes new vulnerabilities related to computational availability. In particular, LRMs exhibit a tendency to "overthink", producing excessively long and redundant reasoning traces, when confronted with incomplete or logically inconsistent inputs. This behavior significantly increases inference latency and energy consumption, forming a potential vector for denial-of-service (DoS) style resource exhaustion. In this work, we investigate this attack surface and propose an automated black-box framework that induces overthinking in LRMs by systematically perturbing the logical structure of input problems. Our method employs a hierarchical genetic algorithm (HGA) operating on structured problem decompositions, and optimizes a composite fitness function…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.