TL;DR
OTora is a novel two-stage red-teaming framework that demonstrates how to induce reasoning-level denial-of-service attacks on LLM agents, significantly increasing latency while maintaining task accuracy.
Contribution
It introduces OTora, the first unified framework for R-DoS attacks on LLM agents, combining adversarial trigger optimization and reasoning payload generation.
Findings
OTora achieves up to 10x increase in reasoning tokens.
Latency can be increased by an order of magnitude.
Task accuracy remains near baseline despite attacks.
Abstract
Large Language Models (LLMs) are increasingly deployed as autonomous agents that execute tool-augmented, multi-step tasks, where latency is a critical factor for real-world applications. Yet an overlooked threat is Reasoning-Level Denial-of-Service (R-DoS), in which an attacker preserves task correctness but degrades availability by inflating an agent's reasoning depth or tool-use budget. We introduce OTora, the first unified, two-stage red-teaming framework for instantiating R-DoS attacks. Stage I optimizes an adversarial trigger that induces targeted tool invocations using insertion-aware scoring and dynamic target co-evolution, supporting both black-box and white-box settings. Stage II generates agent-aware reasoning payloads via an ICL-guided genetic search that amplifies overthinking while maintaining correct task outcomes. Across WebShop, Email, and OS agents built on multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
