SHIELD: An Auto-Healing Agentic Defense Framework for LLM Resource Exhaustion Attacks

Nirhoshan Sivaroopan; Kanchana Thilakarathna; Albert Zomaya; Manu; Yi Guo; Jo Plested; Tim Lynar; Jack Yang; Wangli Yang

arXiv:2601.19174·cs.CR·January 28, 2026

SHIELD: An Auto-Healing Agentic Defense Framework for LLM Resource Exhaustion Attacks

Nirhoshan Sivaroopan, Kanchana Thilakarathna, Albert Zomaya, Manu, Yi Guo, Jo Plested, Tim Lynar, Jack Yang, Wangli Yang

PDF

Open Access

TL;DR

SHIELD is a multi-agent, auto-healing framework that enhances LLM defenses against resource exhaustion attacks by combining semantic retrieval, pattern matching, and self-updating mechanisms, outperforming existing methods.

Contribution

The paper introduces SHIELD, a novel multi-agent auto-healing defense system for LLMs that adapts to evolving sponge attacks through self-updating and refinement.

Findings

01

SHIELD achieves high F1 scores against semantic sponge attacks.

02

It outperforms traditional perplexity-based defenses.

03

The system effectively adapts to evolving attack strategies.

Abstract

Sponge attacks increasingly threaten LLM systems by inducing excessive computation and DoS. Existing defenses either rely on statistical filters that fail on semantically meaningful attacks or use static LLM-based detectors that struggle to adapt as attack strategies evolve. We introduce SHIELD, a multi-agent, auto-healing defense framework centered on a three-stage Defense Agent that integrates semantic similarity retrieval, pattern matching, and LLM-based reasoning. Two auxiliary agents, a Knowledge Updating Agent and a Prompt Optimization Agent, form a closed self-healing loop, when an attack bypasses detection, the system updates an evolving knowledgebase, and refines defense instructions. Extensive experiments show that SHIELD consistently outperforms perplexity-based and standalone LLM defenses, achieving high F1 scores across both non-semantic and semantic sponge attacks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Advanced Malware Detection Techniques