CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems   Based on Large Language Models

Zhenhong Zhou; Zherui Li; Jie Zhang; Yuanhe Zhang; Kun Wang; Yang Liu,; Qing Guo

arXiv:2502.14529·cs.CL·February 21, 2025

CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models

Zhenhong Zhou, Zherui Li, Jie Zhang, Yuanhe Zhang, Kun Wang, Yang Liu,, Qing Guo

PDF

Open Access 1 Repo

TL;DR

This paper introduces Corba, a novel attack on LLM-based multi-agent systems that propagates and exhausts resources, revealing security vulnerabilities even in systems with safety measures.

Contribution

The paper presents Corba, a simple yet effective recursive blocking attack exploiting contagion and resource depletion in LLM-MASs, highlighting new security challenges.

Findings

01

Corba effectively disrupts interactions in LLM-MASs across various topologies.

02

Corba can deplete computational resources in complex multi-agent environments.

03

The attack is successful on both commercial and open-source LLM models.

Abstract

Large Language Model-based Multi-Agent Systems (LLM-MASs) have demonstrated remarkable real-world capabilities, effectively collaborating to complete complex tasks. While these systems are designed with safety mechanisms, such as rejecting harmful instructions through alignment, their security remains largely unexplored. This gap leaves LLM-MASs vulnerable to targeted disruptions. In this paper, we introduce Contagious Recursive Blocking Attacks (Corba), a novel and simple yet highly effective attack that disrupts interactions between agents within an LLM-MAS. Corba leverages two key properties: its contagious nature allows it to propagate across arbitrary network topologies, while its recursive property enables sustained depletion of computational resources. Notably, these blocking attacks often involve seemingly benign instructions, making them particularly challenging to mitigate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhrli324/corba
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection