Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent   Communities

Tianjie Ju; Yiting Wang; Xinbei Ma; Pengzhou Cheng; Haodong Zhao,; Yulong Wang; Lifeng Liu; Jian Xie; Zhuosheng Zhang; Gongshen Liu

arXiv:2407.07791·cs.CL·July 24, 2024·3 cites

Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

Tianjie Ju, Yiting Wang, Xinbei Ma, Pengzhou Cheng, Haodong Zhao,, Yulong Wang, Lifeng Liu, Jian Xie, Zhuosheng Zhang, Gongshen Liu

PDF

Open Access 1 Repo

TL;DR

This paper uncovers security vulnerabilities in LLM-based multi-agent systems, demonstrating how manipulated knowledge can spread covertly and persist, posing significant risks that require new defense strategies.

Contribution

It introduces a novel two-stage attack method exploiting LLM vulnerabilities to spread manipulated knowledge without prompt tampering, supported by extensive experiments.

Findings

01

Successful manipulation of knowledge spread without degrading agent performance

02

Manipulated knowledge persists through retrieval-augmented frameworks

03

Highlights need for robust defenses like guardian agents and fact-checkers

Abstract

The rapid adoption of large language models (LLMs) in multi-agent systems has highlighted their impressive capabilities in various applications, such as collaborative problem-solving and autonomous negotiation. However, the security implications of these LLM-based multi-agent systems have not been thoroughly investigated, particularly concerning the spread of manipulated knowledge. In this paper, we investigate this critical issue by constructing a detailed threat model and a comprehensive simulation environment that mirrors real-world multi-agent deployments in a trusted platform. Subsequently, we propose a novel two-stage attack method involving Persuasiveness Injection and Manipulated Knowledge Injection to systematically explore the potential for manipulated knowledge (i.e., counterfactual and toxic knowledge) spread without explicit prompt manipulation. Our method leverages the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Jometeorie/KnowledgeSpread
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsService-Oriented Architecture and Web Services · Semantic Web and Ontologies · Distributed and Parallel Computing Systems