CuDA2: An approach for Incorporating Traitor Agents into Cooperative   Multi-Agent Systems

Zhen Chen; Yong Liao; Youpeng Zhao; Zipeng Dai; Jian Zhao

arXiv:2406.17425·cs.LG·June 26, 2024

CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems

Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao

PDF

Open Access

TL;DR

This paper introduces CuDA2, a novel framework for adversarial attacks on cooperative multi-agent systems by injecting traitor agents trained with curiosity-driven exploration, effectively disrupting victim agents' policies.

Contribution

The paper proposes a new method to inject traitor agents into CMARL systems using a TMDP model and enhances attack efficiency with curiosity-driven exploration via RND.

Findings

01

CuDA2 achieves comparable or superior attack success rates.

02

Traitor agents effectively influence victim agents' formation and policies.

03

The framework maintains traitor policy invariance while increasing attack aggressiveness.

Abstract

Cooperative Multi-Agent Reinforcement Learning (CMARL) strategies are well known to be vulnerable to adversarial perturbations. Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks. However, gaining complete access to victim agents in real-world environments is exceedingly difficult. To create more realistic adversarial attacks, we introduce a novel method that involves injecting traitor agents into the CMARL system. We model this problem as a Traitor Markov Decision Process (TMDP), where traitors cannot directly attack the victim agents but can influence their formation or positioning through collisions. In TMDP, traitors are trained using the same MARL algorithm as the victim agents, with their reward function set as the negative of the victim…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Semantic Web and Ontologies

MethodsSparse Evolutionary Training