CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems
Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao

TL;DR
This paper introduces CuDA2, a novel framework for adversarial attacks on cooperative multi-agent systems by injecting traitor agents trained with curiosity-driven exploration, effectively disrupting victim agents' policies.
Contribution
The paper proposes a new method to inject traitor agents into CMARL systems using a TMDP model and enhances attack efficiency with curiosity-driven exploration via RND.
Findings
CuDA2 achieves comparable or superior attack success rates.
Traitor agents effectively influence victim agents' formation and policies.
The framework maintains traitor policy invariance while increasing attack aggressiveness.
Abstract
Cooperative Multi-Agent Reinforcement Learning (CMARL) strategies are well known to be vulnerable to adversarial perturbations. Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks. However, gaining complete access to victim agents in real-world environments is exceedingly difficult. To create more realistic adversarial attacks, we introduce a novel method that involves injecting traitor agents into the CMARL system. We model this problem as a Traitor Markov Decision Process (TMDP), where traitors cannot directly attack the victim agents but can influence their formation or positioning through collisions. In TMDP, traitors are trained using the same MARL algorithm as the victim agents, with their reward function set as the negative of the victim…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Semantic Web and Ontologies
MethodsSparse Evolutionary Training
