Finding the Weakest Link: Adversarial Attack against Multi-Agent Communications
Maxwell Standen, Junae Kim, Claudia Szabo

TL;DR
This paper presents gradient-based adversarial attack methods targeting multi-agent communication systems, demonstrating their effectiveness in various navigation environments.
Contribution
It introduces novel gradient-informed attack techniques and loss functions that improve attack success and impact on multi-agent communication systems.
Findings
Gradient-based message selection outperforms random selection in impact.
Proposed methods increase attack effectiveness in multiple scenarios.
Certain attack configurations significantly disrupt multi-agent coordination.
Abstract
Multi-agent systems rely on communication for information sharing and action coordination, which exposes a vulnerability to attacks. We investigate single-victim communication perturbation attacks against Multi-Agent Reinforcement Learning-trained systems and propose methods that use gradient information from the Jacobian to identify which messages, agent, and timesteps are most susceptible to attack and have the greatest impact on the system. We enhance these methods with two proposed adversarial loss functions that trade-off attack success for attack impact which also create more effective perturbations. We empirically demonstrate the effectiveness of our methods against two different multi-agent communication methods in navigation, PredatorPrey, and TrafficJunction environments. Our results show that our novel message selection method achieves a similar or greater impact than random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
