Adversarial Attacks in Cooperative AI
Ted Fujimoto, Arthur Paul Pedersen

TL;DR
This paper reveals that popular cooperative AI algorithms, inspired by social intelligence, are vulnerable to adversarial attacks, highlighting the need for more robust methods in multi-agent cooperation.
Contribution
It demonstrates that existing cooperative AI algorithms are susceptible to adversarial attacks, exposing a critical weakness not previously addressed.
Findings
Three cooperative AI algorithms are vulnerable to adversarial attacks.
Experimental results show how vulnerabilities can be exploited in practice.
Highlights the need for developing more robust cooperative AI methods.
Abstract
Single-agent reinforcement learning algorithms in a multi-agent environment are inadequate for fostering cooperation. If intelligent agents are to interact and work together to solve complex problems, methods that counter non-cooperative behavior are needed to facilitate the training of multiple agents. This is the goal of cooperative AI. Recent research in adversarial machine learning, however, shows that models (e.g., image classifiers) can be easily deceived into making inferior decisions. Meanwhile, an important line of research in cooperative AI has focused on introducing algorithmic improvements that accelerate learning of optimally cooperative behavior. We argue that prominent methods of cooperative AI are exposed to weaknesses analogous to those studied in prior machine learning research. More specifically, we show that three algorithms inspired by human-like social intelligence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Evolutionary Game Theory and Cooperation · Reinforcement Learning in Robotics
