SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent   Reinforcement Learning Systems

Oubo Ma; Yuwen Pu; Linkang Du; Yang Dai; Ruo Wang; Xiaolei Liu,; Yingcai Wu; Shouling Ji

arXiv:2402.03741·cs.LG·June 27, 2024·1 cites

SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems

Oubo Ma, Yuwen Pu, Linkang Du, Yang Dai, Ruo Wang, Xiaolei Liu,, Yingcai Wu, Shouling Ji

PDF

Open Access 1 Repo

TL;DR

This paper introduces SUB-PLAY, a novel black-box attack method that generates adversarial policies against partially observed multi-agent reinforcement learning systems, revealing security vulnerabilities and proposing defenses.

Contribution

The study demonstrates, for the first time, the ability to craft adversarial policies under partial observability in multi-agent settings using a new subgame approach.

Findings

01

SUB-PLAY effectively exploits partial observability limitations.

02

Adversarial policies significantly alter victim agents' network activations.

03

Proposed defenses show potential to mitigate security threats.

Abstract

Recent advancements in multi-agent reinforcement learning (MARL) have opened up vast application prospects, such as swarm control of drones, collaborative manipulation by robotic arms, and multi-target encirclement. However, potential security threats during the MARL deployment need more attention and thorough investigation. Recent research reveals that attackers can rapidly exploit the victim's vulnerabilities, generating adversarial policies that result in the failure of specific tasks. For instance, reducing the winning rate of a superhuman-level Go AI to around 20%. Existing studies predominantly focus on two-player competitive environments, assuming attackers possess complete global state observation. In this study, we unveil, for the first time, the capability of attackers to generate adversarial policies even when restricted to partial observations of the victims in multi-agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maoubo/sub-play
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsFocus