Reward-Free Attacks in Multi-Agent Reinforcement Learning

Ted Fujimoto; Timothy Doster; Adam Attarian; Jill; Brandenberger; Nathan Hodas

arXiv:2112.00940·cs.LG·December 3, 2021

Reward-Free Attacks in Multi-Agent Reinforcement Learning

Ted Fujimoto, Timothy Doster, Adam Attarian, Jill, Brandenberger, Nathan Hodas

PDF

Open Access

TL;DR

This paper explores reward-free attacks in multi-agent reinforcement learning, demonstrating that an attacker can strategically subvert a victim agent by maximizing policy entropy solely through observation, without access to reward information.

Contribution

It introduces a reward-free exploration algorithm that enables an attacker to maximize the victim's policy entropy, revealing a new vulnerability in multi-agent systems.

Findings

01

Attacker can subvert agents by maximizing policy entropy.

02

Reward-free attacks are feasible without reward access.

03

Policy entropy maximization effectively compromises victim agents.

Abstract

We investigate how effective an attacker can be when it only learns from its victim's actions, without access to the victim's reward. In this work, we are motivated by the scenario where the attacker wants to behave strategically when the victim's motivations are unknown. We argue that one heuristic approach an attacker can use is to maximize the entropy of the victim's policy. The policy is generally not obfuscated, which implies it may be extracted simply by passively observing the victim. We provide such a strategy in the form of a reward-free exploration algorithm that maximizes the attacker's entropy during the exploration phase, and then maximizes the victim's empirical entropy during the planning phase. In our experiments, the victim agents are subverted through policy entropy maximization, implying an attacker might not need access to the victim's reward to succeed. Hence,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Crime, Illicit Activities, and Governance