The Benefits of Power Regularization in Cooperative Reinforcement Learning

Michelle Li; Michael Dennis

arXiv:2406.11240·cs.LG·June 18, 2024

Open Access

TL;DR

This paper introduces power regularization for cooperative multi-agent reinforcement learning: agents trade off task reward against the concentration of power, which makes the system more robust to single-agent failure and adversarial attacks.

Contribution

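The paper proposes a practical pairwise measure of power and two algorithms, Sample Based Power Regularization (SBPR) and Power Regularization via Intrinsic Motivation (PRIM), that regularize power concentration in cooperative RL systems.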

Findings

01 Power regularization leads to more robust multi-agent systems.

02 Both algorithms balance task reward against power concentration.

03 Power-regularized agents suffer fewer catastrophic failures when a co-player goes off-policy.

Abstract

Cooperative Multi-Agent Reinforcement Learning (MARL) algorithms, trained only to optimize task reward, can lead to a concentration of power where the failure or adversarial intent of a single agent could decimate the reward of every agent in the system. In the context of teams of people, it is often useful to explicitly consider how power is distributed to ensure no person becomes a single point of failure. Here, we argue that explicitly regularizing the concentration of power in cooperative RL systems can result in systems which are more robust to single agent failure, adversarial attacks, and incentive changes of co-players. To this end, we define a practical pairwise measure of power that captures the ability of any co-player to influence the ego agent's reward, and then propose a power-regularized objective which balances task reward and power concentration. Given this new…
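As a rough illustration of the idea in the abstract, the sketch below works through a one-shot, two-player matrix game. It is not the paper's definition or either of its algorithms: the one-step worst-case form of the power measure, the names `pairwise_power` and `power_regularized_action`, the trade-off weight `lam`, and the reward table are all assumptions made for this example.

```python
import numpy as np


def pairwise_power(ego_reward, ego_action, coplayer_action):
    """Hypothetical one-step power of the co-player over the ego agent.

    Measured as the reward the ego agent would lose if the co-player
    switched from its on-policy action to the action that is worst
    for the ego agent.
    """
    on_policy = ego_reward[ego_action, coplayer_action]
    worst_case = ego_reward[ego_action].min()
    return on_policy - worst_case


def power_regularized_action(ego_reward, coplayer_action, lam):
    """Pick the ego action maximizing task reward minus lam * power."""
    scores = [
        ego_reward[a, coplayer_action]
        - lam * pairwise_power(ego_reward, a, coplayer_action)
        for a in range(ego_reward.shape[0])
    ]
    return int(np.argmax(scores))


# Illustrative reward table for the ego agent (assumed values).
# Rows are ego actions, columns are co-player actions.
ego_reward = np.array([
    [10.0, -20.0],  # action 0: rely on the co-player (high reward, exposed)
    [6.0, 5.0],     # action 1: self-sufficient (lower reward, little exposure)
])
coplayer_on_policy = 0  # the co-player's expected action

task_only = power_regularized_action(ego_reward, coplayer_on_policy, lam=0.0)
regularized = power_regularized_action(ego_reward, coplayer_on_policy, lam=0.5)
print(task_only, regularized)
```

With `lam=0.0` the ego agent picks the high-reward action that leaves it fully dependent on the co-player. With `lam=0.5` the penalty on exposure outweighs the extra task reward, so it picks the self-sufficient action instead.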

Taxonomy

Topics: Advanced Research in Systems and Signal Processing · Muscle activation and electromyography studies