Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning

Matteo Bettini; Ryan Kortvelesy; Amanda Prorok

arXiv:2405.15054 · cs.MA · May 27, 2024 · 2 cites

Open Access 1 Repo

TL;DR

This paper introduces Diversity Control (DiCo), a novel method for precisely controlling behavioral diversity in Multi-Agent Reinforcement Learning without altering the learning objective, improving performance and sample efficiency.

Contribution

DiCo enforces an exact diversity level through architectural constraints on the policy, can be applied to any actor-critic MARL algorithm, and is theoretically proven to achieve the desired diversity.

Findings

01. DiCo effectively controls diversity to a specified value.

02. Using DiCo improves performance in cooperative and competitive MARL tasks.

03. DiCo enhances sample efficiency in multi-agent learning environments.

Abstract

The study of behavioral diversity in Multi-Agent Reinforcement Learning (MARL) is a nascent yet promising field. In this context, the present work deals with the question of how to control the diversity of a multi-agent system. With no existing approaches to control diversity to a set value, current solutions focus on blindly promoting it via intrinsic rewards or additional loss functions, effectively changing the learning objective and lacking a principled measure for it. To address this, we introduce Diversity Control (DiCo), a method able to control diversity to an exact value of a given metric by representing policies as the sum of a parameter-shared component and dynamically scaled per-agent components. By applying constraints directly to the policy architecture, DiCo leaves the learning objective unchanged, enabling its applicability to any actor-critic MARL algorithm. We…
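
The architecture described in the abstract (a parameter-shared component plus dynamically scaled per-agent components) can be illustrated with a minimal sketch. Several things here are assumptions for illustration only and are not taken from the paper: linear maps stand in for neural networks, the diversity metric is taken to be the mean pairwise Euclidean distance between agents' outputs on the same observations, and all function names (`linear`, `mean_pairwise_distance`, `scaled_policies`) are hypothetical. Under this assumed metric the shared component cancels in every pairwise difference, so diversity is linear in the scaling factor and rescaling by `target / current` hits the target exactly.

```python
import itertools
import math
import random


def linear(weights, obs):
    """Apply a weight matrix (list of rows) to one observation vector."""
    return [sum(w * x for w, x in zip(row, obs)) for row in weights]


def mean_pairwise_distance(outputs):
    """Assumed diversity metric (not necessarily the paper's): Euclidean
    distance between two agents' outputs on the same observation, averaged
    over all agent pairs and observations.
    outputs[a][k] is agent a's output vector for observation k."""
    distances = [
        math.dist(u, v)
        for i, j in itertools.combinations(range(len(outputs)), 2)
        for u, v in zip(outputs[i], outputs[j])
    ]
    return sum(distances) / len(distances)


def scaled_policies(shared_w, agent_ws, observations, target):
    """Return per-agent actions: shared(o) + scale * per_agent_i(o).

    The scale is chosen so that the assumed metric equals `target`.
    """
    shared = [linear(shared_w, o) for o in observations]
    deviations = [[linear(w, o) for o in observations] for w in agent_ws]
    current = mean_pairwise_distance(deviations)
    if current == 0.0:
        raise ValueError("per-agent components are identical; cannot rescale")
    scale = target / current
    actions = [
        [[s + scale * d for s, d in zip(s_vec, d_vec)]
         for s_vec, d_vec in zip(shared, dev)]
        for dev in deviations
    ]
    return actions, scale


# Toy setup: random linear "networks" with a fixed seed (hypothetical sizes).
rng = random.Random(0)


def rand_matrix(rows, cols):
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


obs_dim, act_dim, n_agents = 4, 2, 3
shared_w = rand_matrix(act_dim, obs_dim)
agent_ws = [rand_matrix(act_dim, obs_dim) for _ in range(n_agents)]
observations = [[rng.uniform(-1.0, 1.0) for _ in range(obs_dim)]
                for _ in range(16)]

actions, scale = scaled_policies(shared_w, agent_ws, observations, target=0.5)
achieved = mean_pairwise_distance(actions)
print(f"scale={scale:.4f} achieved diversity={achieved:.6f}")
```

Because the constraint lives in the forward pass rather than in a reward or loss term, nothing about the training objective changes in this sketch. Setting `target=0.0` collapses every agent onto the shared component, which corresponds to fully homogeneous policies.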

Code & Models

Repositories

proroklab/controllingbehavioraldiversity (PyTorch, official)

Taxonomy

Topics: Innovation Diffusion and Forecasting · Reinforcement Learning in Robotics

Methods: Sparse Evolutionary Training · Focus