System Neural Diversity: Measuring Behavioral Heterogeneity in Multi-Agent Learning
Matteo Bettini, Ajay Shankar, Amanda Prorok

TL;DR
This paper introduces System Neural Diversity (SND), a new metric for quantifying behavioral heterogeneity in multi-agent systems, enabling better understanding, control, and acceleration of multi-agent reinforcement learning.
Contribution
The paper proposes and validates a novel theoretical metric, SND, for measuring behavioral diversity in multi-agent systems, with applications in resilience, exploration, and policy optimization.
Findings
SND effectively measures behavioral heterogeneity across tasks.
It correlates with resilience skills in dynamic environments.
Using SND for control accelerates policy learning.
Abstract
Evolutionary science provides evidence that diversity confers resilience in natural systems. Yet, traditional multi-agent reinforcement learning techniques commonly enforce homogeneity to increase training sample efficiency. When a system of learning agents is not constrained to homogeneous policies, individuals may develop diverse behaviors, resulting in emergent complementarity that benefits the system. Despite this, there is a surprising lack of tools that quantify behavioral diversity. Such techniques would pave the way towards understanding the impact of diversity in collective artificial intelligence and enabling its control. In this paper, we introduce System Neural Diversity (SND): a measure of behavioral heterogeneity in multi-agent systems. We discuss and prove its theoretical properties, and compare it with alternate, state-of-the-art behavioral diversity metrics used in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Ecosystem dynamics and resilience
Methodsfail
