Graph-SND: Sparse Aggregation for Behavioral Diversity in Multi-Agent Reinforcement Learning
Shawn Ray

TL;DR
Graph-SND introduces a sparse, graph-based approach to measure behavioral diversity in multi-agent reinforcement learning, significantly reducing computational costs while maintaining accuracy.
Contribution
It replaces the quadratic complete-graph average with a flexible, sparse graph-based measure, enabling scalable and unbiased diversity estimation.
Findings
Graph-SND reduces per-call metric time by about 10x in large teams.
Empirical results show near-identical diversity measures with significantly fewer edges.
The method scales beyond complete graphs, enabling efficient diversity control in large multi-agent systems.
Abstract
System Neural Diversity (SND) measures behavioral heterogeneity in multi-agent reinforcement learning by averaging pairwise distances over all agent pairs, making each call quadratic in team size. We introduce Graph-SND, which replaces this complete-graph average with a weighted average over the edges of an arbitrary graph . Three regimes follow: recovers SND exactly; a fixed sparse defines a localized diversity measure at cost; and random edge samples yield an unbiased Horvitz-Thompson estimator and a normalized sample mean with concentration in the sampled edge count . For fixed sparse graphs we prove forwarding-index distortion bounds for expanders and a spectral refinement under low-rank distance structure; for random -regular graphs we prove an unconditional probabilistic …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
