Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning
Kyungbeom Kim, Seungwon Oh, Kyung-Joong Kim

TL;DR
Prism introduces a spectral domain parameter sharing method in multi-agent reinforcement learning that fosters diversity among agents while maintaining scalability and resource efficiency.
Contribution
The paper presents Prism, a novel spectral parameter sharing framework using SVD to induce diversity among agents without sacrificing scalability.
Findings
Achieves competitive performance on multiple benchmarks.
Maintains resource efficiency compared to existing methods.
Fosters inter-agent diversity effectively.
Abstract
Parameter sharing is a key strategy in multi-agent reinforcement learning (MARL) for improving scalability, yet conventional fully shared architectures often collapse into homogeneous behaviors. Recent methods introduce diversity through clustering, pruning, or masking, but typically compromise resource efficiency. We propose Prism, a parameter sharing framework that induces inter-agent diversity by representing shared networks in the spectral domain via singular value decomposition (SVD). All agents share the singular vector directions while learning distinct spectral masks on singular values. This mechanism encourages inter-agent diversity and preserves scalability. Extensive experiments on both homogeneous (LBF, SMACv2) and heterogeneous (MaMuJoCo) benchmarks show that Prism achieves competitive performance with superior resource efficiency.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Software-Defined Networks and 5G · Domain Adaptation and Few-Shot Learning
