Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning

Kyungbeom Kim; Seungwon Oh; Kyung-Joong Kim

arXiv:2602.06476·cs.MA·February 9, 2026

Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning

Kyungbeom Kim, Seungwon Oh, Kyung-Joong Kim

PDF

Open Access

TL;DR

Prism introduces a spectral domain parameter sharing method in multi-agent reinforcement learning that fosters diversity among agents while maintaining scalability and resource efficiency.

Contribution

The paper presents Prism, a novel spectral parameter sharing framework using SVD to induce diversity among agents without sacrificing scalability.

Findings

01

Achieves competitive performance on multiple benchmarks.

02

Maintains resource efficiency compared to existing methods.

03

Fosters inter-agent diversity effectively.

Abstract

Parameter sharing is a key strategy in multi-agent reinforcement learning (MARL) for improving scalability, yet conventional fully shared architectures often collapse into homogeneous behaviors. Recent methods introduce diversity through clustering, pruning, or masking, but typically compromise resource efficiency. We propose Prism, a parameter sharing framework that induces inter-agent diversity by representing shared networks in the spectral domain via singular value decomposition (SVD). All agents share the singular vector directions while learning distinct spectral masks on singular values. This mechanism encourages inter-agent diversity and preserves scalability. Extensive experiments on both homogeneous (LBF, SMACv2) and heterogeneous (MaMuJoCo) benchmarks show that Prism achieves competitive performance with superior resource efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Software-Defined Networks and 5G · Domain Adaptation and Few-Shot Learning