Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints

Alexi Canesse; Beno\^it Goupil; Jesse Read; Sonia Vanier

arXiv:2605.21085·cs.MA·May 21, 2026

Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints

Alexi Canesse, Beno\^it Goupil, Jesse Read, Sonia Vanier

PDF

1 Repo

TL;DR

This paper introduces a novel MARL architecture that decouples communication from policy, enabling robust coordination under bandwidth constraints with minimal performance loss.

Contribution

The paper proposes SLIM, a minimal architecture that separates communication from policy representation, and introduces a normalized bandwidth budget to improve multi-agent reinforcement learning under communication limits.

Findings

01

Achieves state-of-the-art performance on MARL benchmarks.

02

Maintains robustness with minimal performance degradation under bandwidth reduction.

03

Scales effectively with limited communication bandwidth.

Abstract

Communication enables coordination in multi-agent reinforcement learning (MARL), but many real-world applications, e.g., search-and-rescue with drone swarms, operate under severe bandwidth constraints. Many communication architectures still expose a coupled bottleneck in which a shared latent representation is used for both policy execution and inter-agent communication. Consequently, reducing message size directly limits the policy's latent space, often leading to significant performance degradation. We address this with two contributions. First, we introduce $β$ , a normalised per-agent bandwidth budget that unifies sparsity, rounds, and message dimension into a single comparable constraint. Second, we provide SLIM, a minimal architecture that decouples the communication pathway from the policy's latent representation, allowing us to isolate the effect of bandwidth from the effect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alexicanesse/Decoupling-Communication-from-Policy-Robust-MARL-under-Bandwidth-Constraints
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.