TACTIC: Task-Agnostic Contrastive pre-Training for Inter-Agent Communication
Peihong Yu, Manav Mishra, Syed Zaidi, Pratap Tokekar

TL;DR
TACTIC introduces a task-agnostic contrastive pre-training approach for inter-agent communication in multi-agent reinforcement learning, significantly improving generalization across different sight ranges and enhancing team coordination.
Contribution
It proposes a novel adaptive communication mechanism with contrastive learning that generalizes to unseen sight ranges, addressing a key challenge in cooperative MARL.
Findings
Outperforms state-of-the-art MARL methods on SMACv2 benchmark
Enhances coordination in scenarios with limited or extensive observability
Demonstrates strong generalization to unseen sight ranges
Abstract
The "sight range dilemma" in cooperative Multi-Agent Reinforcement Learning (MARL) presents a significant challenge: limited observability hinders team coordination, while extensive sight ranges lead to distracted attention and reduced performance. While communication can potentially address this issue, existing methods often struggle to generalize across different sight ranges, limiting their effectiveness. We propose TACTIC, Task-Agnostic Contrastive pre-Training strategy Inter-Agent Communication. TACTIC is an adaptive communication mechanism that enhances agent coordination even when the sight range during execution is vastly different from that during training. The communication mechanism encodes messages and integrates them with local observations, generating representations grounded in the global state using contrastive learning. By learning to generate and interpret messages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Robot Manipulation and Learning · Reinforcement Learning in Robotics
