What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception
Wanfang Su, Lixing Chen, Yang Bai, Xi Lin, Gaolei Li, Zhe Qu, Pan Zhou

TL;DR
This paper introduces CMiMC, a novel framework that enhances multi-agent perception by maximizing mutual information between individual and collaborative views, leading to improved accuracy and reduced communication costs.
Contribution
The paper proposes a new mutual information maximization framework for intermediate collaboration in multi-agent perception, with a multi-view contrastive learning approach for voxel-level feature fusion.
Findings
Improves state-of-the-art average precision by over 3% at 0.5 IoU.
Reduces communication volume to 1/32 of that of previous methods while achieving comparable performance.
Abstract
Multi-agent perception (MAP) allows autonomous systems to understand complex environments by interpreting data from multiple sources. This paper investigates intermediate collaboration for MAP, focusing on the "good" properties of the collaborative view (i.e., the post-collaboration feature) and its underlying relationship to the individual views (i.e., the pre-collaboration features), a relationship that most existing works treat as an opaque procedure. We propose a novel framework named CMiMC (Contrastive Mutual Information Maximization for Collaborative Perception) for intermediate collaboration. The core philosophy of CMiMC is to preserve the discriminative information of individual views in the collaborative view by maximizing mutual information between pre- and post-collaboration features, while enhancing the efficacy of collaborative views by minimizing the loss function of downstream…
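The abstract's central idea, maximizing mutual information between pre- and post-collaboration features with a contrastive objective, can be illustrated with a generic InfoNCE loss. The sketch below is not the paper's estimator: the function names `info_nce_loss` and `mi_lower_bound`, the cosine-similarity critic, the temperature value, and the (N, D) feature layout are all assumptions made for illustration. It treats row i of the individual-view features and row i of the collaborative-view features as a positive pair (the same voxel) and all other rows as negatives.

```python
import numpy as np

def info_nce_loss(individual, collaborative, temperature=0.1):
    """Generic InfoNCE loss between paired feature rows (illustrative only).

    individual, collaborative: arrays of shape (N, D). Row i of each array
    is assumed to describe the same voxel (a positive pair); every other
    row acts as a negative.
    """
    # L2-normalise rows so the dot product equals cosine similarity.
    a = individual / np.linalg.norm(individual, axis=1, keepdims=True)
    b = collaborative / np.linalg.norm(collaborative, axis=1, keepdims=True)
    logits = (a @ b.T) / temperature            # shape (N, N)
    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positive pairs sit on the diagonal.
    return float(-np.mean(np.diag(log_prob)))

def mi_lower_bound(loss, n):
    """InfoNCE bound: I(X; Y) >= log(N) - loss."""
    return float(np.log(n) - loss)

# Hypothetical data: 64 voxels with 16-dimensional features.
rng = np.random.default_rng(0)
x = rng.normal(size=(64, 16))                       # stand-in individual view
aligned = x + 0.05 * rng.normal(size=x.shape)       # fused view that keeps x's information
unrelated = rng.normal(size=x.shape)                # fused view that discards it

loss_aligned = info_nce_loss(x, aligned)
loss_unrelated = info_nce_loss(x, unrelated)
print("aligned:  ", loss_aligned, mi_lower_bound(loss_aligned, len(x)))
print("unrelated:", loss_unrelated, mi_lower_bound(loss_unrelated, len(x)))
```

Under these assumptions, a fused feature that preserves the individual view yields a lower loss and therefore a higher mutual-information lower bound than one that discards it, which is the property the framework is described as encouraging alongside the downstream task loss.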
Taxonomy
Topics: Visual Attention and Saliency Detection
Methods: Focus · Contrastive Learning
