What Makes Good Collaborative Views? Contrastive Mutual Information   Maximization for Multi-Agent Perception

Wanfang Su; Lixing Chen; Yang Bai; Xi Lin; Gaolei Li; Zhe Qu; Pan Zhou

arXiv:2403.10068·cs.CV·March 18, 2024·1 cites

What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception

Wanfang Su, Lixing Chen, Yang Bai, Xi Lin, Gaolei Li, Zhe Qu, Pan Zhou

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces CMiMC, a novel framework that enhances multi-agent perception by maximizing mutual information between individual and collaborative views, leading to improved accuracy and reduced communication costs.

Contribution

The paper proposes a new mutual information maximization framework for intermediate collaboration in multi-agent perception, with a multi-view contrastive learning approach for voxel-level feature fusion.

Findings

01

Improves state-of-the-art average precision by over 3% at 0.5 IoU.

02

Reduces communication volume to 1/32 of previous methods.

03

Achieves comparable performance with significantly less data exchange.

Abstract

Multi-agent perception (MAP) allows autonomous systems to understand complex environments by interpreting data from multiple sources. This paper investigates intermediate collaboration for MAP with a specific focus on exploring "good" properties of collaborative view (i.e., post-collaboration feature) and its underlying relationship to individual views (i.e., pre-collaboration features), which were treated as an opaque procedure by most existing works. We propose a novel framework named CMiMC (Contrastive Mutual Information Maximization for Collaborative Perception) for intermediate collaboration. The core philosophy of CMiMC is to preserve discriminative information of individual views in the collaborative view by maximizing mutual information between pre- and post-collaboration features while enhancing the efficacy of collaborative views by minimizing the loss function of downstream…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

77swf/cmimc
pytorchOfficial

Videos

What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception· underline

Taxonomy

TopicsVisual Attention and Saliency Detection

MethodsFocus · Contrastive Learning