Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism
Junfei Zhou, Penglin Dai, Quanmin Wei, Bingyi Liu, Xiao Wu, Jianping Wang

TL;DR
This paper introduces GenComm, a novel generative communication mechanism enabling heterogeneous multi-agent perception collaboration without retraining core modules, significantly reducing computational costs and supporting scalable integration of new agents.
Contribution
The paper proposes a non-intrusive, generative approach for heterogeneous multi-agent perception that preserves semantic consistency and allows efficient addition of new agents with minimal computational overhead.
Findings
GenComm outperforms state-of-the-art methods on multiple datasets.
Achieves 81% reduction in computational cost and parameters for new agents.
Effectively maintains semantic and spatial alignment across heterogeneous agents.
Abstract
Multi-agent collaboration enhances the perception capabilities of individual agents through information sharing. However, in real-world applications, differences in sensors and models across heterogeneous agents inevitably lead to domain gaps during collaboration. Existing approaches based on adaptation and reconstruction fail to support pragmatic heterogeneous collaboration due to two key limitations: (1) Intrusive retraining of the encoder or core modules disrupts the established semantic consistency among agents; and (2) accommodating new agents incurs high computational costs, limiting scalability. To address these challenges, we present a novel Generative Communication mechanism (GenComm) that facilitates seamless perception across heterogeneous multi-agent systems through feature generation, without altering the original network, and employs lightweight numerical alignment of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Social Robot Interaction and HRI
