RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception
Lantao Li, Kang Yang, Wenqi Zhang, Xiaoxue Wang, Chen Sun

TL;DR
This paper introduces RG-Attn, a novel cross-modal fusion module for multi-agent cooperative perception in autonomous driving, enabling efficient, flexible, and accurate sensor data fusion across heterogeneous modalities and agent configurations.
Contribution
We propose RG-Attn, a lightweight, transformation-based attention module that unifies intra- and inter-agent fusion, supporting diverse sensor setups and improving perception accuracy in multi-agent systems.
Findings
Achieves state-of-the-art detection accuracy on multiple datasets.
Supports flexible sensor configurations including LiDAR and cameras.
Demonstrates high efficiency and robustness in real-world scenarios.
Abstract
Cooperative perception enhances autonomous driving by leveraging Vehicle-to-Everything (V2X) communication for multi-agent sensor fusion. However, most existing methods rely on single-modal data sharing, limiting fusion performance, particularly in heterogeneous sensor settings involving both LiDAR and cameras across vehicles and roadside units (RSUs). To address this, we propose Radian Glue Attention (RG-Attn), a lightweight and generalizable cross-modal fusion module that unifies intra-agent and inter-agent fusion via transformation-based coordinate alignment and a unified sampling/inversion strategy. RG-Attn efficiently aligns features through a radian-based attention constraint, operating column-wise on geometrically consistent regions to reduce overhead and preserve spatial coherence, thereby enabling accurate and robust fusion. Building upon RG-Attn, we propose three cooperative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Multimodal Machine Learning Applications
MethodsADaptive gradient method with the OPTimal convergence rate · Focus
