TL;DR
This paper introduces a simple, effective collaborative perception framework for multi-agent 3D object detection that outperforms existing methods in bandwidth efficiency and robustness to synchronization issues.
Contribution
Proposes a novel collaboration method that improves bandwidth-performance tradeoff with minimal modifications to existing models and relaxed synchronization assumptions.
Findings
Achieves 98% of early-collaboration performance
Consumes bandwidth equivalent to late-collaboration methods
Outperforms prior state-of-the-art in efficiency and robustness
Abstract
Occlusion is a major challenge for LiDAR-based object detection methods. This challenge becomes safety-critical in urban traffic where the ego vehicle must have reliable object detection to avoid collision while its field of view is severely reduced due to the obstruction posed by a large number of road users. Collaborative perception via Vehicle-to-Everything (V2X) communication, which leverages the diverse perspective thanks to the presence at multiple locations of connected agents to form a complete scene representation, is an appealing solution. State-of-the-art V2X methods resolve the performance-bandwidth tradeoff using a mid-collaboration approach where the Bird-Eye View images of point clouds are exchanged so that the bandwidth consumption is lower than communicating point clouds as in early collaboration, and the detection performance is higher than late collaboration, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
