TL;DR
This paper explores cooperative perception for 3D object detection in autonomous driving, using infrastructure sensors to improve detection accuracy by fusing data from multiple viewpoints, especially in occluded or challenging scenarios.
Contribution
It introduces two fusion schemes—early and late—for cooperative 3D detection with infrastructure sensors, and evaluates their performance in complex driving scenarios.
Findings
Early fusion outperforms late fusion significantly.
Cooperative perception recalls over 95% of objects in challenging scenarios.
Sensor configuration impacts detection performance.
Abstract
3D object detection is a common function within the perception system of an autonomous vehicle and outputs a list of 3D bounding boxes around objects of interest. Various 3D object detection methods have relied on fusion of different sensor modalities to overcome limitations of individual sensors. However, occlusion, limited field-of-view and low-point density of the sensor data cannot be reliably and cost-effectively addressed by multi-modal sensing from a single point of view. Alternatively, cooperative perception incorporates information from spatially diverse sensors distributed around the environment as a way to mitigate these limitations. This article proposes two schemes for cooperative 3D object detection using single modality sensors. The early fusion scheme combines point clouds from multiple spatially diverse sensing points of view before detection. In contrast, the late…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
