Towards Consistent Object Detection via LiDAR-Camera Synergy
Kai Luo, Hao Wu, Kefu Yi, Kailun Yang, Wei Hao, Rongdong Hu

TL;DR
This paper presents an end-to-end framework for consistent object detection using LiDAR and camera data, improving accuracy and robustness in environmental perception for human-machine interaction.
Contribution
It introduces a novel end-to-end COD algorithm that detects objects in both point clouds and images simultaneously and establishes their correlation in a single inference.
Findings
Achieves high detection accuracy on KITTI and DAIR-V2X datasets.
Demonstrates robustness to calibration disturbances.
Provides a new evaluation metric, Consistency Precision (CP).
Abstract
As human-machine interaction continues to evolve, the capacity for environmental perception is becoming increasingly crucial. Integrating the two most common types of sensory data, images, and point clouds, can enhance detection accuracy. Currently, there is no existing model capable of detecting an object's position in both point clouds and images while also determining their corresponding relationship. This information is invaluable for human-machine interactions, offering new possibilities for their enhancement. In light of this, this paper introduces an end-to-end Consistency Object Detection (COD) algorithm framework that requires only a single forward inference to simultaneously obtain an object's position in both point clouds and images and establish their correlation. Furthermore, to assess the accuracy of the object correlation between point clouds and images, this paper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · CCD and CMOS Imaging Sensors · Advanced Neural Network Applications
MethodsSparse Evolutionary Training
