Monocular 3D Object Detection with LiDAR Guided Semi Supervised Active Learning
Aral Hekimoglu, Michael Schmidt, Alvaro Marcos-Ramiro

TL;DR
This paper introduces MonoLiG, a semi-supervised active learning framework for monocular 3D object detection that effectively utilizes LiDAR data for guiding data selection and training, improving accuracy and reducing labeling costs.
Contribution
The paper presents a novel framework that combines LiDAR guidance with semi-supervised active learning for monocular 3D detection, including a data noise-based weighting and sensor consistency-based sample selection.
Findings
Outperforms state-of-the-art active learning methods in labeling efficiency.
Achieves top results on KITTI 3D and BEV detection benchmarks.
Improves BEV AP by 2.02 on KITTI dataset.
Abstract
We propose a novel semi-supervised active learning (SSAL) framework for monocular 3D object detection with LiDAR guidance (MonoLiG), which leverages all modalities of collected data during model development. We utilize LiDAR to guide the data selection and training of monocular 3D detectors without introducing any overhead in the inference phase. During training, we leverage the LiDAR teacher, monocular student cross-modal framework from semi-supervised learning to distill information from unlabeled data as pseudo-labels. To handle the differences in sensor characteristics, we propose a data noise-based weighting mechanism to reduce the effect of propagating noise from LiDAR modality to monocular. For selecting which samples to label to improve the model performance, we propose a sensor consistency-based selection score that is also coherent with the training objective. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Monocular 3D Object Detection With LiDAR Guided Semi Supervised Active Learning· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
