Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene
Ruiyang Zhang, Hu Zhang, Hang Yu, Zhedong Zheng

TL;DR
This paper introduces LiSe, a novel unsupervised 3D object detection method that combines LiDAR and 2D images with self-paced learning to improve detection accuracy in sparse, unstructured environments.
Contribution
We propose a new framework integrating LiDAR and 2D images with adaptive sampling and weak model aggregation for unsupervised 3D detection.
Findings
Achieved +7.1% AP$_{BEV}$ and +3.4% AP$_{3D}$ on nuScenes
Achieved +8.3% AP$_{BEV}$ and +7.4% AP$_{3D}$ on Lyft
Demonstrated significant improvements over existing methods.
Abstract
The unsupervised 3D object detection is to accurately detect objects in unstructured environments with no explicit supervisory signals. This task, given sparse LiDAR point clouds, often results in compromised performance for detecting distant or small objects due to the inherent sparsity and limited spatial resolution. In this paper, we are among the early attempts to integrate LiDAR data with 2D images for unsupervised 3D detection and introduce a new method, dubbed LiDAR-2D Self-paced Learning (LiSe). We argue that RGB images serve as a valuable complement to LiDAR data, offering precise 2D localization cues, particularly when scarce LiDAR points are available for certain objects. Considering the unique characteristics of both modalities, our framework devises a self-paced learning pipeline that incorporates adaptive sampling and weak model aggregation strategies. The adaptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
